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Five investigations, 
validate an earlier study 
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Fisher Junior College 
anp BARBARA BOWEN 
Regis College 


and 51 Es, were conducted to cross- 
(Rosenthal & Fode, 1963, Exp. 1) that had clearly 


demonstrated the Experimenter Bias Effect. Each of the five investigations 


failed to demonstrate that Es’ 


expectancy-biases jnfluence their results 


(overall F < 1.0). The five investigations are related to other studies in this 


area, the majority of which 


also failed to 


Bias Effect. It is concluded that the effect is more difficult to demonstrate 
than was implied in several recent reviews and that, at the present time, it is 
not known what preconditions are necessary to obtain it. 


A recent text (Rosenthal, 1966) delineated 
several ways that an experimenter might in- 
advertently influence the results of his re- 
search. These “experimenter effects” include 
an Experimenter Personal-Attributes Effect in 
which the characteristics or personality traits 
of the experimenter—for example, his sex, age, 
race, anxiety, need for approval, hostility, and 
dominance—influence his Ss’ performance. 
They also include an Experimenter Bias Effect 
in which Z’s expectancies, hypotheses, desires, 
or biases affect the results. For instance, an 
Experimenter Bias Effect would be present if 
Es obtain significantly higher scores from Ss 
on a standard test when they expect high 
rather than low scores. 

This paper is concerned only with the Ex- 
perimenter Bias Effect. There is little doubt 
that Es generally expect or desire to obtain 
different results from Ss assigned to different 


1 These investigations were supported by research 
grants (MH-07003 and MH-11521) from the Na- 
tional Institute of Mental Health, United States Pub- 
lic Health Service, to T. X. Barber. 

2The authors are grateful to Robert Rosenthal for 
supplying the photographs for the person-percep- 
tion task and to the following for criticizing the 
manuscript: Robert Rosenthal, Maurice J. Silver, 
Ralph L. Rosnow, Herbert J. Greenwald, Nicholas 
Spanos, and Dick Murray. Requests for reprints 
should be sent to Tieodore X. Barber, Medfield State 
Hospital, Medfield, Massachusetts 02042. 


experimental treatments and that they rarely 
exclude the possibility that their Ss’ responses 
are influenced by these expectancies and de- 
sires. If the Experimenter Bias Effect is per- 
vasive—if Es inadvertently influence their Ss 
to respond in such a way as to confirm their 
expectancies or desires—a substantial propor- 
tion of the “facts” of present-day psychology 
would be open to question and many if not 
most earlier psychological investigations would 
need to be rerun to determine if the results 
were due to Es’ biases. Furthermore, as Ros- 
enthal (1966, pp. 331-400) has emphasized, 
if the Experimenter Bias Effect has general 
validity, it would be necessary in further re- 
search either (a) to institute complex pro- 
cedures to obviate the effects of the experi- 
menters’ expectancies and desires or (b) to 
remove human beings from the conduct of 
experiments—replacing them by automated 
procedures. 

In all of the formal experimental studies 
that aimed to demonstrate the Experimenter 
Bias Effect, the Es were college students 
(Rosenthal, 1966). The criterion instruments 
in most of these investigations was 4 person- 
perception task in which S was shown a series 
of photographed faces by the student-Z and 
was asked to rate on a scale whether he 
judged that the persons depicted had been 
“experiencing failure” or had been “experienc- 
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ing success.” The rating scale employed ran 
from —10 (extreme failure) to +10 (extreme 
success) with intermediate labeled points. In 
investigations which clearly showed the Ex- 
perimenter Bias Effect (e.g, Rosenthal & 
Fode, 1963, Exp. 1), one group of student-Es 
was told, prior to the experiment, that their 
Ss were expected to give high ratings (averag- 
ing +5) on the person-perception task and an- 
other group of student-Es was told that their 
Ss would give low ratings (averaging —5). In 
addition, all student-Es were told that (a) 
the expected results had been “well estab- 
lished” in previous investigations, (b) they 
would conduct the experiment to obtain prac- 
tice in “duplicating” the previously established 
findings, (c) if their results came out “prop- 
erly—as expected” they would be paid $2.00 
per hour, and (d) if their results did not come 
out “properly” they would be paid $1.00 per 
hour (Rosenthal & Fode, 1963, p. 507). An 
important consideration is relevant here: The 
possibility was not excluded in these investi- 
gations that, in order to obtain more money 
or to make the results come out “properly— 
as expected,” some or many of the student-Es 
may have intentionally failed to conduct the 
experiment in the way they were told to carry 
it out. For instance, the student-Es may have 
(a) verbally reinforced their Ss (“good,” 
“fine,” “that’s excellent”) when they gave the 
“proper” ratings or (b) misreported their Ss’ 
ratings. 

Our first investigation was conducted with 
the above considerations in mind. When plan- 
ning the first investigation, the project-co- 
ordinator (TXB) hypothesized that the stu- 
dent-Æs would obtain high or low ratings 
from their Ss in line with their expectancies. 
The main purpose of the first investigation, 
however, was not simply to demonstrate the 
Experimenter Bias Effect but to determine 
whether the effect was due to the student-Es 
verbally reinforcing their Ss for “proper” rat- 
ings or misreporting their Ss’ ratings. To as- 
certain how the student-Es tested their Ss, we 
carefully concealed a tape recorder in the 
experimental room and we did not inform 
either the Es or the Ss that the recorder was 
present. However, when we (TXB and DSC) 
analyzed the results of the first investigation, 
we were surprised to find that the Experi- 


menter Bias Effect was not manifested: st 
dent-Zs given an expectancy for high (+5 
ratings, those given an expectancy for | 
(—5) ratings, and a control group of studeni 
Es not given expectancies did not differ i 
the ratings they obtained (F < 1.0). 

Even though the Experimenter Bias Effeq 
was not demonstrated in our first investi 
tion, we proceeded to listen to the tapes 
determine how the student-Es had condu 
the experiment. Unfortunately, the tape ri 
corder had been placed at too great a di 
tance from the Es and Ss and, since the taj 
were barely audible, we were unable to ana 
lyze them. However, our inability to analy; 
the verbal interactions between the student- 
and the Ss did not alter the major finding of 
the first investigation, namely, that the Ex: 
perimenter Bias Effect was not noticeabl 
present. 

Having failed to demonstrate the Experi- 
menter Bias Effect in our first investigation, 
we (TXB and DSC) then asked: Is this effect 
replicable? To answer this question, we set up 
four additional investigations. The principal- 
investigators who were asked to supervise the 
second, third, fourth, and fifth investigations | 
(AF, JDM, JFC, and BB) were not informed 
of our previous negative results. Investi gations 
2-5 were conducted in the same way as Inves- 
tigation 1 with the exception that tape record- 
ers were not concealed in the experimental 
rooms. 

We attempted to design each of the five 
investigations so as to replicate, as closely as 
Possible, the design of a previous study 
(Rosenthal & Fode, 1963, Exp. 1) that had 
clearly demonstrated the Experimenter Bias 
Effect. Our investigations, however, extended 
the Rosenthal and Fode study in that they | 
included an additional (control) group of 
experimenters who were not given expectan- 
cies. The method and procedures in each of 
our investigations were as follows. 


METHOD 


_ A total of 558 individuals participated in the five 
investigations: 1 project coordinator (TXB), 5 7 
Principal investigators, 51 student-Es, and 501 Ss. 


Principal Investigators 


Four of the five principal investigators (DSC, AF, 
JDM, and JFC) were teaching classes in introduc- 
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tory psychology at four college in Massachusetts. 
The fifth principal investigator (BB) was a third- 
year honors student majoring in sociology at a fifth 
college. 

‘After each principal investigator agreed to super- 
vise an investigation at his college he was told by 
the project coordinator that he was to (a) use a 
table of random numbers to select about nine experi- 
menters and about 90 Ss from the students available 
to him, (b) randomly administer one of three sets of 
expectancy instructions (+5, —5, or control expectancy 
instructions) to each of his student-Es, (c) adminis- 
ter the person-perception task to each student-E as 
a “warm-up,” and (d) arrange for each student-E 
to test about 10 randomly selected Ss on the person- 
perception task. Finally, each principal investigator 
was asked to state his own expectancies concerning 
the outcome of his investigation. 


Experimenters 


Four of the five principal investigators randomly 
selected the Es from the students in their intro- 
ductory psychology classes; the fifth principal-inves- 
tigator (BB) selected the Es at random from the list 
of freshmen and sophomores at her college. Investi- 
gation 4 included 15 student experimenters and the 
remaining four investigations each included nine stu- 
dent-Es. 


Subjects 


The Ss were freshmen and sophomores at the five 
colleges enrolled in introductory psychology courses. 
The number of Ss participating in Investigations 1 
to 5, respectively, were 81, 90, 90, 150, and 90. 


Procedure 


There were two parts to the procedure. In Part 1 
the principal investigators instructed the student-Es 
and in Part 2 the student-Es tested the Ss. 

Part 1. Each principal investigator read the instruc- 
tions that are presented verbatim by Rosenthal and 
Fode (1963, Appendix C) to those student-Es who 


8Qne of the five principal-investigators (BB) ap- 
parently had not heard about previous research con- 
cerned with the Experimenter Bias Effect and did 
not proffer any expectancies concerning the outcome 
of the investigation. The other four principal investi- 
gators, who were teaching courses in psychology, had 
previously heard about the research in experimenter 
pias. Three of these (DSC, JDM, and JFC) pre- 
dicted that the ratings would be highest under the 
+5 expectancy condition, next highest under the 
control expectancy condition, and lowest under the 
—5 expectancy condition. The remaining principal 
investigator (AF) predicted that the student Zs 
“would lean over backwards not to bias their sub- 
jects” and, consequently, student-Es expecting high 
ratings would obtain significantly lower ratings than 
those expecting low ratings. As will be seen when we 
present the results,ein no case were the expectancies 
of the principal investigators confirmed. 


had been randomly assigned to the high (+5) ex- 
pectancy condition and to the low (—5) expectancy 
condition. The instructions read to the student-Zs 
who were randomly assigned to the control expect- 
ancy condition differed in that nothing was said con- 
cerning “expected” or “proper” results. After the 
principal investigator read the (high, low, or control 
expectancy) jnstructions to each student-E, he asked 
each student-E to read the instructions to himself. 

In accordance with the procedures used by Rosen- 
thal and Fode (1963, Exp. 1), each student-E was 
next asked, as a “warm-up,” to rate the standardized 
set of photographs himself. 

Part 2. The heart of the experiment came next, 
when each student-E administered the person-percep- 
tion task to his Ss. The procedures for administering 
the task were identical with those of the earlier 
study (Rosenthal & Fode, 1963, Appendix B)4 


RESULTS 


Table 1 presents, separately for each of the 
five investigations, the mean ratings per pho- 
tograph obtained by each student-E from his 
Ss, the mean ratings obtained by student-Es 
assigned to the high (+5), control, and low 
(—5) expectancy conditions, and the F ratios 
yielded by one-way analyses of variance ap- 
plied to the latter means. The final row of 
Table 1 presents the overall mean ratings 
obtained under the high, control, and low 
expectancy conditions in the five investiga- 
tions combined. These results can be briefly 
summarized as follows: 

1. One-way analyses of variance performed 
separately on the results of each of the five 
investigations yielded F ratios for expectancy 
conditions which were in each instance either 
less than 1.0 or close to 1.0. These Fs, of 
course, are far from significant. A two-way 
analysis of variance (Expectancy Conditions 
X Replications) was performed upon the 
overall results of the five investigations 
(Winer, 1962, pp. 241-244). This analysis » 
(summarized in part in the final note of Table 
1) showed a far from significant effect for 
expectancy conditions (F < 1.0), a nonsig- 
nificant effect for replications (p = 18), and 


4In these investigations Es and Ss were under- 
graduates taking their first college course in psychol- 
ogy. Their texts and lectures in psychology had not 
referred to the Experimenter Bias Effect. In none of 
the five investigations was there any indication that 
the student-Es were aware that they themselves were 
the objects of investigation or that prior awareness 
of the experimenter bias phenomenon on the part of 
the student-Es or Ss influenced the results, 
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TABLE 1 


Mean RATINGS ON Person PERCEPTION TASK UNDER 
HicH, CONTROL, AND Low EXPECTANCY 


CONDITIONS 
T 
High e Low 
Investigation expect- expect- expect- 
ancy | an cy | ancy 
1. Worcester Junior Colleges| —.63 | -+.14 +.17 
+.16} —.03 | —.17 
+.99 | —.39 | +.87 
M +.17 | —.09 | +.29 
2. Boston University> +.12) +.79 | —.29 
—.26 | +.70 | +.12 
+.34 | —.29 | —.10 
M +.07 | +.40 | —.09 
3. Fisher Junior Colleges +.67| +.66 | +.34 
+.27| +.52 | +.80 
+.72| —.02 | +.73 
M +.55| +.39 | +.62 
4. Massachusetts Bay Com- | +.66 -00 | —.02 
munity Colleges —.24) —.28 | +.72 
+.87 | +.14 | —.37 
—.64 00 | —.61 
—.10 | —.33 | +.22 
M +.11| —.09 | —,01 
5. Regis College4 +.24] —.18 | +1.20 
+.28) —.25 | +.10 
+1.22} —.02 | —.06 
M +.58 | —.15 | +.41 
Overall mean of five +.27 | +.07 | +.21 
investigations® 


Note—In Investigation 1 each cell mean is based on nine 
Ss. In Investigations 2-5 

aF <ii idf = 2/6 
«14; df = 2/6. 


6; b > .25. 
6 for expectancy conditions. 


a far from significant interaction (F < 1.0). 
In brief, analyses of variance performed sepa- 
rately for each of the five investigations and 
for all five investigations taken in combina- 
tion consistently failed to show that Zs with 
different expectancy-biases obtained different 
ratings from their Ss, Nonparametric Krus- 
kal-Wallis tests yielded the same null results, 

2. None of the five investigations showed a 
trend in line with the Experimenter Bias Ef- 
fect; that is, no tendency was noticeable in 
any one of the five investigations for the mean 
ratings to be slightly higher under the +5 ex- 
pectancy condition, next highest under the 
control expectancy condition, and lowest un- 
der the —5 expectancy condition. In Investi- 
gations 1 and 3 the highest mean rating (in 
absolute terms) was obtained under the low 


(—5) expectancy condition. In Investigation 
2 the highest mean rating was obtained unda 
the control expectancy condition rather th 
under the high (+5) expectancy condition, fy 
Investigations 4 and 5 the lowest mean ratin 
was obtained under the control expectang 
condition rather than under the low expedi 
ancy condition. Taking all five investigati 
in combination, the overall mean ratin 
(.27, 07, and .21 under the high, control, ang 
low expectancy conditions, respectively) wer 
not consistently in the expected direction 
the mean under the control condition is lowel 
than the mean under the low expectancy com 
dition. 

In brief, these five investigations, takel 
separately and also in combination, failed to 
demonstrate the Experimenter Bias Effect: 


Discussion 


Each of our five investigations attempted 
to replicate the procedures used in an earlier 
experiment (Rosenthal & Fode, 1963, Exp. 1) 
that had clearly demonstrated the Experi 


5 We gave the raw data of these investigations to 
Robert Rosenthal for further analyses. He (Rosens) 
thal, 1967b) informed us that if the data are partis 
tioned in various ways and chi-square tests are pi 
formed, some of the chi-squares yield ps rangin 
from .16 to .10. However, even if we accept ps of 


of variance (which yielded an overall F smaller than 
1.0 with an associated p that was very far from sig- 


ratings of 2.0 or 
task. We cannot accept these tests as showing the 
Experimenter Bias Effect because only a very small 
Percentage of Ss (7% to 11%), who scored 2.0 or 
above, were arbitrarily selected for these analyses; if 
equally arbitrary Statistical tests had been per- 
formed on the data of Ss scoring 3.0 or higher, of 
below 2.0, the results would have been far from sig- 
nificant. The issue here, pertaining to the misleading 
conclusions that derive from arbitrary selection of 
data for postmortem analysis, are discussed in detail 
elsewhere (Barber & Silver, 1968b). 


THE EXPERIMENTER Bras EFFECT a 


menter Bias Effect. Although we attempted to 
follow the procedures of the earlier study, we 
failed to demonstrate the effect. The results of 
our investigations appear to indicate that the 
Experimenter Bias Effect may be more diffi- 
cult to demonstrate than was implied in recent 
reviews (Kintz, Delprato, Mettee, Persons, & 
Schappe, 1965; Rosenthal, 1963, 1964, 1966, 
1967a). This conclusion is supported by the 
following considerations. 

When we (TXB and DSC) began the five 
investigations, we had. received the impression 
from reading the review papers in this area 
that the Experimenter Bias Effect had been 
demonstrated in practically all studies that 
had been designed to show it. In fact, we 
assumed that the effect would be demon- 
strated in the first investigation and we aimed 
to ascertain how the effect would be mediated. 
After failing to show the effect in five investi- 
gations, we (Barber & Silver, 1968a) looked 
very closely at all of the extant investigations 
that had attempted to demonstrate that the 
expectancy-biases of student-Es influence their 
Ss’ responses. Contrary to our original as- 
sumption, we found that the majority of stud- 
ies in this area had failed to show the Experi- 
menter Bias Effect (Barber & Silver, 19682). 
After considering the matter further, we were 
able to delineate several interrelated reasons 
why we and other investigators had failed to 
demonstrate the effect. To show that expect- 
ancies of student-Hs significantly bias their 
Ss’ responses, it is necessary for expectancies 
to be transmitted from the principal investi- 
gator to the student-E and from the student-E 
to S. In fact, it appears that a complex eight- 
step transmission process is involved (cf. Mc- 
Guire, 1967): 

1. The student-Z must attend to the expect- 
ancy communication from the principal inves- 
tigator. 

2. The student-E must comprehend the 
expectancy communication. 

3. The student-Z must retain the communi- 
cation. 

4. The student-E (intentionally or uninten- 
tionally) must attempt to transmit the ex- 
pectancy to S. 

5. The S_ (‘consciously” or uncon- 
sciously”) must eattend to the expectancy 
communication from E. 


6. The S (“consciously” or “uncon- 
sciously”) must comprehend E’s expectancy. 

7, The S  (‘“consciously” or “uncon- 
sciously”) must retain E’s expectancy. 

8. The S (wittingly or unwittingly) must 
act upon (give responses in harmony with) 
E’s expectancy. 

In investigations that failed to demonstrate 
an Experimenter Bias Effect, the transmission 
process could have broken down at any of 
these eight links in the chain. There are many 
possibilities here: the student-E may have 
failed to comprehend or to retain the expect- 
ancy communication from the principal in- 
vestigator; the student-E may have compre- 
hended and retained the expectancy communi- 
cation but may have failed to transmit it to 
S; S may have comprehended E’s expectancy 
but may have failed to respond in the way 
expected; etc. It is apparent that systematic 
research is needed to determine under what 
circumstances the transmission of expectancies 
is most likely and also least likely to break 
down at each of the eight steps in the chain. 

Although we and other investigators have 
failed to demonstrate the Experimenter Bias 
Effect, it must be emphasized that the effect 
has been demonstrated in a number of studies 
(Barber & Silver, 1968a). At present it is 
unclear why some studies succeeded and oth- 
ers failed in this demonstration. However, 
Rosenthal (1967b) has pointed out to us 
that the contradictory results obtained in our 
investigations and in the Rosenthal and Fode 
(1963, Exp. 1) study may be due to subtle 
differences such as the following: in our in- 
vestigations Es and Ss were enrolled in the 
same introductory psychology course; in con- 
trast, in the Rosenthal and Fode study Hs 
were enrolled in an undergraduate experi- 
mental psychology laboratory whereas Ss were 
enrolled in an introductory psychology course 
and three of the 10 Es were first-year gradu- 
ate students. These subtle factors are related 
to the following differences between the two 
sets of studies that may have produced the 
contradictory results: (@) in our investiga- 
tions, but not in the Rosenthal and Fode 
study, Es were previously acquainted with al- 
most all of their Ss; (b) in our investigations 
Es and Ss were of equal status whereas in the 
Rosenthal and Fode study Es were of slightly 
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higher status, and (c) in the Rosenthal and 
Fode study Es were in a situation (an under- 
graduate experimental psychology laboratory) 
in which “widespread data fabrication” is 
common (Rosenthal, 1966, p. 35). Whether or 
not these factors are sufficient to account for 
the contradictory results of our investigations 
and of the Rosenthal and Fode study is not 
clear at the present time. Rosenthal (1966, 
Pp. 239-241) has presented data which sug- 
gest the hypothesis that the Experimenter Bias 
Effect is more readily obtained when male Es 
are acquainted with their Ss and when female 
Es are unacquainted with their Ss. With re- 
spect to the status of Hs vis-à-vis Ss, the data 
are also contradictory (Rosenthal, 1966, pp. 
241-245): One study (Vikan-Kline, 1962) 

indicated that high-status rather than low- 
status Es are more likely to bias their results 
whereas another study (Laszlo & Rosenthal, 
1967) indicated that low-status rather than 

high-status Es are more likely to bias. Finally, 

although student-Æs who are enrolled in an 

undergraduate experimental psychology lab- 

oratory may be prone to fudge their data, we 

have no direct evidence that the presence of 

this factor accounted for the positive results 

in the Rosenthal and Fode study or that its 

absence accounted for the negative results in 

our investigations, 

In conclusion, it appears that the Experi- 
menter Bias Effect is more difficult to dem- 
onstrate than was implied in several recent re- 
views (Rosenthal, 1963, 1964, 1966, 1967a) 
and further research is needed to determine 
exactly what preconditions are necessary to 
obtain the effect. 


Editor’s Note 


The following three discussions wer 


Editor because of the earlier article’s 


as Prepared at the invitation of the 


work reported elsewhere by Robert Rosenthal, 
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ON NOT SO REPLICATED EXPERIMENTS AND NOT SO NULL RESULTS * 


ROBERT ROSENTHAL ? 
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Recent research by Barber, Calverley, Forgione, McPeake, Chaves, and Bowen 
(1969) has been interpreted as a failure to replicate an experiment (Rosenthal 
& Fode, 1963) showing significant effects on Ss’ responses of Es’ expectancies. 
The purpose of this paper is to show (a) that the above research, while helpful 
in its own right, can in no sense be regarded a serious effort to replicate, (b) 
that, in any case, the integrity of the null hypothesis cannot be staunchly 


defended by an overall low F (N =51 


Es) in the face of an overall x? (N = 


501 Ss) with an associated p value of 007 and (c) that science is not often 


served best by employing the null-hypothesis decision procedure in such a way 


as to avoid looking closely at data. 


This paper constitutes a comment on and an 
addendum to the research by Barber, Calverley, 
Forgione, McPeake, Chaves, and Bowen (1969). 
Tt was the intent of these investigators to repli- 
cate the first of a long series of studies of the 
effects of the expectancy of E on his Ss’ re- 
sponses. In a series of five samples, one-third of 
the Es were led to expect from their Ss positive 
ratings of the success of persons pictured in photo- 
graphs. Another third of the Zs were led to expect 
negative ratings, and one-third of the Es were 
given no expectation. This last group was a very 
instructive addition and, as we shall see when we 
look more closely at the data, provides us with 
important new evidence. First, however, we must 
consider the proposition that useful though it 
was, the research described by Barber et al. can 
by no stretch of the imagination be regarded as 
a serious effort at replication in the usual sense. 


PROCEDURAL DIFFERENCES 


Some more general issues surrounding the prob- 
lem of replication in the behavioral sciences have 
been raised elsewhere (Rosenthal, 1966) and need 
not be repeated here. It js enough to say that psy- 
chological research js characterized not only by 
too few efforts to replicate but also by a very 
high proportion of failures to replicate. In most 
replication efforts Ss are different, often E is dif- 
ferent, the time of year, the locale, and the physi- 
cal scene are different. With all these built-in 
differences, careful replication requires at least 
that the procedures of the original research be 


1 Preparation of this paper was facilitated by 
Grant GS-1741 from the Division of Social Sciences 
of the National Science Foundation. 

2 Requests for reprints should be sent to Robert 
Rosenthal, Department of Social Relations, Harvard 
University, Cambridge, Massachusetts 02138. 


followed closely. In the following ways that was 
not done in the research by Barber et al. 

Type of subject. The original experiment em- 
ployed, as Ss, students enrolled in a coeduca- 
tional state university. Only 18% of the Ss of 
the “replication” were from a similar population. 
Another 18% were from a 4-year Catholic college 
for girls and the remaining 64% were from junior 
colleges. Ordinarily such sampling differences 
might be regarded as trivial. However, because of 
the casual method of sampling Es, to be discussed 
presently, these differences become quite critical. 

Subjects per experimenter. In the original ex- 
periment each Æ contacted an average of over 20 
Ss. In the “replication” each E contacted an 
average of less than 10. Such a difference ought 
not to affect the magnitude of the grand means 
but it should affect the reliability of the mean 
response obtained by each Æ and it was these 
mean responses that were used as the basic data 
by Barber et al. 

Sex of experimenter. In the original experiment 
most (80%) of the Es were males while in the 
“replicate” most (59%) of the experimenters 
were females. For some studies this difference 
might not be important but evidence has been 
presented (Rosenthal, 1966) that female Es, es- 
pecially when contacting male Ss, are less likely 
to show expectancy effects. Indeed, they have 
been shown sometimes to obtain data signifi- 
cantly opposite to that expected. 

Experimenter status. In the original study Es 
were at least advanced undergraduate majors in 
psychology, 30% were graduate students and all 
had laboratory course experience. The Ss in the 
original, as in the “replicate,” were beginning stu- 
dents. In all our research involving several hun- 
dred Es and thousands of Ss we have always been 
careful to preserve the differential in status and 
educational background between Es and Ss, and 
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that fact has been explicitly emphasized (Rosen- 
thal, 1966, p. 306). Not only the evidence from 
our own research program but the weight of the 
evidence from the general literature on social in- 
fluence suggests that higher status influencers are 
more likely to be successful (Rosenthal, 1966). 

In the present study Es were selected at ran- 
dom from the subject population. These Es, then, 
were classmates of their Ss, with not even the 
slight advantage of knowing something more 
about laboratories, psychology, research, or sci- 
ence than their Ss, Perhaps most important of all 
is the fact that their Ss knew this. It is possible 
that when an Æ is of very high status he can 
influence unintentionally an S of equally high 
status, There are no data on this point but high 
absolute status may be sufficient without a status 
differential. In the case of Es employed by Bar- 
ber et al., not only was there no status differen- 
tial between Æ and S but the absolute status of 
E was about as low as it could get for a collegiate 
population. Most of the Zs were beginning junior- 
college students. 

Tf the research by Barber et al, had shown no 
effects of Æ expectancy (a conclusion drawn by 
the authors but as we shall see, one that is not 
all so clearly warranted by the data) an earlier 
hypothesis (Rosenthal, 1966) might have been 
given further indirect support. That hypothesis, 
and there are data to support it, is that more 
competent, more professional, more experienced 
Es of higher status are more likely unintentionally 
to affect their Ss’ responses by virtue of their 
expectancy, 


Two Types or REPLICATION 


It may be useful at this point to introduce a 
terminological and conceptual distinction between 
two types of replication. 

Replications of tenability. Replication of the 
first kind, or replication of an experiment, is 
characterized by an attempt to reproduce as 
closely as possible all of the conditions and pro- 
cedures of an earlier experiment. It is a type of 
replication addressed to the specific tenability of 
a specific outcome of a specific experiment. Such 
replications help us to rule out the operation of 
unsuspected idiosyncrasies in the original experi- 
ment, idiosyncrasies that made the original result 
unlikely to recur, Such idiosyncrasies include 
such human factors as errors of observation, re- 
cording, computation, and report. They include 
also such nonhuman factors as undetected equip- 
ment failures and differences in the physical ar- 

rangements of the laboratory. Finally, they in- 
clude also such individual difference variables as 
the particular Z’s characteristics, the Particular 


sample of Ss standing in a given relationship | 
some important national or local event, etc, 

Quite clearly, it was this type of replicatig 
that Barber et al. intended to conduct, Thus, the 
wrote of their attempt “... to replicate, , 
closely as possible . . .” It is only in the sense | 
which they intended it, the sense of a replicatig 
of tenability, that their “replication” cannot } 
considered even a close approximation. That is m 
to say that their research is addressed to th 
wrong question by virtue of their altered pr 
cedures, but only that it is addressed to a differen 
one. 

Replications of generality. Replication of th 
second kind, or replication of a relationship, 
characterized by a purposeful effort to cha g 
some potentially important aspect of a pi 
ously conducted experiment. It is a type of rep 
lication that usually occurs after some replication 
of tenability has been conducted and it is a 
dressed to a testing of the limits of the domai 
over which the earlier found relationship obtains 
It would seem quite reasonable to view the rë 
search by Barber et al. as a replication of thi 
second type. 

Similarly, recent summaries of experiments, 
showing the effects of one person’s expectations 
about the behavior of another person are essen 
tially summaries of replications of generality. One) 
of these summaries is a reconsideration of over | 
30 studies considered by Barber and Silver 
(1968) in which the relevant interpersonal ex 
pectation is held by an Æ and is about his Ss 


On Looxinc More Crosety at DATA 


On the basis of their analysis of the data, Bar- 
ber ét al. concluded that they had failed to dem- 
onstrate the effect of Es’ expectancies. If that 
were clearly the case we should not be unduly 
surprised. Barber et al. give many good reasons 
in their paper why such “null” results might have 
been obtained. Indeed there have been a number 
of studies which quite clearly showed no effects 


3 Barber et al. cite the conclusion by Barber and 
Silver (1968) that the majority of the studies they 
selected for review failed to show effects of Es’ ex- 
Pectancies. It would take us too far afield to discuss 
that conclusion here, but it has ‘been discussed else- i 
where in some detail (Rosenthal, in press). | 
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of Æ expectancy. It would be an unusual area of 
behavioral research that turned up only p values 
of .05 or less. Our purpose here is only to give 
the reader some additional data to guide his judg- 
ment. 

There were 51 Es and 501 Ss employed in the 
research reported by Barber et al. They presented 
the results based on the 51 Es. Table 1 shows the 
results based on the 501 Ss. For this single over- 
all analysis of all Ss’ photo ratings, intervals of 
“50 were employed, any smaller interval being 
such as to reduce the expected frequencies below 
the rounded value of 5. The x? based on Table 1 
showed an associated two-tailed p less than .007, 
but a x? based on 13 intervals in three conditions 
js not so directly informative even when it seems 
clear that the entries are not randomly arrayed. 
We shall first want to see which of the three 
groups show large differences from one another.* 

The two groups in which we are most interested, 
since they were most like the groups employed in 
other such studies, are those in which Es were 
given positive (+5) and negative (—5) expecta- 
tions. That x? was 25.59, df= 12, p < .02, two- 
tailed. Inspection of Table 1 suggests that this 
result was due to the Es’ expecting positive photo 
ratings obtaining too many of the very high rat- 
ings but too few of the slightly high ratings. 
When we compare the ratings made by Ss con- 
tacted by Es expecting negative photo ratings 
with the ratings made by Ss contacted by the 
Es given no expectancy we find a x? of 29.72, df 
= 12, p < .004, two-tailed. Relative to Es given 
no expectations, Es led to expect low photo rat- 
ings obtained too few very high, too many slightly 
high, and too few negative ratings. The compari- 
son of photo ratings given by Ss whose Es had 
been led to expect high photo ratings with re- 
sponses of Ss of the control condition showed 
only a much smaller difference (x? = 16.05, df 
= 12, p<.20, two-tailed). The direction of dif- 
ferences was such that Es expecting high photo 
ratings, relative to the controls, obtained too 
many very high ratings and too few slightly high 
ratings. 

The overall pattern of results described, while 


41 want to thank T. X. Barber for making these 
data available. In an earlier communication to Barber 
et al. and cited by them, the overall x? was less 
significant because coarser grouping had been em- 
ployed (below —2 00 to above +2.00 advancing in 
intervals of 1.00). In that communication, a number 
of additional subanalyses were reported which were 
interpreted by Barber et al. as alternative analyses 
rather than simply as an attempt to check the con- 
sistency from sampfe to sample of the overall effect 
obtained, 


TABLE 1 
MEAN RATINGS OF PHOTOGRAPHS BY 
501 SUBJECTS 
Expectancy of experimenter 
Photo ratings |————,7 |. 
+5 | None | —5 pi 
-+3.00 and above 9 5 6 20 
+2.50 to +2.99 9 6 0 15 
+2.00 to +2.49 8 2 5 15 


+1.50 to +1.99 11 19 10 40 
+1.00 to +1.49 14 15 25 54 
+0.50 to +0.99 14 17 28 59 

0.00 to +0.49 30 20 29 79 
—0.50 to —0.01 21 26 21 68 
—1.00 to —0.51 18 
—1.50 to —1.01 16 11 12, 39 


—2.00 to —1.51 4 10 8 22 

—2.50 to —2.01 5 9 0 14 

—2.51 and below 8 5 9 22 
z 167 167 167 501* 


wx? = 44.93, df = 24, p < :007, two-tailed test. 


surprising and certainly quite unpredicted, never- 
theless does not make for a strong defense of the 
null hypothesis that Es’ expectations have no 
effect on their Ss’ ratings of photographs. The 
curvilinear nature of the effect obtained does help 
us to understand, however, why the analysis of 
variance, with its insensitivity to such curvi- 
linearity, would show no significant effect of E 
expectations. 

How might we account for the effects obtained? 
If sheer speculation were permitted, we might 
wonder whether our sample was made up of two 
types of Ss. One type may have behaved as did 
the Ss of the earlier research in which Ss re- 
sponded substantially in the direction of their E’s 
expectation. The other type of S may have re- 
belled slightly, giving Æ responses in a direction 
slightly opposite to that expected by Æ. On this 
point we must agree with Barber et al.; more 
research is needed. 


SCIENCE AND STATISTICAL PROCEDURES 


Barber et al. conducted a series of studies on 
E expectancy effects and performed overall analy- 
ses of variance which led to Fs too small for them 
to reject the null hypothesis. In spite of certain 
fairly apparent trends in the data which were 
pointed out to the authors, they felt it was sta- 
tistically unjustifiable to look more closely at 
their data. But that closer look shows that much 
was missed by their procedure though theirs ap- 
pears to be a defensible procedure and, in the 
eyes of some psychologists, the only “proper” 
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one. The null-hypothesis decision procedure has 
been discussed in detail elsewhere (Bakan, 1966; 
Rozeboom, 1960). The view of science that un- 
derlies many workers’ usage of this procedure 
seems to have the effect of protecting the investi- 
gator from a full view of his data. The null- 
hypothesis decision procedure is perhaps too often 
employed as an opaque screen placed over the po- 
tential for learning something new. 

The position of the present writer is that it is 
not from data that investigators require protec- 
tion. Some protection is required from prema- 
turely firm conclusions drawn from a set of data, 
but the protection of a low alpha is not a very 
secure safeguard. The only ultimate protection is 
replication. It is often said that what is found 
serendipitously in the data requires replication. 
That seems unquestionably true. The implied 
corollary, however, that a predicted finding re- 
quires less replication does not seem to follow. 
All findings require replication, those that are 
clear and predicted no less than those that are 
less clear and “only” discovered. Even this can- 
not be interpreted too narrowly, however, If the 
predictions are based on earlier empirical tests of 
the same prediction we can have increasing confi- 
dence as more supporting evidence is obtained 
(Rosenthal, in press). 

There is a frequently held, and implicitly so, 
assumption that relationships that are not immedi- 
ately obvious from the way in which an investi- 
gator happens to have arrayed his data are some- 
how less real. It is as though a close look at the 
data changed the data and made falsely “true” 


what was truly “false.” The analogy to such 
assumption is the case of the lookout in 
crow’s nest who samples the horizon for a view 
land. If, deviating from the visual sweep 
scribed by his captain, he should see some 
expected land, shall we doubt the lookout’s Visi 
because the captain’s orders did not include 
view? 
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Rosenthal (1969) presented four arguments aiming to show 


presented a postmo: 
Rosenthal’s analysis indicates 


interpreted as confirming the experimenter 
results in line with their expectancies. 


that the five experi- , 
McPeake, Chaves, and Bowen (1969) 
e- Rosenthal-Fode (1963) 


own to be invalid. Rosenthal’s critique also 


It is con 


Bias Effect is very difficult to demonstrate. 


Rosenthal’s (1969) critique of the five experi- 
McPeake, 
s two points: 
al. experiments did not closely 
dures used in the original study 


ments by Barber, Calverley, Forgione, 
Chaves, and Bowen (1969) make 
(a) The Barber et 
replicate the proce 
by Rosenthal and Fode (1963, Exp. 1). (b) 
though Barber et al. concluded that their data 


not show an Experimenter Bias Effect, a different 
yields a significant 
chi-square. These two points will be discussed in 


way of analyzing their data 
turn. 


DISCREPANCIES IN REPLICAT. 
FODE PROCEDURES 


Rosenthal’s critique lists four discrepancies 


between the Barber et al. experiments and 
original study, by Rosenthal and Fode, that 
to be replicated: 
Type of subjects. 
used Ss from a coe 
whereas the five Barber et al. experiments 
conducted in a coeducational nonstate 
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The Rosenthal-Fode study 
ducational state university 
were 
university, 


colleges or universities, he should present data 
to support the contention. 

Subjects per experimenter. In the Rosenthal- 
Fode study, each Æ contacted an average of 20 
Ss whereas in the Barber et al. experiments each 
E contacted about 10 Ss. The point made in 
Rosenthal’s critique, that these differences would 
affect “the reliability of the mean response ob- 
tained by each Æ,” seems to imply that the Bar- 
ber et al. experiments were less sensitive than the 
Rosenthal-Fode study to any Æ bias effects that 
were present. The implication is invalid. The 
overall parametric analysis that was performed in 
the Barber et al. investigation was based on 51 Es 
(testing 501 Ss) and was at least as sensitive to E 
bias effects as the parametric analysis performed 
in the Rosenthal-Fode study which was based on 
10 Es (testing 206 Ss). 

Sex of experimenter. Most of the Es in the 
Rosenthal-Fode study were males whereas most of 
the Es in the Barber et al. investigations were 
females. Rosenthal’s critique stated that this dif- 
ference is important because “evidence has been 
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thal that it is misleading. Although there is some evi- 


dence indicating that female Es, especially when 
testing male Ss, show reduced or negative ex- 
pectancy effects (Rosenthal, Persinger, Mulry, 
Vikan-Kline, & Grothe, 1964), there is also con- 
tradictory evidence. For instance, a study by 
Silverman (1966) showed that, as compared to 
other combinations of Z’s sex with S’s sex, female 
Es testing male Ss obtain results more in line with 
their expectancies (not less in line or opposite to 
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their expectancies, as Rosenthal’s critique con- 
tends). 

Let us look more closely at Rosenthal’s con- 
tention that the Barber et al. investigations in- 
cluded too many female Es and that this might 
have made a difference. Did it actually make a 
difference? That is, when the data of the Barber 
et al. studies are analyzed to determine the effects 
of Es’ sex, do we find that female and male Es 
obtained different ratings from their Ss? Not at 
all! Under each of the expectancy conditions, fe- 
male and male Es obtained practically identical 
ratings (all Fs pertaining to the effects of Es’ sex 
were less than 1.0). Did female Es testing male 
Ss bias their Ss less than other £-S combinations? 
Not at all! The relevant interaction (Es’ Sex Xx 
Ss’ Sex X Treatments) yields an F much smaller 
than 1.0, which can be expected to occur by 
chance around 50 times in 100, 

Experimenter status. In the Rosenthal-Fode 
study 70% of the Zs were undergraduates whereas 
in the Barber et al. investigations all of the Es 
were undergraduates. Also, the Zs in the Rosen- 
thal-Fode study were enrolled in an undergradu- 
ate experimental psychology laboratory whereas 
the Zs in the Barber et al. studies were enrolled 
in an introductory psychology course. Rosenthal’s 
critique pointed out, in the same way as the paper 
by Barber et al. had pointed out, that these dif- 
ferences give rise to a discrepancy in the relative 
status of the Zs; that is, the Es in the Rosenthal- 
Fode study tended to be of somewhat higher 
status than those in the Barber et al. studies. 
Rosenthal’s critique also stated that higher status 
Es are more likely to be successful in biasing 
their Ss’ responses, The latter statement, how- 
ever, is not clearly in line with Rosenthal’s own 
data (Rosenthal, 1966, pp. 241-245). Two studies 
have been conducted in Rosenthal’s laboratories 
to determine whether high-status or low-status 
Es exert more bias. As Rosenthal (1966, pp. 244- 
245) himself clearly pointed out in his recent 
text, one of these studies (Laszlo & Rosenthal, 
1967) showed unequivocally that low- rather 
than high-status Es exert more bias. The other 
study, a master’s thesis Supervised by Rosenthal 
(Vikan-Kline, 1962), appeared to indicate that 
high-status Zs are “more Successful at influencing 
their subjects to yield the desired data, but only 
among subjects contacted later in the experiment 
[Rosenthal, 1966, p. 242].” However, as Rosen- 
thal (1966, pp. 242-245) clearly pointed out in 
his recent text, in the latter study the Es’ status 
was confounded with several other variables— 
€g., age and self-assurance—and the results 
could not be clearly attributed to differences in 
status. It appears that further research is needed 


to pinpoint the effects of Zs’ status on the 
menter Bias Effect. 

Fortunately, the required “further resea 
that was mentioned in the preceding sentence hy 
been conducted. Very recently, Wessler 
Strauss (1968) presented a study which attempt 
to replicate the standard Rosenthal procedure 
and which was very similar to the Barber et al 
studies. However, the Wessler and Strauss study 
differed from the studies by Barber et al. in th 
all Es were college seniors and their Ss we 
freshmen or sophomores. Although the Zs in the 
Wessler and Strauss study were higher in statu 
than their Ss, the results were practically identi 
cal with those obtained by Barber et al. Wesslet 
and Strauss reported that Es expecting high 
(+5.0) ratings obtained mean ratings of —0.5) 
whereas those expecting low (—5.0) ratings obs 
tained mean ratings of —0.64. These differences 
are very far from significant (p > .50). 

In brief, it appears that (a) the statement 
found in Rosenthal’s critique, that high- rather) 
than low-status Zs exert more bias, is unequivor 
cally contradicted by one study from Rosenthal’ 
laboratory and is not clearly supported by an 
other, and (b) when an attempt is made to repli- 
cate the basic Rosenthal procedures using Zs who) 
are relatively high in status, results are obtained 
which are essentially the same as those obtained 
by Barber et al. 


Postmortem Data ANALYSES 


their expectancies. (Supplementary nonparametric 
analyses, which included a Kruskal-Wallis test 
and a chi-square test, were also performed on 
the mean ratings obtained by the Es. These analy- 
ses also failed to show an Experimenter Bias 
Effect.) 

Instead of accepting the nonsignificant results 
that are obtained when the data of Barber et al. 
are analyzed in the same way that Rosenthal and 
Fode analyzed their data, Rosenthal’s critique 
Proposes that the Barber et al. experiments should 
be analyzed by a quite different method, namely, 


EXPERIMENTER Bras EFFECT 


by a nonparametric chi-square test which is ap- 
plied to the individual Ss ratings and which does 
not take the individual Zs into consideration. Al- 
though there are several objections we could 
raise here, let us accept Rosenthal’s proposal to 
perform a chi-square test on the ratings of the 
501 Ss, What is the logical way to partition these 
ratings for a chi-square test? The most logical 
cut is at the median. When the ratings are thus 
logically dichotomized above and below the me- 
dian, we find that (a) more higher ratings are 
obtained when low ratings are expected (oppo- 
site to the results predicted by the Æ bias hy- 
pothesis) but (b) these differences are not sig- 
nificant (x? = 4.36, df =2, p > .10). 

Since the ratings given by the 501 Ss can be 
partitioned in many other ways, many additional 
chi-square tests can be performed. Of course, if 
we continue to carry out postmortem statistical 
analyses, we may expect by chance to find some 
“significant” results (Hays, 1963, Chap. 14). Ta- 
ble 1 in Rosenthal’s critique presented one of 
the many additional chi-square tests that can be 
performed on the Barber et al. data, Rosenthal’s 
overall chi-square test, which was performed on 
the Ss’ ratings partitioned into 13 discrete inter- 
vals, yielded a significant $, indicating that one 
or more of the 13 intervals deviated from chance. 
However, Rosenthal’s critique failed to make 
clear exactly where the deviations from chance 
expectations were localized in the array of data. 
To determine where the ratings deviated from 
chance, it is necessary to look at the component 
chi-squares that were computed for each of the 
13 rows in Rosenthal’s Table 1. These chi-squares 
and their associated p values, which Rosenthal’s 
critique failed to present, are listed in Table 1 in 
this paper, The table shows the following: 

1. Contrary to the E bias hypothesis, the Es’ 
expectancies did not significantly affect 11 of the 
13 sets of ratings (ratings of —2.51 and below, 
—2,00 to —1.51, —1.50 to —1.01, —1,00 to —.51, 
—.50 to —.01, .00 to 0.49, +.50 to +.99, +1.00 
to +1.49, +1.50 to +1.99, +2.00 to +2.49, and 
+3.00 and above). 

2. Contrary to the E bias hypothesis, a signifi- 
cantly smaller number of Ss tested by Es expect- 
ing Jow (—5.0) ratings gave relatively low rat- 
ings between —2.50 and —2.01. 

3, In comparison with Ss tested by Es expecting 
low (—5.0) ratings, a significantly larger number 
of Ss tested by Es with no expectancies and by 
Es with expectancies for high (+5.0) ratings 
gave relatively high ratings between +2.50 and 
+2.99. 

It needs to be “emphasized here that (a) the 
appropriate analysis of variance applied to the 
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TABLE 1 


RATINGS or PHOTOGRAPHS BY 501 SS WITH COMPONENT 
CHI-SQUARES AND p VALUES 


Expectancy of Æ 


Photo ratings 

+5.0 |None| —5.0| x? ? 

+3.00 and above 9 5 6 | 1.30] ns 
+2.50 to +2.99 9 6 0 | 8.40 | .02 
+2.00 to +2.49 8 2 5 | 3.60 | ns 
+1.50 to +1.99 11 19 10 | 3.65 | ns 
+1.00 to +1.49 14 | 15 25 | 4.11) ns 
+ .50 to + .99 14 | 17| 28 | 5.51] ns 
.00 to + -49 30 20 29 2.30 | ns 

— .50 to — 01 21 26 21 73 | ns 
—1.00 to — .51 18 | 22 14 | 1.78 | ns 
—1.50 to — 1.01 16 | 11 12 | 1.07 | ns 
—2.00 to — 1.51 4| 10 8 | 2.54| ns 
—2.50 to — 2.01 5 9 0 | 8.71 | .02 
—2,51 and below 8 5 9 | 118| ns 
Total 167 | 167 | 167 |449 | .007 


mean scores obtained by the 51 Es in the Barber 
et al. studies yielded nonsignificant results (F < 
1.0) and (b) various supplementary nonpara- 
metric tests (Kruskal-Wallis tests and chi-square 
tests) also yielded nonsignificant results. Conse- 
quently, the three sets of “findings” presented 
immediately above, which are due to an addi- 
tional postmortem chi-square test performed on 
an arbitrary partitioning of the Ss’ ratings into 13 
intervals, can be viewed, at best, as three tenta- 
tive propositions that require experimental vali- 
dation, Since these three tentative propositions 
are very specific and have very little if any gen- 
eral interest, it is doubtful that investigators will 
bother to test them. However, if anyone wishes 
to validate these tentative propositions, he might 
note the following: (a) The first two contradict 
the E bias hypothesis which states that Es obtain 
results in line with their expectancies. (b) The 
third tentative proposition is partly inconsistent 
with the Æ bias hypothesis (in that Es expecting 
high ratings and those with no expectancies did 
not differ significantly in influencing the relatively 
high ratings between +2.50 and +2.99). 
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REFLECTIONS ON REPLICATIONS AND THE EXPERIMENTER 
BIAS EFFECT 


LEON H. LEVY 1 
Indiana University 


The value of replications in psychology, the need to distinguish between the 
probability of data and of hypotheses, and the implications of alternative 
conceptualizations of the Experimenter Bias Effect (EBE), were each discussed 
in the light of the failure reported by Barber, Calverly, Forgione, McPeake, 
Chaves, and Bowen (1969) to replicate earlier findings by Rosenthal and 
Fode (1963). The position was taken that the value of further research on the 
EBE will depend upon the prior development of a situational taxonomy and a 
theoretical formulation in which the effect may be conceptualized in terms of 


a broader class of phenomena. 


It is axiomatic among novelists, particularly of 
the existentialist persuasion, that character is 
revealed in conflict and crises, and this seems no 
less true when we turn to questions of the nature 
of scientific inquiry and thought (Kuhn, 1964). 
Thus, it would be unfortunate were we to focus 
only on questions of whether the study by Bar- 
ber, Calverley, Forgione, McPeake, Chaves, and 
Bowen (1969) should be counted as a serious 
and legitimate failure to replicate the results of 
the Rosenthal and Fode (1963) experiment dem- 
onstrating an Experimenter Bias Effect (EBE), or 
whether this effect might be revealed by a more 
fine-grained, post hoc analysis of their data, and 
failed to take advantage of the opportunity af- 
forded by their study to reflect upon some of 
the broader and more fundamental issues which 
are raised by it. 

These issues are joined once the question is 
asked: What are the implications of the failure 
of Barber et al. to demonstrate an EBE in their 
several replications? The first concerns the value 
of replications; the second concerns the relation- 
ship between data and hypotheses; and the third 
concerns the implications of the alternative 
stances which might be taken toward the EBE. 
These are certainly not independent of each other, 
nor are they specific to the EBE, but they are 
especially relevant at this time since it is ap- 
parent that the EBE has become an increasingly 
popular topic and bids fair to rival anxiety as a 
vehicle for the production of theses and journal 
articles. And in psychology, as elsewhere, there 
seems to be an inverse relationship between the 
popularity of activity and the amount of thought 
given to its meaning or value. 
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The Value of Replications 


The perfect replication is a fiction, and I shall 
take the heretical position that this is just as 
well. Therefore, let us stipulate that the Barber 
et al. experiment differs from Rosenthal and 
Fode’s in several details. Some of these are listed 
by Barber et al. But does this necessarily dis- 
qualify their findings as evidence vis-à-vis those 
of Rosenthal and Fode concerning the EBE? For 
obvious reasons, no experiment can ever duplicate 
another in every detail, and so this question 
turns on whether the variations between them 
were trivial or important. Thus, we have the 
choice of either of two conclusions: (a) Barber et 
al. failed to establish one or more of the condi- 
tions necessary for the demonstration of the 
EBE; or (b) their procedural variations were 
trivial and the ÆBÆ is unreliable and/or not very 
pervasive. 

But this choice is an impossible one, for it 
requires either a theory which states the param- 
eters involved in the EBE or a body of systematic 
research from which these parameters might be 
induced. While it seems clear that the former 
condition does not exist in the case of the ZEBE, 
for reasons which we cannot go into in the space 
available, it might also be argued that the ex- 
tensive research which has been performed in the 
name of the EBE cannot be considered system- 
atic, Thus, whether the findings of Barber et al. 
can be taken as contradictory to those of Rosen- 
thal and Fode is a moot question, and this, I 
would suggest, will be found true wherever repli- 
cations are attempted of experiments dealing 
with phenomena which are not embedded either 
within some theoretical framework or extensive 
body of systematic research. Conversely, it might 
be argued that any dispute over whether an ex- 
periment qualifies as a replication of another may 
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be taken as prima facie evidence of the weakness 
of the conceptual and empirical underpinnings of 
the phenomenon in question. And so long as this 
is the case, it would seem that unless we are 
interested only in the existence of a phenomenon 
in a particular situation (which is rarely, if ever, 
the case in science, and certainly not so in the 
case of the EBE) we must ask whether replica- 
tions are worth the effort. Can we not employ 
our resources more profitably? 

These questions also merit consideration in 
instances where phenomena are better endowed 
conceptually and empirically than the EBEZ. For, 
unless there are serious reasons to doubt the 
validity of the data reported by an investigator, 
it would seem wiser to invest our time in explor- 
ing the generalizability of the theory concerning 
the phenomenon through the systematic variation 
of its parameters and experimental procedures in 
a manner analogous to that represented by the 
multitrait-multimethod matrix proposed by Camp- 
bell and Fiske (1959) for the study of construct 
validity. Such experiments would permit an evalu- 
ation of the reliability and pervasiveness of the 
phenomenon in question while at the same time 
extending our theoretical understanding of it. 
Since variation from one experiment is inevitable, 
why not make the most of it? Surely when prac- 
tice departs from precept as radically as it does 
in the matter of replications, so that an investiga- 
tion such as that of Barber et al. is the exception 
rather than the rule, it may be time to examine 
the wisdom of the precept rather than lament its 
neglect. And as a start I would suggest that we 
consider how much more information was gained 
by Barber et al. through their second replication, 
or their third, fourth, or fifth. Measured against 
information gained, it would seem that these five 
replications were at least four too many; con- 
cerning the relationship between replications and 
the acquisition of knowledge, they suggest the 
aptness of the aesthetic dictum: Jess is more. 


The Relationship between Data and Hypotheses 


The demonstration of the EBE in the Rosen- 
thal and Fode experiment and others might be 
likened to a wife’s discovery of a strange shade 
of lipstick on her husband’s shirt collar, Whether 
or not she had suspected him of infidelity in the 
past, it seems unlikely that any number of un- 
blemished shirt collars in the future will com- 
pletely remove her concerns about his conjugal 
constancy, Thus it seems unlikely that the five 
failures to reject the null hypothesis by Barber 
et al. will in itself convince many of the rarity 
of the EBE or that it need not be taken into 
account in the design and interpretation of ex- 
periments. Nor should it. Why this should be so 


becomes apparent when we recognize the distinc- 
tion, which is often ignored, between the prob- 
ability of data and of hypotheses, and the diffi- 
culties entailed in drawing inferences from sig- 
nificance tests and null hypotheses. Since there 
are several excellent discussions of these issues in 
the recent literature (Bakan, 1966; Edwards, 
Lindman, & Savage, 1963; Rozeboom, 1960), I 
shall do no more than point out that they are 
made most timely by the Barber et al. paper. 


The Implications of Alternative Views of the EBE 


The EBE might be regarded either as a meth- 
odological problem or as a member of a class of 
phenomena of intrinsic interest to psychology. In 
the former case, it takes its place along with 
response biases and demand characteristics as, 
what we might call, a “spoiler variable,” in that it 
spoils the image of the rigor and purity of psy- 
chology’s data collection procedures and com- 
plicates the design and interpretation of research 
by requiring that an “irrelevant” dimension—the 
E—be taken into consideration. Regarded as an 
exemplar of a class of phenomena such as inter- 
personal influence or the effects of expectancies 
and values on perception and behavior, the ZBE 
might be considered a convenient vehicle for the 
investigation of the processes involved in these 
phenomena and as a medium for testing hypothe- 
ses and theories about them. Each of these 
stances, however, entails a different set of con- 
siderations and different order of priorities so 
far as research and development are concerned. 

Regarding the EBE as a methodological prob- 
lem leads to considerations of its pervasiveness, 
how much of the variance in experiments might 
be attributed to it, and how it might be controlled 
or eliminated. Although Rosenthal (1966) has 
suggested a number of means by which this last 
concern might be met, the decision to incur the 
additional costs involved in adopting them can- 
not be made independently of the first two con- 
siderations. However, the information which 
would be relevant to these prior considerations is 
dependent, in turn, upon the existence of a tax- 
onomy of situations (tasks or experiments). For 
without such a taxonomy and the sampling of 
situations it would permit, it is impossible to 
make any generalizations about the seriousness of 
the EBE as an epistemological threat to psychol- 
ogy on the basis of successes or failures in demon- 
strating it in particular instances. But although 
the value of such a taxonomy has long been 
recognized, and extends far beyond the problem 
of EBEs (Rotter, 1960), one, does not presently 
exist. Thus we must ask whether it would not be 
wiser to give precedence to the long overdue de- 
velopment of this taxonomy rather than to addi- 
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tional demonstrations of the EBE, however 
startling and impressive these might be. Cer- 
tainly, as matters stand at present, it should be 
obvious that there is no way of assessing the 
validity of any assertions about the generality or 
seriousness of the EBE as a methodological prob- 
lem. 

While the perverse gratifications which might 
derive from demonstrations of the ÆBÆ in its 
role as spoiler variable are not to be minimized, 
and regardless of its importance in this role, by 
far the more rewarding scientific stance toward 
the EBE is likely to be as an exemplar of some 
theoretically important class of psychological 
phenomena. The low epistemic yield of the five 
experiments reported by Barber et al. clearly 
reveals the wastefulness of research conducted on 
phenomena which are devoid of any theoretical 
conceptualization, For had the EBE been placed 
in some theoretical context, these five failures to 
demonstrate it might have been turned to good 
account in evaluating one or more theoretical 
propositions concerning its nature and the proc- 
esses involved in it, or in deciding on directions 
for future research. It should be noted also, that 
if the EBE is conceived of as an exemplar of 
some class of phenomena, it becomes irrelevant 
whether it is pervasive or rare; the EBE could 
be exotic and still of importance #f the class of 
phenomena were important and if it was of value 
in furthering our understanding of these phenom- 
ena. 

These ifs point to the kinds of questions which 
must be asked if the EBE is to be taken seriously 
from a theoretical standpoint. They require, again, 
a better conceptualization of the situations in 
which the EBE is manifested than presently 
exists. And, paradoxically, they also require a 
high degree of development in the theory and 
knowledge concerning the phenomena allegedly 
exemplified by the EBE and which the EBE is 
intended to further illuminate. For, as Bordin 
(1965) has pointed out in a different context, it 
is only as a phenomenon is more fully understood 
that it can safely be studied in the laboratory by 
means of simplifications and analogues without 
the fear that these may be irrelevant to it. This 
means, at a minimum, that if the EBE is to be 
regarded as a vehicle for the study of theoreti- 
cally important processes and phenomena, such as 
unintended covert communication, as Rosenthal 
(1967a) has recently proposed, it cannot be the 
sole means of doing so. It also means that hy- 
potheses based upon EBE-type experiments must 
be tested for their generality in other experiments 
involving the phenomena in question, and vice 
versa. 

Rosenthal (1967b) has observed that as re- 


sponse variance diminishes so does variance in 
the EBE (p. 621). If this is the case, since two 
of the most effective ways of reducing (un- 
wanted) response variance are through better ex- 
perimental controls and more valid and compre- 
hensive theories, it is appropriate to ask whether 
these should not be our first order of business, 
regardless of the stance taken toward the EBE, 
For if our reasoning is correct, this shift of em- 
phasis away from the EBE to more fundamental 
methodological and substantive concerns should 
have the interesting consequences of reducing the 
EBE as a methodological threat while at the same 
time helping us to understand and use it more 
wisely in the furtherance of psychological knowl- 
edge. 
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WHAT PATHOLOGICAL DIMENSIONS LIE BEYOND THE 
NORMAL DIMENSIONS OF THE 16 PF? 
A COMPARISON OF MMPI AND 16 PF FACTOR DOMAINS* 


RAYMOND B. CATTELL 
University of Ilinois 
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LAURA SPECHT BOLTON? 
Stanford University 


A set of mutually exclusive, exhaustive parcels of MMPI items and the 16 
personality factor scales from the Cattell 16 PF were factored together on 217 
normal and 40 abnormal adult Air Force men. The analysis of the resulting 64 
variables (45 MMPI parcels and 19 16 PF scales) yielded 21 factors, of which 
13 are clearly identifiable as factors in the 16 PF and 4 are clearly MMPI 
factors. MMPI data were scored also for the traditional 14 clinical scales, and 
their predictability from 16 PF source trials examined in terms of the squared 
multiple correlations. The 7-factor space typically represented by the MMPI 
clinical scales was found to lie within the dimensionality of the Cattell 16 PF. 
Furthermore, the total prediction of the set of MMPI clinical scales from a 


single form (A) of the 16 PF, uncorrected for attenuation, is represented by the 


canonical correlation of .8. 


The 16 PF claims, by the operational work- 
ing out of the personality sphere concept 
(Cattell, 1946, 1957) in the basic research 
underlying it, to represent the main dimen- 
sions of the normal personality. There are, it 
is true, some seven further factors (Cattell, 
1957) of small variance and rather unstable 
reappearance which research needs to pursue, 
but with one exception these also appear 
“normal.” 

What the clinician recognizes as abnormal- 
ity can be either a statistically deviant score 
on a normal factor—or, alternatively, a score 
on some factor the very existence of which is 
demonstrable only in pathological manifesta- 
tions, as is recognized on some MMPI scales. 

To ask how much of the MMPI response 
material is pathological in this second sense 
requires that we first ask how much of the 
variance in such behavior can be accounted 
for by deviation on already known normal 


1 This research was supported in part by Public 
Health Research Grant MH 1733-09 from the Na- 
tional Institute of Mental Health and by the Na- 
tional Science Foundation through an undergraduate 
fellowship in support of the junior author. The writ- 
ers wish to express their indebtedness to John Nes- 
selroade and Jack Ford for extensive computer 
assistance, to J. Wiggins and L. Triandis for valua- 
ble comment, and to the chief psychiatrist and the 
administration of Chanute Air Force Base for pro- 
viding Ss. 

2 Now at 155 Rose Street, Metuchen, New Jersey 
08840. 
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factors. A joint factoring of the MMPI and 
16 PF has therefore long been overdue, as 
many practicing psychologists have pointed 
out. The research here reported has as its 
immediate objective such a comparison of the 
factor space of the MMPI and the 16 PF. To 
appreciate all features of its design, however, 
one must recognize that it is also part of a 
series of researches (Cattell & Bjersted, 1967)* 
to provide the theoretical basis for the Clini- 
cal Analysis Questionnaire to be added to the 
16 PF. An instrument thus adding entirely 
pathological to normal factors would offer the 
clinician a means of handling diagnosis as 
well as the general personality of the indi- 
vidual. 

In designing the experiment for the com- 
parison, one must recognize that the MMPI 
was designed to differentiate among persons 
exhibiting various clinically defined surface 
traits (i.e., syndromes, formally and psycho- 
metrically correlation clusters) while the 16 
PF was designed to measure source traits 
(replicable simple structure factors). Ac- 
tually, there appears still to be no published 
basic research to show that the MMPI scales 
are constituted by discrete correlation clusters 
and the conclusion of correlational investiga- 


2R. B. Cattell and K. Delhees. Personality factors 
in the pathological domain, an@ the construction of 
the 16 PF Clinical Analysis Questionnaire, in prepa- 
ration. 
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tions in this area (e.g., Wiggins, 1962), has 
indicated that putting together items which 
correlate adequately with possession of a 
designated clinical syndrome does not itself 
guarantee adequate homogeneity in such a 
scale, For this reason, and because of obvious 
technical factor-analytic blunders, discussed 
below, no published factor analysis of the 
MMPI has yet supplied the answer to the 
question we now raise about its primary fac- 
tor and cluster structure. 

Welsh (1952) performed a centroid analy- 
sis of shortened nonoverlapping versions of 
the MMPI clinical scales. However, this study 
sampled less than one-third of the total 
MMPI item space. Comrey (1957a, 1957b, 
1957c; 1958a, 1958b, 1958c, 1958d, 1958e, 
1958f, 1958g) performed extensive factor 
analyses of individual items within given 
MMPI conventional scales, with cross-cul- 
tural validation (Comrey & Nencini, 1961), 
rotating to a varimax solution. However, none 
of these investigations covered more than 12% 
of the total MMPI space. Other investiga- 
tions, such as those of Wiggins (1966) and 
Mees, Gocka, and Holloway (1964) have ex- 
amined MMPI factor structure with respect 
to certain item characteristics, such as content 
and social desirability. 

With the above-mentioned exceptions, in- 
vestigators have factored the scores on con- 
ventional or overlapping scales, typically ob- 
taining some agreement on only five or fewer 
factors. Practically none have met the ordi- 
nary technical demands for tests of the num- 
ber of factors, proof of rotation to maximal 
simple structure, objective testing of alleged 
matches with other studies, and especially the 
avoidance of the fatal weakness, discussed in 
detail by Shure and Rogers (1965), of having 
algebraic dependencies, through item overlap, 
in the scales factored. (Among others, see 
Finney, 1966; Kassebaum, Couch, & Slater, 
1959; Lingoes, 1960; La Forge, 1962; Tyler, 
1951; and Wheeler, Little, & Lehner, 1951). 

The 566 items collected by the construc- 
tors of the MMPI probably represent as 
comprehensive a sampling of pathological 
behavior as the 564 in the 16 PF do of normal 
behavior. At any rate, we would accept them 
as the best samplfng of the domain yet avail- 
able, But to reveal the real dimensionality of 


the MMPI domain it is necessary to forget 
the present cluster scales, possibly arbitrary, 
and factor the full roster of items themselves. 
In effect this is what the present research, in 
conjunction with a coordinated study by Eber 
and Fowler, sets out to accomplish. 

Incidentally, in undertaking this factor 
analysis of parcels without regard to the 
structure imposed by the MMPI test con- 
structors in what we may, for brevity, call 
the “conventional cluster scales” (including 
the various additional scales added by other 
clinicians), it is not implied that measuring 
patients by syndromes (as technically defined 
by surface trait, cluster search operations 
Cattell, 1946, 1957; Cattell & Coulter, 1966) 
is unnecessary. On the contrary, in psychi- 
atric literature, Cattell (1966a, 1967) has 
advocated the special value of depth psy- 
chometry, that is, a planned simultaneous 
measurement of surface traits, for example, 
by the MMPI, and of source traits, for 
example, by the 16 PF, MAT, etc. If we 
are dealing with a surface-trait and a source- 
trait instrument, both occupying the same 
factor space, then the depth psychometry 
technique would give the clinician the advan- 
tage of understanding two patients with iden- 
tical scores on a given syndrome scale in the 
light of the different source-trait structure 
underlying each. This is a first step from 
descriptive to causal analysis. 

To explore and illustrate the possibilities of 
depth psychometry while investigating the pri- 
mary factor structure, two distinct experi- 
ments and factorings of MMPI-16 PF data 
have been undertaken here. The first takes 
the MMPI data as conventionally scale-scored 
and relates it to 16 PF source traits while the 
second partitions the MMPI, regardless of 
conventional scoring, to find what is contained 
in a set of mutually exclusive, exhaustive 
parcels from it. 


ExprrrMent I: MMPI SuRFACE-TRAIT AND 
16 PF Source-Trair RELATIONS 


For the proper determination of the factor 
structure, it was considered desirable, in ad- 
ministering the two scales together, not to use 


4H. W. Eber, Alton State Hospital, Alton, Illi- 
nois, personal communication, April 1966, 
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the usual student groups, but to find a more 
representative section of the population, and 
one, moreover, containing a reasonable per- 
centage of “clinical” cases. Accordingly, the 
MMPI and the 16 PF, the latter as Form A, 
were administered to 259 Air Force men, in- 
cluding, for “pathological” variance, about 40 
cases listed by the psychiatrists as being in 
some need of psychiatric help. 

The correlations between the two sets of 
scales—MMPI and 16 PF—are shown cor- 
rected for attenuation in Table 1. In order to 
obtain the closest statement of the real rela- 
tions of the scales’ concepts per se, it is nec- 
essary, of course (since we took only a very 
brief 10-item score of most 16 PF scales, in 
using Form A only), to correct for attenua- 
tion. The uncorrected correlations are avail- 
able from the National Auxiliary Publications 
Service® (see Table A) which also presents 
the correlation matrices among (a) the MMPI 
scales alone and (b) the 16 PF scales alone. 

A factor analysis of MMPI intercorrela- 
tions alone, which we shall not present here 
because we are not interested further in a 
priori MMPI scale scores as such, shows by 


5 For supplementary material, order NAPS Docu- 
ment NAPS-00121 from ASIS National Auxiliary 
Publications Service, c/o CCM Information Sciences, 
Inc., 22 West 34th Street, New York, New York 
10001; remitting $1.00 for microfiche or $3.00 for 
photocopies. 
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the scree test (Cattell, 1966c) that they con- 
tain either seven or eight factors. This finding 
is compatible with earlier studies (e.g., La 
Forge, 1962; Messick & Jackson, 1961; and 
others) which centered on five or six but 
typically did not iterate communalities, apply 
recent tests (or any true tests) for the number 
of factors, or use scales cleared of algebraic 
dependence. However, even if we accept, as a 
compromise, that the number is seven, we can 
see why it is impossible for some of the 16 
PF factors, for example, A, B (intelligence), 
L, M, Qı, and Qz to have more than trivial 
or insignificant correlations (Table 1) with 
these surface traits. The simplest ways to 
give expression to the amount of overlap of 
the two systems are (a) to ask how much of 
the variance of the scales of one can be pre- 
dicted from the other, that is, by obtaining a 
series of squared multiple Rs in both direc- 
tions, or (b) to calculate the canonical corre- 
lation. 

For most purposes a practitioner will be 
more interested in the first of these, and in the 
direction of asking how much of the MMPI 
surface-trait scores he can predict from the 16 
PF scale scores. These multiple correlations 
were corrected for attenuation to ameliorate 
the unreliabilities of both sets of scales, as 
reported in the respective handbooks of the 
16 PF and the MMPI. Table 2 is obtained by 
taking the corrected correlations of Table 1, 
converting them into weights (for estimating 


TABLE 1 
CORRELATIONS BETWEEN MMPI anp 16 PF Scares (CORRECTED FOR ATTENUATION ON BOTH) 


Scale als efz[rlc]z|r[z][ulwfola Q: a |a 
A -13 —26 -40 17 10 -31 —26 06 2 10 —35 63 —18 04 —52 62 
D =07 O1 29 —25 -02 14 01 —20 -21 03 19 —22 —06 -03 19 —13 
F —07 —38 —41 21 —15 —68 —10 03 23 19 -49 32 20 20 -62 39 
K —04 11 42 —27 02 26 09 —23 -36 -13 35 —43 —02 -06 46 —36 
Hs 01 —33 —45 13 —04 —46 —09 —01 24 29 —39 52 -10 12 -48 48 
D —13 —10 —23 —33 -27 -15 —25 00 12 16 -07 2 —20 21 -18 24 
Hy 01 30 20 —29 -04 01 03 —05 -13 23 11 —14 —07 -07 08 —05 
Pd —07 —12 —14 —02 01 -37 —05 -08 16 27 -04 24 —11 07 —30 36 
Mf 05 10 08 —33 -14 11 —05 14 -26 17 05—10 03 03 22 —06 
Pa —08 —24 —15 —08 —11 -33 —01 —05 08 20 —18 12 01 —04 —33 21 
Pt -12 —37 -64 02 -15 -47 -29 10 45 25 -29 68 —2 09 —74 71 
Sc -01 —47 —53 16 -09 -59 -13 12 35 23 —51 54 02 10 —72 56 
Ma -01 -15 19 15 2-25 13 -11 14 11 14 —09 -12 -14 12 
Si —33 —10 —17 —49 —42 —09 —44 -13 04 07 —04 26 -io 4&8 14 24 
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TABLE 2 


PREDICTIONS (MULTIPLE CORRELATIONS) OF MMPI 
SurracE Tratr SCALES FROM 16 PF Source Trait 
Scares (WiTH VARIOUS ATTENUATION 
CORRECTIONS) 


Corrected | Corrected 
for MMPI| for 16 PF | C9 


Scale scale unre-| scale unre-| „tO ideal | Uncor- 
liabilities | liabilities | Scales,for | rected 
only only 
Anxiety 49 68 58 
Lie 27 35 40 32 
Validity 46 58 73 51 
K Correction 37 49 55 43 
Hypochon- 

driasis 42 55 62 49 
Depression 32 36 47 32 
Hysteria 28 29 43 26 
Psychopathic 32 32 48 29 
Masculinity- 

Femininity 25 33 38 29 
Paranoia 30 36 48 32 
Poychaet iea 55 70 78 63 
Schizophrenia 48 $ 66 75 58 
Hypomania 33 35 48 32 
Social Intro- 

version 4T 52 64 47 


MMPI scores from 16 PF scales) and calcu- 
lating multiple Rs in the usual way. The 
weights are obtained by the matrix equation 


W=C,R* 


where W is the vector of 16 weights to be 
attached to the 16 PF factor scale scores in 
estimating the given MMPI factor, C, is the 
correlation matrix among the 16 PF scales. 
The latter turned out, incidentally, to be very 
much the same in this sample as in other sam- 
ples (Cattell & Eber, 1967) and when fac- 
tored® gives the typical second-order struc- 
ture. 

It becomes evident that some MMPI scale 
scores, notably schizophrenia, anxiety, psy- 
chasthenia, and social introversion, can be 
predicted with appreciable efficiency from the 
16 PF. A recent result by Eber (1966) inde- 
pendently supports these conclusions in that 
he obtains the appreciable multiple R of .7 
in predicting from the 16 PF the psychotic- 
nonpsychotic decision (mainly on a schizo- 
phrenic group) as given by the usual judg- 
ments on MMPI data. The results are also 
consistent with the findings that the direc- 
tions in which the 16 PF has been found most 
potent in clinical practice (Cattell & Scheier, 


6R. B. Cattell and J. Ford. An integration, by con- 
gruence coefficients, „of the principal researches on 
second order personality factor structure in Q-data, 
in preparation. 


1961; Cattell, Tatro, & Komlos, 1964) are 
those of neuroticism (including anxiety and 
psychasthenia) and separation of the schizo- 
phrenias. 

The converse prediction—MMPI to 16 PF 
scales—may be left to the reader,’ but the 
total overlap may be appreciated also from 
the first components of the canonical corre- 
lation which we have worked out from NAPS 
Table A, showing the possible prediction of 
MMPI from 16 PF space to be .80. However, 
these figures and the multiple Rs also say 
that an appreciable amount in the MMPI is 
unaccounted for by the 16 PF (and vice 
versa) and this fits our later evidence that 
pathological domains in the latter still remain 
to be accounted for by the new 16 PF Clinical 
Analysis Questionnaire. 


EXPERIMENT II: THE INHERENT COMMON 
DIMENSIONALITY OF THE MMPI AnD 16 PF 


The second and major experiment in our 
investigation aimed to evaluate the common 
dimensionality in the inherent item material, 
setting aside that scale development in the 
MMPI just considered, which, to a psy- 
chometrist or personality theorist, represents 
an odd history of piecemeal approaches and 
expediency. On contemplating the task of 
factoring 566 MMPI items one can, however, 
understand the previous reluctance to come to 
grips with the problem in more general theo- 
retical terms. 

Two possible approaches presented them- 
selves to us; one—building special, large com- 
puter programs for rotation—was proposed 
by Eber and Fowler, while the present writers 
resorted to the parceled factor-analysis method 
(Cattell, 1960) which has proved effective 
before with large problems. By this latter, the 
totality of the MMPI items were grouped 
into 45 small parcels, averaging a dozen items 
in length. Complete contents of these parcels 
in terms of MMPI item numbers are listed in 
Table 3. We repeat for emphasis that these 
parcels were derived from scales with cor- 


7 Jt can be seen, however, that the dimensionality 
of the pathological surface traits is only about half 
that of the 16 PF, so that the MMPI scales were not 
able to predict, for example, A (schizothymia), B 
(intelligence), Qı (radicalism-conservatism), and so 
on, in the normal personality sphere. 
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TABLE 3 
Irem Content oF MMPI PARCELS 
x 
Vari- stive ti MMPI item composition of parcel in terms of item 
able Descriptive tue cepas numbers in the MMPI test booklet (1951) 
20 | Welsh’s Anxiety scale (All true) 41, 67, 94, 305, 356, 382, 396, 414, 418, 443, 465, 499, 511, 
518, 544, 584. 
21 | Depression (true) 5, 13, 52, 86, 104, 130, 158, 159, 182, 193, 236, 259. 
22 | Depression (false) 46, 58, 64, 88, 95, 98, 131, 145, 154, 191, 207, 233, 242, 
270, 272. 
23 | Faking to look bad (true) 14, 31, 34, 35, 40, 48, 49, 50, 53, 56, 66, 85, 121, 139, 146, 
184, 197, 200, 202, ‘205, 206, 209, 211, 218, 227, 256, 291. 
24 | Faking to look bad (false) 20, 83, 113, 185, 199, 257. 
25 | Hypochondriasis (true) 23, 29, 43, 62, 72, 108, 114, 125, 161, 189, 273. 
26 | Hypochondriasis (false) 2, 9, Si, 55, 63, 68, 103, 153, 155, 175, 188, 190, 192, 230, 
243, 274. 
27 | Hysteria (true) 10, 32, 44, 47, 76, 179, 186, 238, 246, 258. 
28 | Hysteria (false) 6, 12, 71, 128, 129, 136, 147, 162, 174, 213, 234, 265, 279, 292. 
29 | K: Correction for faking (all false) 138, 142, 160, 170, 171, 296, 374, 397, 398, 406, 461, 482, 502. 
30 | L: Faking to seem good (false) 15, 30, 45, 60, 75, 90, 150, 165, 195, 225, 295. 
31 | Hypomania (true) 11, 59, 73, 100, 109, 143, 167, 181, 222, 228, 232, 240, 250, 
271, 277, 298. 
32 | Hypomania (false) 101, 105, 111, 119, 148, 166, 180, 267, 289. 
33 | Personal and emotional sensitivity (true) | 217, 226, 278, 282, 299. 
34 | Personal and emotional sensitivity (false) | 79, 99, 176, 198, 214, 254, 262, 264. 
35 | Female sexual identification or 19, 26, 28, 80, 89, 112, 120, 280. 
timidity (false) 
36 | Feminine occupational identification (true) | 4, 25, 70, 77, 87, 92, 126, 132, 140, 149, 203, 204, 261, 295. 
37 | Denial of masculine occupations (false) 1, 81, 116, 144, 219, 221, 223, 283. 
38 | Paranoia (true) 16, 22, 24, 27, 123, 151, 275, 293, 338, 365. 
39 | Paranoia (false) 93, 117, 268, 313, 316, 319, 327, 347, 348, 294. 
40 | Psychopathic deviate (true) 42, 61, 84, 118, 215, 216, 224, 239, 244, 245. 
41 | Psychopathic deviate (false) 37, 82, 91, 134, 173, 183, 235, 237, 287, 96. 
42 | Psychasthenia (true) 304, 317, 336, 337, 340, 342, 343, 344, 346, 349, 351, 357, 
358, 359, 360, 361. 
43 | Psychasthenia (false) 3, 8, 36, 122, 152, 164, 178, 329, 353. 
44 Welsh’s R (Repression) scale (false) 429, 445, 447, 468, 472, 516, 529, 550. 
45 | Schizophrenia (true) 168, 210, 297, 303, 312, 320, 324, 325, 334, 339, 345, 350, 
. ; 354, 355, 363. 
46 | Schizophrenia (false) 17, 65, 177, 187, 196, 220, 276, 281, 306, 309, 322, 323, 330. 
47 Social Introversion (true) 124, 172, 201, 377, 383, 411, 427, 436, 455, 473, 487, 549, 564. 
48 | Social Introversion (false) 33, 229, 391, 400, 415, 440, 446, 449, 450, 462, 469, 479, 
ee 505, 521, 547. 
49 | Edwards’ Social Desirability (true) 7, 18, 54, 107, 163, 169, 371, 528. 
50 | Edwards’ Social Desirability (false) 156, 241, 247, 252, 263, 269, 286, 301, 321, 335, 352, 424, 
431, 439, 555. 
51 | Crown-Marlowe Social Desirability (true) | 556. 
52 | Crown-Marlowe Social Desirability (false) | 39, 135, 157, 255, 481. 
53 | Response set true (true) 310, 311, 326, 332, 341, 367, 370, 401, 405, 412, 421, 425, 
430, 433, 435, 441, 452, 460, 463, 464, 467, 474, 478, 484, 
485, 492, 494, 496, 486. 
54 | Response set false (false) 498, 504, 507, 508, 517, 519, 522, 523, 524, 525, 530, 532, 533, 
534, 535, 537, 538, 539, 540, 541, 542, 543, 551, 557, 559, 
Vs 560, 561, 563, 565. 
55 | Fundamental religion (true) 373, 413, 420, 476, 483, 488, 490. 
56 | Fundamental religion (false) 491. 
57 | Agree with generalized ext. standards (true) | 302, 314, 364, 368, 378, 393, 409, 417, 438, 442, 448, 458, 
i à 466, 470, 493, 495, 510, 545, 548, 558. 
58 | Agree with generalized ext. standards 


(false) 


437, 444, 459, 513, 514, 566. 
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TABLE 3—(Continued) 


Vari- 


able Descriptive title of parcel 


MMPI item composition of parcel in terms of item 


numbers in the MMPI test booklet (1951) 


59 | Freedom from external control (true) 


308, 315, 331, 333, 362, 372, 380, 381, 386, 387, 419, 422, 
434, 456, 471, 506, 512, 536, 475. 


60 Freedom from external control (false) 399, 423. 
61 | Mutuality (true) 375, 388, 390, 404, 408, 416, 477, 489, 500, 503, 509, 515, 
527, 531, 554, 562. 


62 | Mutuality (false) 
63 | Expression of autonomy (true) 
64 | Expression of autonomy (false) 


426, 432, 453, 454, 520, 552. 
318, 366, 369, 376, 402, 407, 410, 428, 497, 501, 546. 
328, 385, 392, 457, 526, 553. 


responding labels, resulting from various pre- 
vious investigations, in such a way as to pre- 
clude item overlap and maximize parcel 
homogeneity, Thus, as may be observed from 
Table 3, the parcel corresponding to Welsh’s 
Anxiety Scale contains only 16 of the original 
items. The items were put together as far as 
possible with regard to item homogeneity, 
guided by content inspection and the way 
items were put together in the original clinical 
construction of the scales. Thus, some advan- 
tage was taken of clinical experience about 
homogeneity present in the scales, but the 
main dependence was on the work of Adams 
and Horn (1965) continued by Hundleby, to 
avoid any item overlap among the parcels. 
True and false components of every MMPI 
parcel were separated to permit a check on the 
acquiescence response set. Thus, a variable 
labeled “true” or “false” in Table 3 consists 
of the subsets of items for which “true” or 
“false” responses, respectively, would yield a 
high score on the corresponding MMPI scale. 
The plan now called for a factor analysis 
of the 45 parcels, from five to 20 items. From 
this, we would factor scores via parcels, and 
so obtain the correlations of all 566 items 
with the obtained factors. One should note 
that the success or lack of success in putting 
items in parcels with a better-than-chance 
degree of homogeneity is not crucial to suc- 
cess of the plan and does not fundamentally 
prevent the factor analysis having a correct 
outcome. For, in the last resort, even if the 
566 items were thrown together in random 
groupings, the dimensionality of the variance 
among the parcels should be the same as the 
dimensionality among the items, except for 
trivial instances, as may be observed from the 


principles of matrix algebra, upon which fac- 
tor-analytic procedure is based. The ade- 
quacy of the “parceled” factor description of 
the domain is limited only by the fact that 
we cannot extract more factors than half of 
the number of variables in attempting to reach 
the full dimensionality. Since 2/2 is still large 
—32—when all our variables are counted, 
there is slight danger that this could result in 
fewer dimensions being recognized than 
through factoring the items directly. The test 
of this is that when we ended by factoring 
(with the addition of 19 representatives of 
16 PF scales to 45 MMPI parcels) a 64 X 
64 matrix, the number of factors we obtained 
was almost exactly the same as that obtained 
by Eber and Fowler’s coordinate study fac- 
toring the 566 items directly. 

As Ss for this 64-variable battery we used 
virtually the same (2 missing) sample group 
of Air Force men as in the previous investiga- 
tion, namely, 257 Air Force men (not pilot 
cadets), ages 19-25. For administration pur- 
poses, the Ss were divided into two roughly 
equal-sized groups, each completing the test- 
ing in two 2-hour sessions (morning and after- 
noon) on the same day. One group took the 
MMPI first and the 16 PF second, and the 
other group reversed this order. The general 
testing conditions prevailing were satisfac- 
tory, and since the Ss were guaranteed ano- 
nymity, we had the impression that suitable 
psychological conditions prevailed. 


Tue ANALYSIS 


In proceeding to the correlation matrix, 
two possibilities arise: (a) to split each 16 
PF scale into two parts, in order to guarantee 
emergence, by two markers, of each 16 PF 
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factor concerned at the primary level, or (b) 
to work with a somewhat more convenient 
smaller matrix and have better scale reliabili- 
ties, by leaving certain 16 PF scales in only 
as single scales (among the 45 MMPI par- 
cels). The best solution seemed to leave the 
16 PF as a single scale wherever the first in- 
vestigation offered evidence that the 16 PF 
scale in question would certainly find some 
common variance with it in the MMPI ma- 
terial, but to put in two markers where this 
might be in doubt. Inspection of Table 1 
showed some resonance in the MMPI could be 
expected for all scales except possibly N, the 
shrewdness dimension, Q, the radicalism-con- 
servatism factor, and G, the super-ego dimen- 
sion. Accordingly, the latter scales were split 
into two markers for each, but the remaining 
13 factor scales were put in singly. 

The Pearson product-moment 64 X 64 
matrix was factored first by principal axis, 
inserting unities in the diagonals, in order to 
estimate the number of factors by the scree 
test (Cattell, 1966c). The results of the eigen- 
value scree test in Figure A (see Footnote 5) 
indicate that 21 or 22 factors needed to be 
taken out. We accepted the former for econ- 
omy and iterated 10 times upon which the 
communalities reached stability for 21 factors. 

The unrotated V, the transformation 
matrix, and the correlation matrix of first- 
order factor procedure are deposited with 
the National Auxiliary Publications Service as 
Tables B, C, and D (see Footnote 5). The 
rotated matrix is the end result of several 
successive procedures: Maxplane (Eber, 
1966), followed by a Procrustes clarification, 
followed by a long succession of Rotoplot ro- 
tations (see Figure B) to test that the highest 
possible simple structure had been reached. 
Since some who have been surprised by so 
much resolution of MMPI content into 16 
PF space have suggested that the greater re- 
liability of the 16 PF scales may have pro- 
duced this result, it must be pointed out that, 
if anything, the MMPI had the better chance 
of statistically predominating. For (a) there 
were some 45 MMPI scales against only 19 
16 PF subscales; (b) at the item level the 
predominance is even greater, 566 to 174; 
(c) the reliabilities of the two sets of scales 
(at least 20 parcels the same or greater item 


Raymonp B. CATTELL AND LAURA SPECHT BOLTON 


length than 16 PF variables) are about the 
same, and (d) the 16 PF variables did not 
benefit from initial “far out” leverage, in the 
rotation, for, as the V, matrix shows, the 
communalities of the MMPI scales are at 
least as high as those among the 16 PF scales; 
(e) the rotation is in any case finally fixed by 
the hyperplanes, not by high-loaded variables, 

The hyperplane history appearing in Figure 
B (see Footnote 5) shows that the rotation 
followed the usual course from a moderate 
45% of the variables in the + 10 hyperplane 
to a number which is increasingly difficult to 
improve upon. The last four or five of the 27 
rotations were persisted in without significant 
improvement (or loss) to make sure that the 
resolution had in fact reached a stable pla- 
teau. In fact, variation in any possible direc- 
tion from our accepted position tended to 
produce a slight drop rather than an improve- 
ment. The hyperplane count for the final 
rotation rested at 79%, which is, by all stan- 
dards in rotational experience in personality 
research, a highly adequate level. 

The resultant reference vector matrix at 
simple structure is shown in Table E (see 
Footnote 5). Since the overall patterns lead 
to the conclusion that the majority of the 
factors are definable as 16 PF factors, the 
reference vectors have been arranged in the 
usual order of 16 PF factors for easy ref- 
erence. 


INTERPRETATION 


By stating that the overall patterns re- 
vealed lead one to identify most as 16 PF 
factors, we do not mean that the 16 PF scale 
marker concerned necessarily has a really high 
loading, though it is (with two exceptions) 
statistically significant at p< .01 (Harris, 
1964) and generally the highest in the col- 
umn. These brief scales typically have a valid- 
ity against the factor of about .6, but in the 
military samples of the kind here used, reli- 
abilities run lower than usual and the validi- 
ties (factor correlations) at best would only 
Tun about 54 (assuming .5 reliability). It 
must be remembered that the experiment was 
not designed to measure the factor, but only 
to identify it, and identification is inferred 
from (a) the marker set to load this factor 
loading it significantly, the bulk of its com- 
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mon variance being accounted for by this 
factor and no other, (b) in the case of two 
markers for one factor from their loadings 
falling appropriately together on one factor, 
(c) the fact that no pair of MMPI parcels 
load the factor more highly than the markers, 
and, (d) the fact that no seriously competing 
alternative empirical pattern exists for the 
identification. 

Because doubt could still exist in a complex 
judgment despite four such sources of evi- 
dence (for example, if the marker correlation 
is quite low, one might argue that it is no 
more than the correlation of a marker for X 
with a factor Y decidedly oblique to X), 
some brief discussion of individual factors is 
needed. For example, Factor 1 is considered 
16 PF Factor A (affectothymia) character- 
ized by outgoing behavior, Though the marker 
loads on .30, the only other variables on 
Factor 1 surpassing this, except Parcels 17, 
Departing from the norm in occupational 
choice (.41), and 41, Freedom from external 
control (—.30), are highly consistent with the 
affectothymic interpretation. For Factor 2 
the highest loading, .57, is Factor B (intelli- 
gence) on the 16 PF. No other significant 
loadings for this factor appeared. On Factor 3, 
the identification is not too satisfactory, be- 
cause the C factor on the 16 PF (ego strength) 
loads only .25, and, in this special case, there 
are some six MMPI parcels which load as well 
or better: Welsh’s repression scale (.37), 
Psychopathic deviate (false) (.36), Agree- 
ment with general external standards (false) 
(.33), Expression of autonomy (true) (—.32), 
Fundamental religiosity (true) (—.31), and 
Denial of masculine occupations (false) (.30). 
Again, however, the new variables have a 
meaning fully consistent with ego strength 
and in no case does the C factor marker on 
the 16 PF load any other factor comparably 
with Factor 3. Factor 4 is loaded .33 by the 16 
PF dominance marker. Although one other 16 
PF variable, L, protension, suspicious, tops 
this loading with .43, the supporting loading 
of the MMPI variable Female sexual identi- 
fication (—.32) strongly suggests a dominance 
interpretation. It may be further noted that 
16 PF Factor L turned up later as a separate 
factor, principally*defined by 16 PF L (32), 
Faking to seem good (False) (.29) and Per- 


sonal and emotional sensitivity (false) (.46), 
consistent with the protensia interpretation. 
Factor 5 is loaded .43 by the surgency 
marker; the nearest other loading being —.41 
by the MMPI parcel, Personal and emotional 
sensitivity (true). Surgency did not load 
above .19 on any other factor. 

Parenthetically, since readers accustomed 
to analysis at a scale-correlational rather than 
factor-analytic level may be suspicious of in- 
ference from such comparatively low correla- 
tions, a word on method must be inserted. 
The present study is a first blocking in of a 
large area, which will, now that specific hy- 
potheses have received tentative support, 
justify a major research. To get 64 scale 
measures in 4 hours of available testing time 
each had to be short to the point where, even 
had it been a pure factor measure, it would 
not load more than .77 (assuming a reliability 
of no more than .6 for the average 10-item 
scale). We do not get a tidy diagonal of .77 
loadings athwart the reference vector struc- 
ture, but we do get a total structure which 
will not fit any alternative rearrangement one 
likes to try. The demonstration of the posi- 
tion of a factor depends on the accuracy, of 
hyperplane determination, which is excellent 
in the obtained structure, not on the presence 
of high loaded variables. Except for H and 
Qs, to be discussed, the marker loadings are 
statistically significant beyond p= .01 by 
Harris’s (1964) test. The contributions of 
these markers to the variance within any 
other factor are trivial and are generally the 
highest on the factor concerned. 

Markers for the super ego, Factor G, the 
reader will recall, were experimentally split 
into two. These consistently load .47 and 27 
on the same factor, No. 6. With the psycho- 
logically related self-sentiment strength fac- 
tor, Qs, on the other hand, a real problem 
arises. As not infrequently happens with ver- 
sions of Qs scales, some of its variance ac- 
tually belongs to the super-ego factor (No. 6 
here) and elsewhere, and only in Factor 15 do 
we find it having variance on a factor not 
otherwise identifiable. This factor is actually 
much more clearly marked by MMPI parcels 
which justify our calling it the general psy- 
choticism factor (mentioned below) and the 
Qs loading is trivial. Though no identification 
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with Qs can therefore be made, a hypothesis 
for further work may be noted that Qs, in 
mixed normal and pathological groups, has 
some strange relation to psychoticism. Ac- 
tually, this has been noted before (Cattell, 
Tatro, & Komlos, 1964) and has given rise 
to the theory that raised concentration on the 
approved self pattern (Qs) is a compensatory 
device when the ego (C) is psychotically dis- 
integrating. 

Factor I, tendermindedness, heavily loaded 
the eighth vector (.55) supported by the next 
highest loading (.27) of Female sexual identi- 
fication. 

Autia, M factor, characterized by imagina- 
tive behavior, is well marked (.47) and so also 
is N, Shrewdness. Guilt-Proneness, O factor, 
has a low loading (.26) on a factor which is 
loaded at the same time on the obvious ref- 
erence variable of Edward’s MMPI social de- 
sirability measure (.42). This is consistent 
with the conclusion (Cattell, 1957) that anxi- 
ety, and particularly the guilt proneness and 
inadequacy aspects of anxiety contributed by 
O factor, express themselves by lower social 
desirability scores. Radicalism, Q,, stands out 
on its own with virtually no MMPI loadings, 
as would be anticipated from Thurstone’s 
(1947) original definition of this normal 
factor. The 16 PF self-sufficiency scale, Qə, 
loads a dimension—Factor 14—.54, which is 
about its usual validity, that is, as a single 
short scale against the pure factor, and Factor 
14 has only trivial MMPI loadings, consistent 
with the normal associations commonly found 
for Qe. Ergic tension, or tenseness, Q4, has a 
loading of .31 on a factor and is accompanied 
expectedly, as an anxiety measure, by a nega- 
tive loading (—.34) on an Edward’s social de- 
sirability referent in the MMPI. 

After these first 16 factors, which, with the 
two exceptions (H and Qs) are marked most 
consistently by 16 PF markers, the remaining 
factors have very little of 16 PF variance and 
are clearly defined largely by MMPI varia- 
bles. Factor 17 is substantially loaded on the 
Hypochondriasis parcel items 6 and 7 (.50 and 
.34, respectively), and somewhat less loaded 
on the Depression, (.16 and .30), Faking bad 
(.30 and .39), and Hysteria (.39), and may 
perhaps best be called Hypochondriasis. Fac- 
tor 18 is recognizable as being probably a 


psychopathic deviate factor since it loads the 
Psychopathic deviate marker (.49) on the, 
MMPI and also to a lesser degree Hypomania 
(.21 and .21) and Paranoia (.24), which are 
consistent with the 16 PF profile for psycho. 
paths (high scores on dominance, surgency, 
and suspicion; Cattell & Eber, 1967). 

The third new factor, No. 19, may be in- 
terpreted in one or both of two ways. The 
most obvious interpretation, based on the only 
two consistently high positive MMPI parcel 
loadings on it, those of the true and false 
subsets of Psychasthenia items and substanti- ' 
ated by low social desirability, is that this 
factor should be labeled Psychasthenia. How- 
ever, closer examination of the marking varia- 
bles shows a noticeable split between true and 
false subsets, for example, those of Social in- 
troversion and Mutuality. This latter inter- 
pretation is reinforced by the high loading of 
the Acquiescent response set, which may very 
well be an alternative for this factor. This 
finding suggests that, if the labels for these 
parcels are accurate, psychasthenia may be 
highly contaminated by an acquiescence re- 
sponse set. Factor 20 has no pathology con- 
tent as such; its expressions are primarily 
strong tendency toward nonagreement re- 
sponses, fundamentalist religion, absence of 
depression, low dominance, and conservatism. 
Although the highest marker for this factor 
was the Negative response set, such an inter- 
pretation was not supported by the loadings 
of the previously mentioned true and false 
subsets of the MMPI parcels. Hence, Factor 
20 is labeled Religiosity with reference to the 
highest consistent pair of subscale loadings on 
it: those of Fundamental religiosity. It sug- 
gests an assured, resigned, simple faith, and 
though it occurs in MMPI items it would 
seem gratuitous from the content and associ- 
ations to label it as a pathological dimension. 
Factor 21 plays the usual role of the error 
factor—the last bit of covariance into which 
“garbage” gets rotated more or less auto- 
matically. Before this, however, comes the 
fourth of the meaningful pathological fac- 
tors (No. 15) set aside for discussion be- 
cause of its curious possible Qs association. 
This has every claim to be considered the 
“general functional psychosis” factor, hy- 
pothesized and set out with preliminary Q- 
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data evidence and discussion by Cattell in 
1957. It loads MMPI parcels for Schizo- 
phrenia, Depression, Paranoia, and Hysteria, 
as well as for the general subjectivity and 
autism of “anatomy.” This has the loss of 
reality contact and inappropriateness of emo- 
tional expression common to diverse psy- 
choses, which have been hypothesized as the 
hallmarks of the “general psychoticism” con- 
cept. 

Because of the instances discussed above 
where factor identification has had to rest— 
as far as direct loadings are concerned—on 
values below the .4 or .5 one would expect on 
psychometric grounds, it has seemed desira- 
ble to check by pursuing the independent 
source of evidence still available. This check 
on direct validity exists in the calculation of 
indirect or “circumstantial validity” (Cattell, 
1964). Does the factor identified by its in- 
ternal loadings here as X correlate with other 
factors as other researches show X to do? 
This is most completely evaluated by observ- 
ing its behavior in the second-order domain, 


and such a second-stratum structuring would 
be desirable also to increase our knowledge of 
the new four pathological factors. 
Accordingly the R, (reference vector cor- 
relation matrix) from the above rotations was 
converted to the R;, for the 21 factors and 
factored, yielding eight factors by the scree 
test, which were rotated 12 times to a mazi- 
mum hyperplane count. All three intermedi- 
ate-step calculation tables, R, V, and L, are 
deposited at NAPS as Table F of No. NAPS- 
00121 (see Footnote 5, and only Table 4, the 
final simple structure outcome, is presented 
here. Surveys presently being conducted at 
Cattell’s Laboratory of Personality and Group 
Analysis, which describe the factors resulting 
from a dozen studies on second-order person- 
ality structure in Q data, indicate that the 
four largest second-stratum factors here are 
clearly those usually obtained from person- 
ality primaries, namely, Extraversion, Anxi- 
ety (closely approximated by Welsh and 
Dahlstrom’s, 1956, estimation, as confirmed 
by the weighting patterns of Welsh’s A scale, 


TABLE 4 


Srconp-OrpER Structure oF 16 PF Factors Founp ın MMPI anp MMPI Factors 
SECOND-STRATUM Factors By NUMBER 


= 
1 4 
. Exvia 2 Inde- 
First-stratum factors (Extra- | Anxiety ee ae 5 6 7 8 
version) dence 

1. A Affectothymia A8 —.20 —.29 .02 19 63 —.18 —.11 

2. B Intelligence .06 —A7 —.05 10 —.02 —.38 87 09 

3. C Ego Strength —.417 —.26 .09 03 —Al —.22 —.52 —.02 

4, E. Dominance 58 —.00 .08 05 58 — Al .25 —.11 

5. F Surgency | 59 04 —.02 — 13 —.00 46 —.03 —.08 

6. G Super Ego —07 | —.04 02 | —.22- | =.17 16 65 11 

7. H Parmia 59 —.41 .02 —.04 33 —.02 .07 —.08 

8. I Premsia 03 —.03 —.69 —.03 —.10 —.19 .12 —.13 

9. L Protension 09 .20 29 .22 04 —.20 —.47 —.07 
10. M Autia —.04 —.41 —.15 39 —.23 —.25 .05 10 
11. N Shrewdness —.05 01 13 04 —.10 23 58 A2 
12. O Guilt Proneness 06 04 26 | —.17 .03 —.62 | —.08 01 
13. Qı Radicalism .08 —.03 —.01 H7 04 60 —.53 48 
14. Qe Self-Sufficiency —.50 06 07 63 - od ip = ie fie: 
15. General Psychoticism (Qs) | —-00 —30 | —05 | —.00 . A cK pe 
16. Qu Ergic Tension —.07 19 10 07 31 | —48 | —39 | —.0 

i 03 46 01 01 —.05 35 13 08 

17. Hypochondria ai a ae We 
18. Psychopathia —.05 — .03 00 —.06 - y a r h 
19. Psychasthenia 06 —.14 —.29 —.17 .82 H pan BS. 
20. Religious Resignation —.57 —.20 —.10 —.20 —.01 E n B 
21. Error —.25 —.09 04 -03 —.10 13 - 
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in the top row of Table 1), Inactivity, and 
Independence. Beyond these, also, as usual, 
the last two or three factors lack certainty. 

From Table 4 one concludes, therefore, that 
the factors identified internally by marker 
content as 16 PF factors (in MMPI item 
space) also occupy the positions relative to 
one another which factors of this identity 
would uniquely possess. Regarding next the 
four new factors, one finds that psychasthenia 
defines a second order largely to itself (No. 
5), as also does general psychosis (No. 6). 
The psychopathic factor is powerfully affected 
by some second stratum entity which also 
favors radicalism. Only the Hypochondriasis ê 
and Religious Resignation factors have 
transactions with secondaries already known 
in the general normal personality field, about 
half of their variances being due to the general 
anxiety factor and Introversion, respectively. 
Whereas the remaining Hypochondriasis 
variance falls into the pathological Factor 6, 
Religiosity divides its remainder between 
Severity (negative Anxiety) and Dependence, 
both normal second-order factors. 

In terms of clinical perspective, what 
emerges from the present study is that patho- 
logical behavior recorded in a broad range of 
clinical items, such as the MMPI pool, is 
partly accounted for by general personality 
source traits common to normals and patients 
and partly by dimensions which appear with 
any appreciable variance only in patients. It 
is appropriate, however, first to pick up and 
define the latter in mixed samples, as here, in 
order to get the normal-abnormal structural 
relations, and also for the practical reason 
that scales for diagnosis must be designed for 
diagnosis in populations containing both. 
However, supplementary refined analyses 
need to be done on entirely clinical popula- 
tions, as in the work of Lorr (1955) and 
Wittenborn (1951). 


8 One should be careful not to confuse this hypo- 
chondriasis factor, which is a source trait, with the 
MMPI hypochondriasis surface-trait scale, 
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Groups of male and female college students volunteering for summer work in 
mental hospitals were compared to control students on personality, occupational- 
interest, and life-history variables. Striking personality characteristics of mental 
health volunteers were maturity and control, drive for independent achieve- 
ment, and sensitivity to people and human problems. Vocational interests and 
life-history data on volunteers were consistent with these personality char- 
acteristics. In vocational interests, volunteers were similar to men and women 
in the independent professions and in professions emphasizing social service or 
the exercise of language and artistic skills. Life-history data indicated that 
volunteers were more service-oriented and committed to mental health work 
than other college students. The role of test-taking attitudes in producing these 


differences was discussed. 


One response to the manpower shortage in 
the mental health professions has been an in- 
creasing use of volunteers in treatment pro- 
grams for mental patients. The volunteers are 
most often college students who visit chronic 
patients on the back wards of mental hos- 
pitals, The students not only learn about the 
field of mental health through such programs, 
but they also provide needed human contact 
for mental patients. 

Gelineau and Kantor (1964) have reported 
on the personal characteristics of Harvard and 
Radcliffe students who make regular visits to 
a mental hospital during the academic year. 
These authors emphasized that volunteers 
were not deviants, but in many ways typical 
Harvard-Radcliffe citizens. Females, however, 
were highly overrepresented among volunteers, 
and the volunteers were distinctive in their 
occupational interests. The volunteers empha- 
sized service as a source of job satisfaction 
more than did their peers, and they tended to 


1The research here reported was completed as a 
part of the Connecticut Service Corps Research 
Project and was supported by Grant MH02127-03 
from the National Institute of Mental Health. The 
authors are indebted to Richard J. Wiseman, director 
of the Connecticut Service Corps, whose willing 
cooperation made the data collection possible. 

2 Requests for reprints should be sent to Karl E. 
Scheibe, Department of Psychology, Wesleyan Uni- 
versity, Middletown, Connecticut. 
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prefer mental health careers to careers in busi- 
ness, law, and the natural sciences. 

Knapp and Holzberg (1964) have charac- 
terized college volunteers in a similar pro- 
gram. In general, these authors concluded that 
Wesleyan University students who made 
weekly visits to the Connecticut Valley Hos- 
pital were not strikingly differentiated from 
the general student body. They found some 
evidence, however, that the companions were 
more intraceptive, morally concerned, person- 
ally compassionate, and introverted than the 
control group with which they were compared. 
Volunteers also differed from controls on cer- 
tain academic variables—major, incidence of 
disciplinary action, and fraternity membership 
(Holzberg, Knapp, & Turner, 1967). 

The Service Corps program of the State of 
Connecticut is similar to the programs at 
Harvard-Radcliffe and at Wesleyan in concep- 
tions and goals. It differs, however, in the 
amount and intensity of the volunteers’ con- 
tact with mental patients. The Service Corps 
students commit themselves to 40 hours 4 
week of interaction with chronic patients for 
8-10 summer weeks. The present report fo- 
cuses on the personal characteristics of these 
students since it seems possible that the spe 
cial intensity of their commitment leads to 2 
more distinctive group than those previously 
studied. The aim of this report is to describe 
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these students in terms of personality char- 
acteristics, interest patterns, and life-history 
variables. 


METHOD 
Subjects 


The experimental Ss were 41 male and 110 female 
students who served in the 1965 and 1966 Connecti- 
cut Service Corps (SC). These Ss represent the entire 
group who completed applications and accepted posi- 
tions in the SC in these years. 

The control groups consisted of 66 females and 76 
males enrolled in summer school at four Connecticut 
colleges. Each member of the control groups was 
paid $10 for his participation. 

A preliminary comparison of the SC and control 
groups indicated that the two samples came from 
similar populations with respect to demographic and 
sociological variables. Chi-square comparisons be- 
tween SC and control groups proved to be nonsig- 
nificant by conventional standards for the variables 
of year of colege graduation, number of siblings, 
ordinal position, church attendance, religious attitude, 
and mother’s educational level. Father’s educational 
level was not significantly different for the males, but 
the educational level of fathers of the SC females 
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was significantly higher than that of the comparison 
females (p< .05). No significant differences were 
found by t-test comparisons on the variables of age, 
father’s age, mother’s age, and family income, 


Procedure and Materials 


All SC volunteers were administered a battery of 
tests and questionnaires in group sessions during their 
first week at the hospitals. The test battery was 
about 4 hours in length and was completed prior to 
any patient contact. Members of the control groups 
were administered an identical test battery at their 
respective colleges at about the same time. 

Measures to be reported cover the areas of per- 
sonality, occupational interests, and life-history data. 
As a multidimensional measure of personality, the 
California Psychological Inventory (Gough, 1957) 
was selected. Other personality information was ob- 
tained by administering the Adjective Check List 
(ACL; Gough & Heilbrun, 1965), the Rotter (1966) 
Internal-External (I-E) scale and the Marlowe- 
Crowne Social Desirability scale (MC-SDS; Crowne & 
Marlowe, 1964). The Strong Vocational Interest 
Blank (Strong, 1959) was used as a measure of oc- 
cupational interests. In addition, five life-history 
variables derived from a biographical questionnaire 
were included as indicators of interests and attitudes. 


TABLE 1 
COMPARISON OF SERVICE CORPS AND CONTROL Groups ON THE CALIFORNIA PSYCHOLOGICAL INVENTORY 
Males Females 
eS ee 
Class Scale Service | Control Service | Control 
Corps group t test Corps group t test 
M M M M 
ASA P A A eee HEIER NTI ASE DS IV) 
I | Dominance 54.00 55.61 —.64 52.00 50.48 91 
Capacity for Status 56.34 55.50 49 54.77 51.55 1,96 
Sociability 49.29 53.34 —2.06* 51.19 52.14 —.64 
Social Presence 56.93 59.12 —1.09 56.62 53.86 1.54 
Self-Acceptance 57.78 62.12 —2.25* 56.38 58.61 —1.39 
Sense of Well-Being 42.46 42.08 AT 44,44 41.16 1.92 
ibili 50.83 48.97 1.04 50.07 47.81 1.61 
E pe! 47.85 47.67 09 46.41 46.31 06 
Self-Control 44.51 40.22 2.23* 44.26 40.22 2.61** 
Tolerance 51.44 48.53 1.56 52.67 46.94 3.42** 
Good Impression 45.66 41.79 2,13* 45.08 41.75 2.33* 
Communality 49.17 50.22 —.60 47.97 | 48.45 —.34 
m i t via Conformit: 49.78 47.92 85 48.07 45,14 1.92 
perme via EAE 60.81 56.65 2.24* 59.38 53.28 4.04%" 
Intellectual Efficiency 51.93 51.37 .28 53.44 | 48.25 3.08 
ical-Mi 57.24 53.26 1.89 55.00 50.95 2.60* 
"Ya [bse e sre | seit | 238° | ot.o6 | $7.56 | 2.27% 
youn? 56.10 49.74 3.96** 49.74 52.38 —1.81 
e 
*p <05. 
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RESULTS 
Personality Measures 


Table 1 lists the means of the 18 scales of 
the California Psychological Inventory (CPI) 
for the SC and control groups, together with 
the ¢ ratios indicating the significance of the 
difference on each scale. Thirteen of the 36 
comparisons reach significance at or beyond 
the .05 level. 

Class I scales of the CPI are generally 
interpreted as measuring social poise, ascend- 
ancy, and self-assurance. Scores of the SC 
males, but not of the SC females, are rela- 
tively low on this group of scales, Both male 
and female volunteers are higher than the con- 
trol Ss on Class II and Class III scales of the 
CPI. Class II scales are usually taken to be 
measures of maturity and control while Class 
III scales measure achievement potential and 
intellectual efficiency. On Class IV measures, 
SC males and females are more psychologically 
minded and flexible than their peers, but there 
is a sex difference on the femininity scale. The 
SC males are clearly more feminine or nurtu- 
rant than nonvolunteers whereas the female 
members of the SC appear to be slightly less 
feminine than their comparison group. 

In summary, SC volunteers appear to be 
mature, tolerant, and controlled; high in need 
for achievement, especially creative achieve- 
ment; and psychologically minded and flexi- 
ble in their thinking. Male volunteers, but not 
female volunteers, tend to be less socially 
poised and more nurturant than their non- 
volunteering peers. 

A very similar characterization of the SC 
females emerges from an analysis of ACL re- 
sults, SC females are significantly higher on 
scales reflecting self-control and achievement, 
and lower than controls on scales of Hetero- 
sexuality and Succorance. SC males are higher 
than controls on the Abasement variable of 
the ACL.* No differences were found between 
SC and control for either sex on the I-E or 
MC-SDS variables. 


3 Tables giving complete data for each subgroup 
have been deposited with the American Society for 
Information Science. Order NAPS Document 00213 
from ASIS National Auxiliary Publications Service, 
c/o CCM Information Sciences, Inc., 22 West 34th 
Street, New York, New York 10001; remitting $1.00 
for microfiche or $3.00 for photocopies. 


Vocational Interests 


In Table 2 are found the means for the mi 
groups on 31 of the 51 occupational scales 
the Strong Vocational Interest Blank (SVIE 
On these scales, the two groups differ signi 
cantly (p < .05). 

In vocational interests, SC males most 1 
semble men in occupational Groups I, V, Ų 
and X. Group I occupations are all professic 
which tend to be practiced independently. TI 
four differentiating scales in Group V-—voe 
tional counselor, social worker, school superi 
tendent, and minister—are directly concernt 
with welfare and service. Group VI a 
Group X occupations require artistic al 
verbal-linguistic interests. 

The vocational interests of SC males a 
clearly unlike those of certain occupation 


TABLE 2 


Comparison OF SERVICE Corps MALES AND Co! 
GROUP ON THE STRONG VOCATIONAL 
INTEREST BLANK 


Ser- 
Group Occupation et 
M 
I Artist 41.2 
Psychologist 46.6 
Architect 36.7 
Physician. 43.9 
Psychiatrist 45.5 
Veterinarian 13.5 
Il Mathematician 31.2 
il Production manager 21.5 
Iv Farmer 23.7 
Aviator 23.7 
Industrial arts teacher 5.0 
Vocational agriculture 
teacher 12.7 
Policeman 21.6 
Army officer 17.9 
Nj Vocational counselor 39.4 
Social worker 44.0 
School superintendent 34.7 
Minister 36.5 
VI Musician 50.5 p 
Music teacher 42.5 ” 
VII Senior CPA 27.5 | 34.2 
Accountant 17.0 | 24.8 
Cer akir 22.0 | 29.0 
manager 28.1 | 33.4 
Purchasing agent 13.6 | 24.4 
Banker 19.7 | 23.4 
Pharmacist 26.8 | 30.8 
IX Sales manager 27.1 | 31.4 
x Adve hiing man 4 | 39.2 
wyer 48.4 | 39.4 
Author-journalist 46.2 | 38.2 
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TABLE 3 


COMPARISON OF SERVICE Corps FEMALES AND CONTROL 
GROUP ON THE STRONG VOCATIONAL 


INTEREST BLANK 
Sis | tel, 
i ice | tro! 
Group Occupation Corps| group t ratio 
M M 
II | Artist 36.8 | 28.8 | 3.00** 
Author 37.9 | 30.6 | 2.61* 
III | Social worker 42.5 | 35.4 | 2.57% 
Psychologist 36.4 | 22.0 | 5.30** 
IV | Life insurance salesman 20.5 | 26.9 | —2.14* 
V | Buyer 14.0 | 26.2 | —4.54%* 
Business education teacher | 14.7 | 25.6 | -3.744 
Stenographer-secretary 27.8 | 36.5 | —3.81%* 
Office worker 20.1 | 31.8 | —5.15** 
VI | Elementary teacher 25.8 | 32.8 | —2.67%* 
Housewife 26.8 | 35.8 | —4.12** 
Dietitian 15.5 | 22.6 | —2.39* 
VIII | Physician 30.8 | 21.8 | 2.89%* 


Note—SVIB scores were not collected from females in 
1964, Consequently, this analysis is based on a reduced sample 
of 68 SC at control females. 


** p <.01. 


groups. In occupational interests, SC males do 
not resemble members of the technical or 
skilled trades (Group IV). Furthermore, the 
SC male does not share the interests of men 
in business administration or business detail 
(Group VIII). On seven of the eight scales in 
Group VIII, the SC are significantly lower 
than the comparison group. 

Table 3 presents comparable data from the 
SVIB for women. Thirteen of the 29 compari- 
sons are significant at or beyond the .05 level. 
SC females, like males, are distinctively high 
on occupations involving yerbal-linguistic and 
artistic interests—artist, author, and music 
performer. Females are also high on the Group 
TII social service occupations and on the phy- 
sician scale. On the other hand, the interests 
of female volunteers are unlike those of women 
in routine business and teaching posts. The 
control group is much higher than the SC 
group on the “nonprofessional” cluster of oc- 
cupations—office worker, stenographer, and 
housewife. 

In general, male and female SC volunteers 
have vocational interests in the independent 
professions; in social service; in careers in- 
volving the use of language or the exercise 
of artistic skill. By temperament, the SC vol- 
unteer seems unsuited to business, entrepre- 
neurial, and “ponprofessio ” occupations. 


Life-History Variables 


Five biographical items included on the 
questionnaire asked students about mental 
health interests and volunteer activities. Re- 
sponses to these items were analyzed sepa- 
rately for the male and female groups. Nine of 
the 10 chi-square comparisons were signifi- 
cant at or beyond the .05 level. 

SC volunteers clearly differ from nonvolun- 
teers in their interests in mental health, Volun- 
teers more often plan careers in mental health 
fields (p < .01 for both males and females) and 
more often choose academic majors in psy- 
chology and other mental-health-related fields 
(p < .01 for both males and females). More 
SC than control males have visited a mental 
hospital before the SC summer (p < 01); 
female SC, however, do not differ from con- 
trols on visits to a mental hospital. 

Comparison on biographical questions indi- 
cates that volunteer work is more character- 
istic of SC than control students. More SC 
members than control students have engaged 
in both college-related volunteer work (p< 
.05 for males; p < .01 for females) and extra- 
curricular volunteer work (p < .01 for both 
males and females). 

In general, the SC volunteers seem more 
service-oriented and committed to mental 
health as a profession than other college stu- 
dents. 


Discussion 


The composite picture of the mental health 
volunteer that emerges from the CPI is of a 
student who is mature and controlled, oriented 
toward independent achievement, and sensi- 
tive to people and human problems. The ele- 
vation of both male and female SC volunteers 
on the Good Impression scale of the CPI raises 
questions, however, about the accuracy of this 
picture. The possibility must be considered 
that differences on the Good Impression scale 
and other scales of the CPI simply reflect at- 
tempts by the volunteers to create a favorable 
impression and to comply with “role demands” 
of the testing situation. 

Some support that the present findings re- 
flect real personological traits rather than test- 
taking artifact comes from the work of Holz- 
berg, Knapp, and Turner (1967). These re- 
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searchers compared volunteers and controls on 
test information available from student files. 
The test data were obtained under uniform 
conditions at the time of students’ entry into 
college, before they had expressed any atti- 
tudes about volunteer work in mental hos- 
pitals. Holzberg and his colleagues found a 
similar lack of economic motivation, overrep- 
resentation of psychology as an academic 
major, and orientation to service. In addition, 
Holzberg refers to the lower incidence of dis- 
ciplinary action against volunteers during their 
college careers—confirmation from another 
domain of data that volunteers are at least as 
psychologically mature as nonvolunteers. 
Consistencies within the present study also 
lead the present authors to doubt that group 
differences on the CPI can be attributed com- 
pletely to “role demands” upon the volun- 
teers. The psychological-mindedness of SC 
volunteers, for instance, is reflected not only 
on the CPI but also in their occupational pref- 
erence for person-oriented careers and in their 
choice of majors in mental-health-related 
fields. Drive for independent achievement was 
expressed both on the personality and interest 
inventories. The data reported here suggest 
that participation in volunteer work is not 
motivated by overconcern with personal prob- 
lems but rather is partly attributable to a con- 
trolled drive for independent achievement and 
sensitivity to human problems. Such character- 
istics may have functional significance for 
volunteer programs. Scheibe (1965) has al- 
ready noted the low failure rate for these 
self-selected students, and follow-up question- 


naires indicate that the students develop a 
ever deeper commitment to the mental health 
professions. Thus, these self-selected students 
appear capable of responding to the challeng 
of mental health work and the program ap. 
pears to crystallize the students’ budding in. 
terest in the mental health professions. 
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NEED FOR APPROVAL AND POPULARITY: 
SEX DIFFERENCES IN ELEMENTARY SCHOOL STUDENTS + 


STEVEN R. TULKIN, JOHN P. MULLER, anp LANE K. CONN? 


Harvard University 


The Children’s Social Desirability Scale (CSD) and a sociometric measure were 
group administered to 192 fifth- and sixth-grade students. Intelligence and 
achievement scores were also obtained. High need for approval girls (as mea- 
sured by CSD) were found to be most popular, while high need for approval 
boys were least popular. Partialing out the effects of intelligence did not sig- 
nificantly alter these sex differences. Sex role theory was used to explain the 
findings, hypothesizing that behaviors associated with a high need for approval 
are consistent with the female sex role model but contrary to the male model. 


Crowne and Marlowe (1964) presented the 
view that the tendency to respond in a so- 
cially desirable manner is not merely a dis- 
tortion of test responses, but, more impor- 
tantly, can be considered as “meaningful prob- 
lem solving behavior indicative of an indi- 
vidual’s approach to being evaluated,” and is 
related to behavior in other situations. Rather 
than investigate the personality test responses 
themselves, Crowne and Marlowe postulated 
a motivational state—the need for approval 
(n App)—which was reflected in test-taking 
behavior, and sought its correlates in behavior 
outside the testing situation. Initial research 
indicated that high scorers on the Marlowe- 
Crowne Social Desirability Scale (M-C SDS) 
tended to conform in an Asch-type experi- 
ment (Strickland & Crowne, 1962) and to be 
more susceptible to manipulation of their at- 
titudes than individuals less motivated to- 
ward approval from others (Salman, 1962). 

In order to investigate the emergence and 
development of n App, 4 Children’s Social 
Desirability Scale (CSD) was constructed 


1 This research was partially supported by Na- 
tional Institute of Mental Health Grant HDO 1652 
to Lane K. Conn. Computer facilities for the analy- 
sis of the data were provided through NSF Grant 
Number GT2723 to the Laboratory of Social Rela- 
tions, Harvard University. i 

2The authors wish to acknowledge the assistance 
of Walter Gleason, L. Rich, and P. Caffrey in provid- 
ing the subject population, and to express their grati- 
tude to Kenneth Gergen for his suggestions on in- 
terpreting the data and Krayna Tulkin for clerical 
assistance. Requests for reprints should be sent to 
Lane K. Conn, Department of Social Relations, 
Harvard University, Cambridge, Massachusetts 
02138. 
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(Crandall, Crandall, & Katkovsky, 1965). 
Crandall (1966) observed a “striking simi- 
larity” between the behaviors characteristic 
of high n App adults and the personality at- 
tributes of high CSD children of high school 
age as revealed in their scores on the Cali- 
fornia Psychological Inventory. Specifically, 
high CSD children showed the same sorts of 
concern about others’ evaluations, the same 
low self-esteem and lack of self-confidence, 
and the same conventional, inhibited, sug- 
gestible, controlled behaviors and attitudes 
exhibited by the young adults tested with the 
M-C SDS in previous studies. 

Crandall (1966) also reported that high 
CSD males and females behaved differently. 
For the females, significant correlations were 
reported between CSD score and instigation 
of physical aggression (—.40), verbal aggres- 
sion (—.49), and withdrawal from aggressive 
attacks (.33), while for males the correlations 
with the same variables were .14, —.08, and 
—.02. Males with high CSD scores were char- 
acterized by infrequent achievement behav- 
iors. They showed significantly fewer attempts 
at mastery of fine motor skills, spent less 
time working alone in achievement tasks, 
made significantly less effort to achieve, and 
in general tended to be less persistent when 
they were engaged in achievement activities. 

Crandall et al. (1965) also reported large 
sex differences on the CSD itself. Females 
had higher scores at every grade tested, 
and the authors speculated that it “may be 
more important for American girls to appear 
socially acceptable than for American boys 
to do so.” 
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The present investigation focuses on the 
relationship between popularity and n App: 
How successful are approval-oriented indi- 
viduals in achieving social acceptance? Bank * 
reported a correlation of —.33 between scores 
on the M-C SDS and likeability ratings among 
a group of fraternity brothers. High n App 
scores were also associated with a greater ex- 
pectancy of rejection in TAT stories, and with 
overestimation of popularity on the socio- 
metric measure. Salman (1965) used female 
Ss in a similar investigation and found no cor- 
relation between M-C SDS scores and popu- 
larity (r = .07). 

Other studies of popularity further define 
the relationships to be investigated in the 
present research. Iscoe and Carden (1961) 
administered the Witkin Embedded Figures 
Test, the Children’s Manifest Anxiety Scale 

(CAS), and a three-question peer rating in- 
dex to a sixth-grade class. The Witkin Test 
is relevant to the present discussion in light 
of Rosenfeld’s (1967) report that high n App 
Ss were field independent. Iscoe and Carden 
found a signicant rank-order correlation of 
-57 (p < .05) between sociometric status and 
field dependence for girls, in contrast to a sig- 
nificant negative correlation of —.51 (p< 
.05) for males. In addition, a correlation of 
—.60 (p< .01) was obtained between CAS 
and field dependency for females indicating 
that field independent girls tended to be more 
anxious than field dependent ones. For males, 
the correlation was positive but not significant. 
The authors explained their results by refer- 
ring to culturally learned sex-role behavior: 


Boys are expected to be somewhat aggressive, direct, 
and analytic while girls are taught a more submissive 
and conforming ‘ladylike’ type of behavior. The girl 
who identifies with this role gains acceptance .. . 
(while) an analytic field independent mode of per- 
ceiving results in less popularity and more anxiety. 
The reversal of this relationship in boys perhaps 
emphasizes the cultural rewards for their exhibiting 
initiative [Iscoe & Carden, 1961, p, 84]. 


Similarly Jones, Jones, and Gergen (1963) 
found that male Ss negatively evaluated male 
conformers and, in fact, gave the highest rat- 
ings to those who did not conform even when 
under strong pressures to do so. Females, on 


8S. P. Bank, unpublished manuscript, 1963. Cited 
by Crowne and Marlowe (1964). 


the other hand, showed a general tendency tọ 
give more positive evaluations to males who 
conformed. 

The purpose of the present study is to 
further investigate the relationship between 
approval-oriented behavior and both actual 
and perceived popularity. It was hypothesized 
that high n App males would be unpopular 
while the negative relationship between n App 
and popularity would not hold up for the 
females. 


METHOD 
Subjects 


The Ss for the present study were 192 fifth- and 
sixth-grade students from a suburban Boston school 
system. The sample comprised the entire fifth and 
sixth grades in two schools. There were an equal 
number of fifth and sixth graders and an equal 
number of males and females. An estimate of each 
family’s social class was determined by the occupa- 
tion of the student’s father using the Hollingshead 
(1957) Index of Social Position. Each of the seven 
occupational levels except for the lowest (unskilled 
and unemployed) was well-represented.* 


Measures 


The Children’s Social Desirability Scale (CSD), 
Intermediate Level, was group administered to each 
of seven classes with the teachers present. Previous 
inquiries had shown that students had no diffculty 
with reading or understanding the questions. Intelli- 
gence and achievement test scores were taken from 
the students’ record cards. Intelligence was measured 
by the Otis Quick Scoring Test and achievement by 
the Stanford Achievement Test. 

A modified sociogram was administered on which 
the students first listed names of three classmates for 
whom they would vote on each of five questions: 
president of the class, representative to a television 
quiz show, person played with the most, person liked 
best, and person liked least. These lists were col- 
lected and a second sheet handed out on which each 
student was asked to guess how many votes he or she 
would receive on each of the questions. Four varia- 
bles were created for each question: the actual vote 
received, the estimated vote, the absolute discrep- 
ancy between the vote and the estimate, and a rela- 
tive discrepancy between the estimate and the vote 
(hereafter termed “base”) which indicated whether 
the estimate was greater or less than the actual vote. 
An estimate of the independence of the five socio- 
metric questions was obtained by correlating the 
number of votes which each student received on ea 
of the five questions, These correlations are present 
in Table 1. While some of the questions share a con- 
siderable amount of common variance (“president” 


‘Level 1, 32; Level 2, 17; Level 3, 33; Level 4 
37; Level 5, 48; Level 6, 23; and Level 7, 2. 
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TABLE 1 


INTERCORRELATIONS OF VARIOUS 
SOCIOMETRIC MEASURES 


ts Likes | Likes 

Measure Quiz Play ma a 

President .854 .468 569 | —.325 
Quiz .382 439 | —.259 
Play 783 — .393 
Likes best —.384 


Note.—All correlations are significant at p > .01. 


and “quiz,” for example), many of the correlations 
reflect a moderate amount of independence. It ap- 
pears that two clusters emerge: one dealing with 
task competence (“president” and “quiz”) and the 
other reflecting a more social or affective popularity 
(“play” and “likes best”). 


RESULTS 


Table 2 presents descriptive statistics on the 
CSD for the present sample and that 
reported by Crandall, Crandall, and Katkov- 
sky (1965). The results are quite similar, ex- 
cept for the sixth-grade males where the dif- 
ference is significant at the .05 level. 

Table 3 presents the correlations of major 
variables with the scores on the CSD for males 
and females. The significant differences be- 
tween the correlations of the males and fe- 
males indicate that high n App is related to 
different clusters of behaviors for the two sex 
groups. For females, high n App is signifi- 
cantly correlated with higher peer ratings on 
all of the four “positive” questions, and also 
to higher estimates of the number of votes 
that would be received. High n App females 


TABLE 2 


Normative DATA ON THE CSD FOR PRESENT 
SAMPLE AND CRANDALL, CRANDALL, 
Anp Katovsky, 1965 


Crandall et al., 
Present sample 1965 
Grade 
N SD | N M SD 
5 
Females | 45 |24.47 | 7.09 | 53 23.00 | 9.33 
Males 43 | 20.63 | 8.51 | 53 21.75 | 8.84 
6 
Females | 46 | 23,41 | 8.73 73 | 22.12 | 8.82 
Males 49 | 20.67 | 7.02 | 93 | 17.68 8.33 


also tend to have higher discrepancies be- 
tween actual and predicted vote, and the 
correlations with “base” variables show that 
the females tend to overestimate. For males, 
on the other hand, high n App is associated 
with fewer votes on all questions, although 
the correlations are significant only for the 
“friendship” categories (“play with” and 
“likes best”). Also in contrast to the females, 
high CSD males give lower estimates of the 
number of votes they will receive (although 
only significant for the vote for president). 
Their self-depreciating tendency is confirmed, 
however, by the significant correlation of 
“base least” which shows that the high CSD 


TABLE 3 


Propuct-Moment CorrELATIONS or CSD 
WITH OTHER VARIABLES FOR 
Mates AND FEMALES 


Significance 
of differ- 
Variable ences be- 
tween cor- 
relations 
oS ee EE, 
1Q 
Achievement 
Vote for 
President o 
Quiz : 
Play 01 
Likes best 01 
Likes least ? 
Estimate for 
President 01 
Quiz 01 
Play 01 
Likes best 01 
Likes least 01 
Discrepancy for 
President 01 
Quiz 05 
Play 
Likes best 05 
Likes least 05 
Base for 
President 
Quiz 
Play 05 
Likes best 
Likes least 05 
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TABLE 4 


PARTIAL CORRELATIONS BETWEEN CSD AND 
PEER RATINGS HOLDING INTELLIGENCE 


CONSTANT 
Significance 
of differ- 
Variable Males | Females | ences be- 
tween cor- 
relations 
Vote for 
President —.024 .337** .05 
Quiz 2051 .314** .05 
Play —.263** .272** 01 
Likes best —.195* .365** 01 
Likes least —.106 —.161 
Estimate for 
President —.146 .476** .01 
Quiz —.026 432"* 01 
Play —.158 .399%* 01 
Likes best —.111 .224* .01 
Likes least —.166 —.044 
Discrepancy for 
President —.136 .252* .05 
Quiz .103 .391** 05 
Play —.110 .221* 05 
Likes best —.130 .204* .05 
Likes least 179 075 


* p <.05; two-tailed test. 
> <.01; two-tailed test. 


males overestimate the number of votes they 
think they will receive in the “likes least” 
category (p < .05). 

Table 3 also reveals a significant correla- 
tion between intelligence and CSD scores for 
the males, Since intelligence was significantly 
related to several of the sociometric ratings, 
the possibility existed that intelligence, rather 
than n App, was responsible for the correla- 
tions presented above. To investigate this pos- 
sibility, partial correlations were computed be- 
tween CSD and the sociometric measures with 
the effects of intelligence partialed out. Ta- 
ble 4 presents these partial correlations and 
demonstrates that, even with intelligence held 
constant, there are significant differences be- 
tween the sexes on the relationship between 
CSD and the sociometric ratings. 


Discussion 


The present study suggests that high n 
App has an opposite relationship with popu- 
larity for male and female fifth- and sixth- 


grade students. It therefore seems necessa 
to specify further the explanations that h 
been offered for the lack of popularity of f 
high n App individuals, and to suggest tl 
these hypotheses are valid only for mal 
Marlowe and Gergen (in press), for examp 
suggested that the high n App person wo 
too hard at winning acceptance, is seen | 
unrealistically accommodating, and, as a co 
sequence, is disliked. Similarly, Barth 
(1963) noted that high n App males appea 
be cautious in their relationships with other 
over-dependent upon the guidance of 
around them, and eager to create favorabl 
even if distorted, impressions. Barthel submi 
ted that such behavior, although tolerai 
might be perceived as “phony” by persons i 
close contact with the high n App individua 
The present research suggests that these argi 
ments are applicable only to males, while fo 
females this type of behavior tends to be : 
sociated with greater popularity. 
We might hypothesize that low n App fi 
males and high n App males violate the 
cultural role patterns and are therefore 1 
jected by their peers. This hypothesis s 
to be supported by descriptions of high 
males as not evidencing achievement bi 
iors (Crandall et al., 1965) and as si 
cantly more often imitating the play activiti 


frequently “initiating aggression of 
physical and verbal nature” and as less fr 
quently withdrawing from attacks. This be 
havior seems quite incompatible with th 
feminine sex-role model. 

Further clarification is offered by 
vitch (1960), who investigated compatibi 
of personalities among male and female friend 
ship groups. He found that among boys thet 
were significantly more complementary 
than common traits, while for girls, friend: 
ship was “based more on similarity than dif 
ference of attributes.” Girls tended to be 
friend other girls if they had a lot in common 
while males seemed to prefer friendships wit 
others who showed individuality and coi 
compleet each other, rather than be identi 
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Tables 3 and 4 reported significant sex dif- 
ferences in correlations between CSD and 
the estimate and discrepancy variables. For 
the females, this suggests that although 
those with higher CSD scores received more 
votes, they were deficient in role taking: that 
is, they were less accurate in their perceptions 
of how they were viewed by their peers. The 
high n App males, on the other hand, were 
more accurate in their perceptions. We might 
hypothesize that approval-oriented females 
orient themselves to their peers as a reference 
group. On the other hand, higher CSD males, 
who were less discrepant in their estimations, 
may be oriented to an adult reference group, 
and through interactions with adults have en- 
countered more complex role-taking capabili- 
ties. 

The CSD may have greater predictive util- 
ity when further research is undertaken to- 
ward understanding how the need for approval 
is sought. The fact that a person may have 
many reference groups toward which he ori- 
ents himself differentially implies that the n 
App construct would have greater predictive 
validity if reference group norms were taken 
into account. 
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Each of seven patients saw each of four therapists twice. After each session, 


therapists rated how they thought the patient and therapists should behave 
during psychotherapy. In addition, these patient and therapist behaviors were 
independently rated from tapes of the meetings. A balanced 4 X 7 design was 
utilized, Behavior rating data indicate that therapists are more influenced by 
patients than patients are by therapists. Comparisons of questionnaire and | 
behavior rating data indicate that differences among therapists are more evident | 
in how therapists think than in how they or patients behave. Implications for 
understanding the different roles of the psychotherapist are discussed. | 


While the importance of viewing the psy- 
chotherapeutic interview as an interdependent 
system of patient and therapist behaviors has 
often been asserted, there has been little re- 
search explicating the degree and types of such 
interdependence in psychotherapy sessions. A 
research design where each of the several pa- 
tients meets with each of several therapists is 
one technique for quantitatively studying such 
interdependence. In such a design, patients 
can be represented by rows and therapists by 
columns with one cell for each patient-thera- 
pist combination. This design allows data 
analysis using a two-way analysis of variance. 
The degree of patient-therapist interdepen- 
dence can then be assessed by the percentage 
of variance accounted for by the patient main 
effects and interactions when studying thera- 
pist behaviors, and by the percentage of vari- 
ance accounted for by the therapist main ef- 
fects and interactions when studying patient 
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behaviors. These percentages of variance will” 
indicate the degree to which differences be- 
tween patients or unique patient-therapist 
combinations influence therapist behaviors, 
and the degree to which differences between 
therapists or unique therapist-patient combi- 
nations influence patient behaviors. Con- 
versely, a lack of patient-therapist interde- 
pendence can be shown by the size of the 
therapist main effects when studying therapist 
behaviors and of patient main effects when 
studying patient behaviors, The percentage of 
variance accounted for by these main effects 
will indicate the degree to which there are 
consistent differences between patients or be- 
tween therapists, independent of the person 
with whom they are meeting. 

Two earlier studies utilizing this design 
have investigated the nature and degree of 
patient-therapist interdependence. Van der 
Veen (1965), working with chronic, hospital- 
ized schizophrenics, rated problem expression 
and immediacy of experiencing for patients 
and congruence and accurate empathy for 
therapists. He reported significant patient 
main effects, therapist main effects, and inter- 
actions for both patient and therapist behav- 
iors. He found patient main effects are the 
strongest of the three factors in patient be 
haviors, and therapist main effects the strong- 
est factor in therapist behaviors. Moos and 
Clemes (1967), working with an outpatient 
clinic population, measured for both patients 
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and therapists the total number of words 
spoken, the percentage of feeling words, the 
percentage of action words, and the number of 
reinforcements. They also reported a number 
of significant patient and therapist main ef- 
fects as well as interactions. However, in con- 
trast to Van der Veen, they reported that, 
with the exception of “number of reinforce- 
ments,” patient main effects accounted for a 
larger percentage of the variance than did 
therapist main effects in both therapist and 
patient behaviors. For most of the behaviors 
measured, they concluded that patients influ- 
ence therapists more than therapists influence 
patients. Methodologically, the Moos and 
Clemes findings are somewhat stronger than 
Van der Veen’s because Van der Veen’s be- 
havior ratings involved subjective estimates 
by raters who rated all meetings. Thus, rat- 
ers’ expectations about consistent differences 
between therapists or patients could con- 
tribute to the size of either patient or thera- 
pist main effects. Moos and Clemes’ variables, 
on the other hand, required very little rater 
inference and therefore minimized the possi- 
bility of rater bias contributing to main effects. 
Utilizing the same design as the Van der 
Veen and Moos and Clemes studies, the 
present study adds an additional dimension, 
namely how therapists think patients and 
therapists should act during the psychothera- 
peutic interview as well as the behavior ratings 
which were parallel to the questionnaire mea- 
sures. The purpose of this study was to com- 
pare the percentages of variance accounted 
for by patients, therapists, and interaction in 
questionnaires and behavior ratings and thus 
study the magnitude of patient and therapist 
interdependence in both types of data. 


METHOD 


This research was carried out in a clinic where 
patients routinely rotated for 25-minute therapy 
sessions among a resident therapist pool. This was 
the same rotating patient clinic (RPC) used and 
described by Moos and Clemes. The present sample 
included seven female patients representative of the 
RPC population and four male psychiatric residents; 
two in their second and two in their third years of 
residency. Each of the seven patients met twice with 
each of the four therapists for a total of 56 psycho- 
therapeutic interviews, each of approximately 25- 
minutes duration. interview was tape recorded. 
Following each interview, the therapist rated on a 


4-point scale (a) what he thought each patient 
should do during therapy sessions to accomplish his 
goals and (b) what he thought the RPC therapists 
should do during therapy sessions to help the patient 
accomplish these goals. Questionnaire content was 
originally obtained through interviews with the 
RPC patients and therapists. Items which were par- 
ticularly appropriate for that clinic setting were 
chosen. 

In addition to these postmeeting questionnaires, 
two raters independently rated the 56 tape record- 
ings using the same patient and therapist behaviors 
which were in the questionnaire. For example, the 
therapists’ questionnaire asked whether the patient 
should tell intimate and private details of her life 
and the behavior raters scored the degree to which 
the patient did tell intimate and private details of 
her life during that interview. 


RESULTS 


Correlations between the two behavior rat- 
ers range between .37 and .89 on the 20 be- 
havior ratings, with a median of .52. The two 
raters’ scores were summed on each item and 
these summed scores were used in the analy- 
ses. While many of these reliabilities are not 
high, it will be shown later that our findings 
are independent of item reliabilities. 

Therapist questionnaire scores were trans- 
formed into standard scores for each thera- 
pist separately, in order to minimize the ef- 
fects of general response set on therapists’ 
answers. For example, for each therapist, all 
of his answers to questions about how the 
therapist should act were combined. Since 
each therapist filled out the questionnaire 14 
times, and each time rated seven items on 
how therapists should act, the mean and 
standard deviation of 98 responses were used 
in transforming the scores to standard scores. 
These transformed scores can be interpreted 
as the relative emphasis which each therapist 
gave to each item. The same procedure was 
followed with the questionnaire responses to 
patient behaviors as well as with the inde- 
pendent behavior ratings of therapist and 
patient behaviors from the taped sessions. 

Each of the 20 items in the therapist’s 
questionnaire and each of the 20 ratings of 
behavior were analyzed with analyses of vari- 
ance. A two-way design was utilized, four 
therapists by seven patients, with two repli- 
cates per cell. 

Table 1 reports results from therapists’ 
questionnaires and independent behavior rat- 
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TABLE 1 
PATIENT BEHAVIORS: PERCENTAGE OF VARIANCE ACCOUNTED FOR BY EacH SOURCE 
Therapist questionnaires Behavior ratings 
Patient behaviors = am 
Patient on Inter | within | Patient| TPS | action | WAN 

1. Be businesslike toward doctor 6 138 0 81 54 0 0 46 
2. Tell doctor intimate details of her 

life 218 1 6 72 27 108 0 63 
3. Tell what happened since clinic visit} 4 29b 0 67 0 0 0 100 
4. Try to talk about what doctor 

thinks is important 4 53b 4 39 19% 0 0 81 
5. Tell about important people in her 

life 8 13 328 47 26> iis 1 62 
6. Talk about her feelings toward 

doctor ul 66> 11s 22 13 9 0 78 
7. Ask the doctor to tell her what to 

do 3 158 gi 78 215 7 0 72 
8. Tell doctor what other doctors 

have told her 0 49 0 51 30» 108 0 60 
9. Summarize her problems for each 

new doctor 0 2 1 97 2 4 0 94 
10. Tell about her physical condition 125 23> 17 48 80> 0 1 19 
11. Let doctor decide what they 

should talk about yi 13° 0 80 39> 0 248 37 
12. Be logical and unemotional in de- 

scribing her problems 2 138 1 84 71> 0 0 29 
13. Tell how she gets along with other 

people 15* 18> 15 52 47> 12> 0 41 


^ F value significant at 
» F value significant at 


p <.05, 

p <.01. 

ings for the patient behaviors in terms of per- 
centage of variance accounted for by each 
source of variance. Looking first at the thera- 
pist questionnaire results, there are more sta- 
tistically significant therapist effects than 
either patient main effects or interactions. The 
percentages of variance accounted for by pa- 
tient, therapist, and interaction effects was 
compared using the Friedman nonparametric 
analysis of variance (Siegel, 1956). The 
Friedman test is more appropriate for these 
data than the parametric analysis of variance 
because the assumptions of equal variance and 
independence required by the parametric tests 
cannot be met. The results indicate that the 
patient, therapist, and interaction percentages 
of variance do differ significantly (x2, = 11.6, 
p< 01). 

Comparison of the average percentage of 
variance accounted for by the different effects, 
using the nonparametric Wilcoxon test (Sie- 
gel, 1956), shows that therapist main effects 


account for more variance than either patient 
main effects (z = 2.55, p < .01) or interac- 
tions (z = 2.34, p < .05), and that the pa- 
tient main effects and interactions do not dif- 
fer significantly from each other (z = .82). 

For the behavior ratings on patients, there 
are more statistically significant patient main 
effects than either therapist main effects or in- 
teractions. A comparison of average percentage 
of variance accounted for by each effect shows 
that average patient, therapist, and interac- 
tion effects are significantly different (x° = 
15.3, $ < .001), and that the patient main 
effects are, on the average, larger than the 
therapist main effects (z = 2.98, p < .01), 0 
interactions (z = 3.18, p < 01). The thera 
Pist effects do not differ significantly from the 
interactions (z = 1.41). The therapist main 
effects are the strongest in the questionnaire 
results while the patient main effects are 
strongest in the behavior ratings. 

It is also possible to compare directly the 
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main effects and interactions in therapist 
questionnaire and independent behavior rat- 
ings by computing from their F values an es- 
timate of the noncentrality parameter of the 
noncentral F distribution (Scheffé, 1959). 
Using these estimates, we can compare the F 
statistics for patient main effects, therapist 
main effects, and interactions between the 
questionnaire and behavior ratings. These 
comparisons show that the average patient 
main effects are larger in the behavior ratings 
than in the questionnaires (z = 3.04, p < .01) 
while the average therapist main effects are 
larger in the questionnaire results than with 
the behavior ratings (z= 2.76, $ < .01). 
There is no statistically significant difference 
between the two interaction values (2 = 1.89). 

In summary, Table 1 indicates that differ- 
ences among therapists are very important in 
how therapists think patients should act, 
while differences among patients are more im- 
portant in how patients do act in psycho- 
therapy sessions. It is interesting to note at 
this point that the four behavior ratings 
which do show significant therapist main 
effects (2, 5, 8, 13) all concern the content of 
what the patients talk about and that none 
of the ratings about the manner in which the 
patients act towards the therapist (1, 4, 7, 11, 
12) show significant therapist main effects. 
There is some evidence that therapist differ- 
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ences influence the content of what patients 
discuss more than the manner in which pa- 
tients present that content. Finally, we can 
note that interactions are not large in either 
set of data. 

Table 2 reports results from therapists’ 
questionnaires and independent behavior rat- 
ings for therapist behaviors. The question- 
naire data about desired therapist action show 
approximately an equal number of statistically 
significant patient main effects and therapist 
main effects with relatively few statistically 
significant interactions. Again, using Fried- 
man’s nonparametric analysis of variance to 
compare the average percentage of variance 
accounted for by the different effects, the find- 
ings are that the three effects do not differ 
significantly (x*r = 3.71). 

The independent behavior ratings show evi- 
dence only for patient main effects and no 
evidence of statistically significant therapist 
main effects. The Friedman test shows that 
the three effects differ significantly from each 
other (x2, = 8.86, p < .02), and further anal- 
ysis utilizing the Wilcoxon test shows that the 
patient main effects account for more variance, 
on the average, than do therapist main effects 
(z = 2.37, p < .05) or interactions (z = 2.20, 
p < .05), while the therapist main effects do 
not differ from the interactions (2 = 1.75). 
The transformation of the F values to a non- 


TABLE 2 
THERAPIST BEHAVIORS: PERCENTAGE OF VARIANCE ACCOUNTED FOR BY EacH SOURCE 


Therapist questionnaires 


Behavior ratings 


SS on 


Therapist behaviors 
3 Ther- | Inter- H É Ther- | Inter- ais 
Patient apist | action Within | Patient apist | action Within 
— Eee theory) (Seis 
1, king at he: 
Sre aR ways of looking at her A ig i p n z 
25 bout he 
pee, to do about ber Re A an b $ i 
3. Help her to understand why her 
personality is the way it os 28> 48 33% 0 2 65 
4, Speak to other people with whom 
she has difficulties 24> a oe ik 0 5 
5. Mostly listen to what she has to say 1% 
6. Encourage and support her in what ai 5 a i Ea 
she wants to do 5 as 4 5 sf 
7. Give her medicine 50> o 16 28 
s: it < 05. i 
Saee 
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central parameter estimate was again used to 
compare directly the questionnaire and be- 
havior ratings. The results show no statisti- 
cally significant differences between the pa- 
tient main effects in the two measures (2 = 
1.18), nor between the interactions in the two 
measures (z = 1.35), but do indicate that the 
therapist main effects are stronger in the 
therapist questionnaires than they are in the 
behavior ratings (z = 2.37, p < .05). In con- 
clusion, patient main effects are strong both in 
how therapists think therapists should act and 
also in how therapists do act. However, thera- 
pist main effects are more evident in the 
thinking of therapists than in their indepen- 
dently rated behavior. 

Returning now to the question of whether 
our findings were influenced by the reliability 
of the behavior ratings, two points should be 
made. First, rater reliabilities would influence 
the residual error terms in our analyses of 
variance, not the relative sizes of main effects 
and interactions with which we were con- 
cerned. Second, by transforming the F values 
to a noncentral parameter estimate, we were 
able to compare the questionnaire and behav- 
ior rating results independent of any residual 
error-term differences. The comparisons of 
main effects and interactions reported above 
are independent of rater reliabilities. 

Since rater bias may have contributed to 
the main effects observed in the behavior rat- 
ings of both patient and therapist behaviors, 
four additional raters rescored first meeting 
tapes from four patients randomly chosen 
from the seven study patients. These 16 tapes 
were assigned, four each, to the four new 
raters in such a way that each rater scored 
each patient-therapist combination only once. 
This procedure eliminated the possibility that 
main effects would be due to a rater imposing 
a constant stereotype for how a therapist or a 
patient behaved from one interview to the 
next. A 4 X 4 matrix with one replicate per 
cell was used to analyze both these ratings and 
ratings of the same meetings from the main 
study. The F values for each behavior rated 
were transformed to an F nhoncentrality 
parameter estimate and the transformed val- 
ues for the two sets of ratings were compared. 
The average main effect and interaction val- 
ues do not differ significantly between the 
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two sets of ratings for either patient or thera. 
pist behaviors. There is no evidence that rate 
bias contributed substantially to our main 
effect findings. 


Discussion 


Our behavior rating findings, in common 
with Moos and Clemes’, indicate that thera 
pists are more influenced by patients than pa 
tients are by therapists. This is indicated by 
the strong patient main effects and relatively 
weak therapist main effects in ratings of both 
patient and therapist behaviors. These find- 
ings differ somewhat from those of Van det 
Veen who reported strong therapist main ef- 
fects in his measures of therapist behaviors, 
These differences may be due to differences in 
the behaviors rated or in the kinds of Ss uti- 
lized in the two studies. This study also indi- 
cates that one area of therapist influence on 
patient behaviors appears to be in the content 
of what patients discuss. Interactions play a 
negligible role in all of the behavior ratings, a 
finding consonant with both Van der Veen 
and Moos and Clemes’ results. 

The therapist questionnaire findings show 
an important contrast to those with behavior 
ratings. While there is evidence for patient 
differences influencing questionnaire responses, 
there is also strong evidence for the influence 
of therapist differences, independent of the 
patient they are seeing. Interactions again 
play a negligible role in these findings. 

Direct comparison of the size of the main 
effects and interactions between the question- 
naires and behavior ratings, using a transfor- 
mation of the F values to a noncentral pa- 
rameter estimate, indicates that differences 
between therapists carry more weight in the 
questionnaires than they do in the behavior 
ratings for both therapist and patient behav- 
iors. This indicates discrepancy between what 
therapists think should happen in psycho- 
therapy and what does happen and that, in 
part, these discrepancies are due to the fact 
that differences between therapists are more 
important in the thinking of therapists than 
they are in the behavior of either therapists 
or patients. 

__ Tn discussing the implication of these find- 
ings, it is important to consider their gen- 
erality. Clearly, we had a limited sample of 
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both therapists and patients and, even more 
important, we were studying an unusual kind 
of therapy where patients expected to see a 
different therapist each week. It may be, for 
example, that long-term intensive psycho- 
therapy provides greater opportunity for con- 
sistent differences between therapists (i.e. 
therapist main effects) to become evident in 
therapist behaviors. Because our research de- 
sign requires this somewhat unusual pattern 
of patient-therapist contacts, it would be diffi- 
cult to replicate these findings in other types 
of psychotherapy settings. 

While recognizing these limitations, we can 
also note that our results are consistent with 
findings reported in other settings and that 
they can suggest hypotheses to be investigated 
in other settings as well. For example, the 
finding that differences in how therapists 
think about their behavior are not evident in 
ratings of their behavior in therapy sessions is 
consistent with Fiedler’s (1950) finding that 
therapists from different theoretical persua- 
sions do not differ in how they behave in 
therapy sessions. Since many of our questions 
dealt with issues independent of theoretical 
schools, this suggests that Fiedler’s findings 
may be just one example of a wide range of 
discrepancies between therapists’ styles in 
thinking about psychotherapy and what, in 
fact, transpires in psychotherapy. 

One source of these discrepancies may be 
that the therapist role includes, within it, dif- 
ferent expected behaviors in different settings. 
Two such settings are the psychotherapy in- 
terview and the rehash and discussion sessions 
with fellow psychotherapists. In the psycho- 
therapy interview, the therapist is expected to 
be sensitive and responsive to patient differ- 
ences. He learns, both through observing €x- 
perienced therapists and through being ob- 
served himself, that the good therapist is 
constantly alert to what the patient is feeling 
and communicating. Our behavior rating data 
showed that this is, in fact, what occurred in 
the sessions we studied. In the rehash and 
discussion sessions, on the other hand, thera- 
pists must learn a somewhat different set of 
skills in dealing with their peers and super- 
visors. In these sessions, which are particu- 


larly prevalent during residency training 
such as the therapists in our study were un- 
dergoing, they must constantly explain their 
actions to peers and supervisors. This pres- 
sure of public explanation may encourage 
psychotherapists to develop a consistent 
framework with which they can quickly or- 
ganize an explanation for their behaviors. As 
a therapist finds one framework useful for 
such presentation, he may repeat it until it 
becomes his style of thinking about therapy. 
Since these explanations are, for the most part, 
ex post facto, they need not correspond to 
what in fact occurred in the sessions. What is 
more important is that they communicate to 
the audience of fellow professionals that the 
speaker has a sophisticated grasp of the con- 
cepts and rationale of psychotherapy. 

In conclusion, our study indicates that in 
this brief contact clinic setting and for the 
behaviors we measured, differences between 
patients are a major determinant of both 
patient and therapist behaviors, Consistent 
differences between therapists, on the other 
hand, are more evident in how therapists 
think about patient and therapist strategies to 
achieve psychotherapeutic goals than they 
are in either therapist or patient behaviors 
during therapy interviews. We have suggested 
that these discrepancies between behavior rat- 
ings and therapists’ questionnaire responses 
may be due to differences in role expectations 
for psychotherapists when they are with pa- 
tients and when they are with professional 
colleagues. 
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SUICIDE AND AGE: 
A SUICIDE NOTE ANALYSIS 


ALLEN R. DARBONNE?* 


Veterans Administration Center, Los Angeles 


The present study tested the hypothesis that suicidal ideation varies with age. 
259 suicide notes were analysed. This represented all of the suicide notes written 
by individuals who committed suicide during a 3-year period who resided in 
the 34 moderately advantaged natural communities of Los Angeles County. 
The method of analysis involved categorizing each note according to a scheme 
of five general categories: (a) Addressee, (b) Reasons Stated, (c) Affect Indi- 
cated, (d) Specific Content, and (e) General Focus of Note. The notes were 
divided into four age groups—20-39, 40-49, 50-59, and 60 and over, and com- 
pared with reference to the five general categories. Differences between age 
groups were found significant at the .05 level for Categories b and e. Specific 
items within each category were discussed when such items were found sta- 
tistically more or less often in the notes of one age group than in the notes of 
the other age groups. Suggestions for treatment and prevention of suicide were 


offered. 


Despite effort and theory which began long 
before Freud and Durkheim, one of the most 
pressing questions of society remains un- 
answered. Why does man so often choose to 
take his own life? Each year in the United 
States at least 25,000 persons make this 
choice, resulting in suicide ranking among the 
top 10 causes of adult deaths. 

There seems to be a growing agreement 
among current investigators of suicide that 
present theories do not adequately handle the 
data. A need has been expressed for an ampli- 
fication of current conceptual systems to bet- 
ter account for the many variables related to 
this phenomenon. Among these variables is 
the factor of age. Statistics clearly indicate 
that suicide is not restricted to any specific 
age group. Farberow and Shneidman (1957) 
pointed out in a theoretical review that there 
has been no differentiation in suicide theories 
for age of the suicidal person and indicated 
the need for this on the basis of their re- 
searches, 

An understanding of the role of age in sui- 
cide would seem to be a desirable early step 
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in the development of an adequate theory. 
Suicide is frequently viewed as a result of a 
combination of the individual’s psychic drives 
and environmental factors. Inasmuch as both 
intrapsychic needs and environmental stresses 
can be reasonably expected to vary for differ- 
ent age groups, it can be assumed that sui- 
cidal ideation within the groups will reflect 
these differences. 

The present study is an attempt to test this 
assumption and to investigate further the re- 
lationship of age to behavior in suicidal indi- 
viduals. The analysis of suicide notes (written 
shortly before individuals take their lives and 
sometimes as they die) has proved itself an 
important method in obtaining data regarding 
ideation and affect of suicidal persons. The 
necessarily projective quality of such a com- 
munication lends itself well to psychodynamic 
interpretation. Although only about one sui- 
cide in three leaves a note, studies by Shneid- 
man and Farberow (1960) and Tuckman, 
Kleiner, and Lavell (1959) indicated no sig- 
nificant differences between persons who leave 
notes and those who do not. Tuckman et al. 
(1960) investigated the validity of notes and 
reported an agreement with information from 
an informant in over 90% of the cases. Reli- 
ability of suicide note analysis has also been 
reported to be acceptably high. 
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METHOD 


The suicide note analysis technique is the method 
used in the present investigation. Two hundred and 
fifty nine notes written by 156 individuals were ana- 
lyzed. This number represents the total number of 
suicide notes written by men and women who com- 
mitted suicide during a 3-year period and who re- 
sided in the 34 moderately advantaged natural com- 
munities of Los Angeles County. 

The use of socio-cultural-economic area types as 
study areas is described in detail by Shneidman and 
Farberow in their 1960 report of a socio-psychological 
investigation of suicide. Therefore, only a brief de- 
scription will be given here. Based on the census data 
of Los Angeles County, the Los Angeles Welfare 
Planning Council (1958) studied the county and 
divided it into 100 relatively stable, homogeneous 
“study areas.” Basis for division was average educa- 
tion, income, home ownership, population, and 
number of people living alone. These 100 areas were 
redistributed among nine area types on the basis of 
“urbanization and social rank.” 

The group of moderately advantaged natural com- 


munities, designated Area Type V, was selected for 
this investigation. Its 34 communities seemed to rep- 
resent best the average community of Los Angeles. 
The total Area Type V population was roughly 1,700,- 
000, which represented 31% of the county’s popu- 
lation. It is composed of 31% under age 20, 627% 
aged 20-65, and 7% over age 65. The annual ratio of 
number of suicides to population in this area was 14 
per 100,000, which was identical with the average 
suicide ratio for the county. 

The suicide note writers were divided into four 
age groups, 20-39, 40-49, 50-59, and 60 and over. 
The age groups were selected in an attempt to re- 
flect different stress periods in a lifetime. 

The technique employed in note analysis is a 
minor modification of that used previously by 
Farberow and Shneidman (1960). The method: of 
analysis involved categorizing each note according to 
a scheme of five general categories: (a) To whom 
the note was addressed—interpersonal involvements, 
dyadic relationships, etc.; (b) Reasons for suicide 
explicitly stated in the note—conscious reason for 
victim killing himself; (c) Affect indicated or im- 
plied—varied emotional expressions; (d) Content 


TABLE 1 
OUTLINE For SUICDE Note ANALYSIS 


A. Addressee of suicide note 
1. Personal affiliation (family or friend) 
2. Impersonal affiliation or no addressee (police, 
to whom it may concern, or other) 


B. Reasons stated in suicide note 


11, No reason stated 

12, Il health, illness, physical disability, symptoms, 
pain 

13, Rejection by another, jilted, unloved, not 
understood, can’t live without you, love tri- 
angle (other man, woman) 

14, Finances, money, bills, debts, job, occupation, 
unemployment 

15. Ennui, tired of life, as a way out, reached end, 
couldn’t go on 

16. No point in living, not worth trying 

17. Interest in death, other world, hereafter 

18. To “join” a (deceased) loved one 

19. Isolation, loneliness y 

20. Confusion, depression, fear, anxiety Ţ z 

21. Being persecuted, hearing voices, losing mind 

22. Sex 

23. Other (specify) 


C. Affect indicated in suicide note 


31. No affect 

32. Hostility, criticism, blame, revenge | 

33. Absolution of other, giving forgiveness (of 
specific person) 

34, Sorrow, seeking 
persons) e 

35. Seeking forgiveness from deity 


forgiveness (from specific 


C. Affect indicated in suicide note (Continued) 


36. Self-depreciation, self-derogation, self-criticism, 
guilt, self-blame (fault) 

37. Love, idealization, praise, defense 

38. Other (specify) 


D. Specific content other than affect 


51, Nospecificcontent, other than affect 

52. Mention of religion, fate, life, world, death 
(abstraction) 

53. Goodbye, farewell 

54, Reference to suicidal act, no one responsible 

55. Reference to suicide note 

56. Instructions re money, business, power of at- 
torney, funeral expenses 

57. Instructions re insurance 

58. Instructions re material possessions 

59, Instructions re children 

60. Instructions re own remains 

61. Instructions re notification of others 

62. Instructions re message to others 

63. Other (specify) 


E. General focus of the suicide note 


71, Primarily reason for suicide 

72. Primarily affect 

73. Primarily instructions 

74, Primarily abstractions 

75. Primarily content other than affect 
76. Primarily reflecting own confusion 
77, Extremely short, cryptic, enigmatic 
78. Other (specify), no content 
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other than affect—victim’s main concern just before 
killing himself; and (e) General focus of the note— 
overall tenor of the last written communication. 

Each general category was subdivided into several 
more specific descriptive scoring categories by which 
the notes were analyzed. Table 1 lists these scoring 
categories. Every note was scored according to all 
five general categories. The four age groups were then 
compared with respect to the characteristics of each 
category. 


RESULTS 


After each note was assigned a score accord- 
ing to the five general categories listed in 
Table 1, a chi-square analysis was done to de- 
termine the relationship between the four age 
groups and each of the five general categories. 
Where a specific item had a frequency so small 
as to be misleading, it was combined with the 
“other” category before computation of the 
degrees of freedom and the chi-square value. 
The result of this analysis was that “reasons 
for suicide” and “general focus of the suicide 
note” were found to be related to age at the 
.05 level of significance. Table 2 indicates the 


TABLE 2 


RELATIVE FREQUENCIES OF CATEGORIES WITHIN 
Surcme Notes OF DIFFERENT AcE Groups 


Age group 


Category 
20-39 | 40-49 | 50-59 |60 and 
over 


Reasons stated 

None 

Health 

Rejection or love tri- 
angle 

Tired of life as a way 
out 

Isolation or loneliness 

Other 

Affect indicated: 

None 

Hostility 

Absolution of others 

Sorrow 

Seeking deity’s for- 
giveness 

Self-depreciation 

Affection 

Other 

General focus of note: 

Primarily reason 

Primarily affect 

Primarily instructions 

Other 

c 


High | Low 


Low 
High 
High 


Low 


$ 
specific scoring items (of the statistically sy . 
nificant categories) which were found ing 
relatively high frequency or a relatively Loy, 
frequency in one age group as compared toal 
other age groups. | 


DISCUSSION 


A comparison within each age group of 
notes suggests that the categories which ay 
statistically related to age have a certain dé 
gree of consistency in meaning. They permit 
interpretation which may have further imple 
cations for suicide prevention and therapeutit 
approach. In some of the statements which 
follow, the affect category is also mentionél 
in relationship to the categories of specifi 
reason and general focus of the suicide note 
The affect category is considered in this dis 
cussion because there was a very strong trend 
toward establishment of a relationship between 
age and affect in the suicide note (p < .10), 
Because of the exploratory nature of the note 
analysis technique and the critical nature of 
the problem under study, it was felt that some 
reference should be made of this observed 
trend. 

The problems which seem to overwhelm thé 
young adults (20-39) more frequently than 
the older individuals were related to feelings 
of rejection and especially problems concern 
ing heterosexual love objects and competition 
for them. There is a notable absence of rea- 
sons related to illness and pain, They could 
probably cope with the world physically but 
seemed unable to cope with their problems 
concerning interpersonal relations, and some 
of the deeper psychic problems related to set- 
ual needs, At their dying moments these pe 
ple seemed both willing and able to express 
affect in their suicide notes. A closer look at 
this affect indicates it is primarily the in- 
ternalization of blame and guilt and self- 
depreciation. At the same time they express 
love and praise for others. Seeking forgiveness 
from the deity was less frequent in their notes 
than in the notes of older suicides. Suicide 
Prevention with this type of individual might 
be best accomplished through dynamic psy- 
chotherapy. This could help the individ 
gain insight into feelings which stem from 
early object relationships and which precipi- 
tate his recurrent interpersonal problems. 
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Healthy expression of affect should be en- 
couraged. Where the stress is marital and 
where it is apparent that the patient loves and 
praises his spouse but feels unworthy and 
misunderstood the partners might be brought 
into the therapy. This is a typical note from 
this sample, written by a 28-year-old woman: 


Dear Mom: 


I've tried very hard to hide it, but I’m still as much 
in love with Bill as always. I don’t know how he feels 
about me, but my life is in his hands now. He knows 
I can’t go on without him and if you read this it 
means he didn’t care enough to stop me. I’m sorry 
to have failed you this way, but I love Bill so much, 
Tm not quite sane, let alone intelligent without him. 
It’s better for the babies that I just check out this 
way than for me to keep on being so unhappy 
around them. There’s a letter somewhere in my grey 
suitcase for Tim when he is 16. 


All my love forever, 
Mary 


Individuals from ages 40 to 49 who com- 
mitted suicide left a communication of the 
vanquished. They seemed unequal to the de- 
mands of life and were tired or bored and 
wanted a way out. They entered the period of 
the change of life already stating they had 
reached the end and couldn’t go on. This 
group looked beyond life to the deity and pe- 
titioned for forgiveness. A combination of 
individual and group psychotherapy would 
possibly be most effective in these cases. The 
aim here might be a more accurate assess- 
ment of the self. It is likely that many of these 
people have spent their earlier years attempt- 
ing to achieve above their potential or operat- 
ing below it. The result is a defeatist feeling 
and/or guilty thoughts of a wasted life. In 
either case the future is viewed with pessi- 
mism. Closing the gap between self-ideal and 
self would be an immediate goal of the ther- 
apy. The special interest of this age group in 
the deity should be explored. In some cases, 
and especially in crisis therapy, important 
psychological support may be gained from the 
religious convictions of these people. An ex- 
ample of a note from this age group was writ- 
ten by a 43-year-old man. 


Please God forgive me, never again after this is 
over will I hurt anyone. I love my wife and chil- 
dren too much to continue on not knowing when Tl 
do something again to hurt them . . - My back will 
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soon give me real trouble and then I'll be worthless to 
everyone—Something is wrong with me inside and 
Tve tried to find out just what that something is as 
far back as I can remember. I’m lost, If I continue as 
T have in the past my family will not have a chance. 
Now they can. 
Til pay for all when my maker sees me. I cannot 
continue to hurt people here on earth and I will not. 
Tm not weak. I must die. 

Bill 
PS. God try to forgive me. I believe in God and 
always have—a person no good like myself should 
leave this earth. 


With the 50-59-year-old suicides, an inter- 
esting finding was presented, Even though 
responding to a need to write a final com- 
munication, they less frequently than any 
other group gave a reason for their suicide. 
This group’s use of the note to give instruc- 
tions and to convey information and unemo- 
tional messages may indicate a plausible ex- 
planation. Expression of affect was found sig- 
nificantly less than in other notes. Possibly 
these people have arrived at the conclusion 
that others would not care or understand their 
feelings. They may have decided it is better 
not to permit themselves emotional involve- 
ment, Another reason could be that these are 
the individuals who are beginning to experi- 
ence failing health and other “more rational” 
reasons for suicide. They have given their act 
a great deal of thought and within their per- 
sonal logic have concluded death for them- 
selves. Until more is known about this group, 
they will present a difficult problem for pre- 
vention. Depth therapy seems less likely to 
succeed here than with the previous groups. 
These individuals would probably be reluctant 
to accept involvement in a therapeutic rela- 
tionship. When the therapist does treat such 
patients, he may find active environmental 
manipulation much more his role than usual, 
If one relates significant world events to the 
lifetime of these suicides, it is not too diff- 
cult to imagine from whence they learned to 
be realists. These were the Americans who 
met World War I as teenagers, the Depres- 
sion during the most productive years of their 
lives, World War II as they approached mid- 
dle age, and then saw their children called for 
the Korean conflict. Certainly treatment 
would need to consider the effect of such 
events on the individual. An example of a 
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note from this age group was written by a 
50-year-old woman: 


To Whom It May Concern, 


I, Mary Smith, being of sound mind, do this day, 
make my last will as follows—I bequeath my rings, 
Diamond and Black Opal to my daughter-in-law, 
Doris Jones and any other of my personal belong- 
ings she might wish. What money I have in my sav- 
ings account and my checking account goes to my 
dear father, as he won’t have me to help him. 

To my husband, Ed Smith, I leave my furniture and 
car. 

I would like to be buried as close to the grave of 
John Jones as possible. 


Mary Jane Smith 


The age group 60 and over attempted to 
answer the question “why” more than any 
other age group. Their reasons more fre- 
quently than those of younger suicides cited 
illness, pain, and physical disability which 
accompanied their increasing years. They also 
most often told of loneliness and isolation. 
Their frequently expressed affect of sorrow 
and seeking forgiveness from specific persons 
seemed most often related to having caused 
trouble to others, There is an absence of state- 
ments of self-depreciation and guilt. They 
apparently retained good thoughts of them- 
selves, but sensed rejection by others. Though 
these individuals had lived the longest, their 
notes indicated that they were no more tired 
of life than any other age groups. This seems 
to suggest that for individuals at this age, once 
the situation becomes critical, environmental 
and milieu therapy combined with medical 
pain relief would be most useful. Family 
therapy may also be advantageous. This age 
group presents a special problem for preven- 
tion, if only for the reason that they con- 
tribute more than their share to suicide sta- 
tistics. Preventative programs should take into 
consideration the psychological side effects of 
compulsory retirement rules, the importance 
of educating adults to understand and meet 
the needs of aging parents, and the need for 
general medical staffs to be cognizant of psy- 
chological aspects of physical ailments among 
the old and the emotional counterparts of 
physical aging. This is the note of a 71-year- 
old man in this group: 
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John, I do not seem to be wanted living with you 
and Betty so the best thing I can do is put myself 
out of the way. When you brought up about the 
large telephone bill you had sure upset me. I have 
not been using your phone so I do not understand 
why you said that . . . So goodbye to you all and 
please forgive me. I will never get any better at my 
age so what is the use of trying to go on. . . just 
a lot of suffering and expense. Say goodbye to 
Martha Smith for me and treat her right. You 
know we all have our faults when we get older. She 
has been very nice to me for what she has done since 
Mom died. I have had this heart trouble since 1948 
and it sure is not going to get better. I do not 
want to spend several months or years in a rest home, 
Good Bye. Dad 


Two categories of reasons for suicide were 
relatively absent from the notes of all age 
groups. These were reasons related to fi- 
nances and reasons indicating confusion or 
bizarre psychotic thoughts. Further investi- 
gation is necessary to determine the full mean- 
ing of this finding. 

The evidence presented supports the hy- 
pothesis that type of suicidal ideation, includ- 
ing conscious reason for the act, varies with 
age of the suicidal person. These results are 
in accord with the view that suicidal behavior 
is a symptom of various kinds of personality 
disorganization and not the result of a single 
suicidal personality syndrome. Attention must 
be paid to the effect of age variables upon 
such disorganization in order to enhance sci- 
entific understanding and increase effective 
prevention of suicide. 
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The Defense Mechanism Inventory, a paper and pencil test which purports to 
measure the relative intensity of usage of five major groups of defenses, is 
described. The inventory consists of 10 brief stories, two per conflict area, 
followed by four questions regarding S’s actual behavior, fantasy behavior, 
thoughts, and feelings in the situations described. Five responses typifying the 
five defenses (ie., hostility-out, projection, principalization, turning-against-self, 
and reversal) are provided for each question, from which S$ selects the one 
most representative and the one least representative of his reaction, Reliability, 
validity, and normative studies are presented, all indicating that this inventory 
has many potential uses as a clinical and research instrument. 


Increased preoccupation with the theory of 
ego processes during the last few decades has 
led to greater concentration on operational 
definition, classification, and measurement of 
various ego functions. Recently, several in- 
vestigators (e.g., Haan, 1963; Kroeber, 1963) 
have made progress in differentiating and 
assessing defensive and adaptive ego function- 
ing. However, there is still much need for ob- 
jective studies in this area. This preliminary 
report on the construction and validation of a 
new instrument designed specifically for the 
measurement of five general defense mecha- 
nisms should be, therefore, of considerable 
interest to researchers and clinicians. 

Many defense mechanisms have been identi- 
fied, some with rather tenuous differences. 
This multiplicity has led to attempts to 
achieve a more parsimonious classification 
system (e.g, Blum, 1953; Hilgard, 1949; 
Miller, 1953; Miller & Swanson, 1960). Most 
such systems, however, have not provided 
clear-cut criteria by which the various defense 
Mechanisms might be grouped. The need for 
such a grouping is apparent both in the area 
of measurement and research and in the clini- 
cal situation where one may wish to assess 
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the major defense mechanisms of an indi- 
vidual. When defenses are evaluated clini- 
cally, either from interview material (Raines 
& Roher, 1955) or from projective test proto- 
cols (Filer, 1952), consensus is minimal, In 
those instances where substantial agreement 
has been obtained from projective techniques 
(e.g, Gardiner, Holzman, Klein, Linton, & 
Spence, 1959), judges inferred a specific set 
of defenses on the basis of careful training in 
the scoring procedure, 

Only a few objective scales have been de- 
veloped to measure defenses. The most popu- 
lar are the Rosenzweig Picture Frustration 
Test, the Blacky Defense Preference Inquiry, 
and Byrne’s Repression-Sensitization scale. 
All three have demonstrated some stability 
over time, but their usefulness as measures of 
the extent to which certain defenses are em- 
ployed is still open to question. Fry (1949) 
and Vane (1954), among others, question the 
validity of Rosenzweig’s (1950) Picture Frus- 
tration Test for assessing defenses. With re- 
gard to the Blacky Defense Preference In- 
quiry, Blum (1956) reports that avoidance 
is the only defense for which some validity 
has been established. While Byrne’s (Byrne, 
Barry, & Nelson 1963) Repression-Sensitiza- 
tion scale is supported by a number of posi- 
tive validity studies, a method that provides 
information on only two defenses is likely to 
be of limited value either for research pur- 
poses or for the clinician. Furthermore, we 
have found that the Repression-Sensitization 
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scale and Welsh’s (1956) Factor A correlate 
.97 for males and .96 for females in two inde- 
pendent samples of psychiatric outpatients. 
This finding raises a question as to whether 
the Repression-Sensitization scale measures 
any construct other than anxiety. 


THE CLASSIFICATION SYSTEM OF THE DMI 


Underlying the formulation of the Defense 
Mechanism Inventory (DMI) is the general 
assumption that the major function of de- 
fenses is the resolution of conflicts between 
what is perceived by the individual and his 
internalized values (Kroeber, 1963; Miller & 
Swanson, 1960). The conflict is resolved by 
a process whereby the ego attacks, distorts, 
or becomes selectively unaware of certain 
aspects of the internal or external world. 

On the basis of this a classification system 
was devised which appears economical and 
yet is general enough to encompass the most 
important defense mechanisms previously 
identified. Five clusters of defenses were 
delineated. 

1. Turning against Object (TAO). This 
class of defenses deals with conflict through 
attacking a real or presumed external frus- 
trating object. Such classical defenses as 
identification-with-the-aggressor and displace- 
ment can be placed in this category. 

2. Projection (PRO), Included here are de- 
fenses which justify the expression of aggres- 
sion toward an external object through first 
attributing to it, without unequivocal evi- 
dence, negative intent, or characteristics, 

3. Principalization (PRN). This class of 
defenses deals with conflict through invoking 
a general principle that “splits off” affect from 
content and represses the former. Defenses 
such as intellectualization, isolation, and 
rationalization fall into this category. 

4. Turning against Self (TAS). In this 
class are those defenses that handle conflict 
through directing aggressive behavior toward 
S himself. Masochism and autosadism are ex- 
amples of defensive solutions in this category. 

5. Reversal (REV). This class includes 
defenses that deal with conflict by responding 
in a positive or neutral fashion to a frus- 
trating object which might be expected to 
evoke a negative reaction. Defenses such as 
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negation, denial, reaction formation, and 
repression are subsumed under this category, 

The usefulness of this method of classifica- 
tion has been enhanced by recent findings 
from the field of perception. Specifically, 
Witkin and his colleagues report (Witkin, 
1965; Witkin, Lewis, & Weil, 1966) that field 
independent Ss are high on hostility out, 
projection, or isolation, whereas field depen- 
dents Ss are high on either repression or 
hostility turned inwards. Measurement of 
these five defenses by means of the DMI 
should permit further exploration of the rela- 
tionship between defenses and cognitive styles. 


DESCRIPTION OF THE DMI 


Twelve stories were developed, two for 
each of six conflict areas. The conflict areas 
tapped are authority, independence, masculin- 
ity (male form only), femininity (female 
form only), competition, and situational. The 
stories were structured in this manner to in- 
sure a broad coverage of the areas in which 
most persons encounter conflicts and to make 
possible an examination of the notion that 
persons will differ in the defenses they use 
according to the nature of the conflict. 

The Ss are given either the masculine or 
the feminine form of the DMZ. Each version 
contains 10 stories, After reading each story 
S is asked to respond to four questions cor- 
responding to four types of behavior evoked 
by the situation described in the story: (a) 
proposed actual behavior, (b) impulsive be- 
havior (in fantasy), (c) thoughts, and (d) 
feelings. Five responses are provided for each 
question, each response representing one of 
the five defense mechanisms listed above. The 
S marks a plus for the response most repre- 
sentative of his reaction and a minus for that 
least representative. This structured inquiry 
makes it possible to examine substantive 
questions regarding the consistency of a per- 
son’s defenses over various levels (i.e., actual 
behavior, fantasy behavior, thoughts, and 
feelings). 

The Tesponses marked with a plus sign by 
S are given the numerical value of two, those 
marked with a minus sign are scored zero, 
and the unmarked responses are given the 
value of one. Thus the score for any one 
defense can Tange from zero to 80, but the 
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sum of scores for the five defenses must equal 
200. A scoring template is superimposed over 
the answer sheet making it possible to sum- 
marize the numerical value of the five de- 
fenses across the four levels of behavior in 
3-5 minutes. To complete the test itself 
requires between 30 and 40 minutes. 


Preliminary Studies 


Five clinical psychologists? were given the 
names of the six conflict areas and were asked 
to classify the stories, assigning two to each 
area. Four accomplished the task in perfect 
agreement and in accord with the authors’ 
intent. The fifth psychologist did not note 
that he was to assign two stories to each 
area and combined the stories involving 
“authority” and “independence” into one 
category. All of the others were correctly 
classified. 

The response alternatives have been revised 
several times. In one study the five previously 
mentioned psychologists were asked to match 
each set of five responses with the five defense 
mechanisms according to the definitions pro- 
vided. Their judgments agreed with the key 
for all but 18 responses out of the 240. An 
examination of these alternatives revealed 
ambiguities and the responses were duly 
revised. In another study, the test was admin- 
istered to 42 Ss and the responses to each 
question were analyzed to reveal alternatives 
which were either too popular or unpopular. 
After this revision, a final check of the inter- 
item consistency and the popularity (social 
desirability) of the response alternatives was 
made using additional samples of 66 male 
and 52 female students. Equating the alterna- 
tives for social desirability tends to minimize 
the general effect of a set to make a good 
impression. 

When these DMI response revisions were 
completed, another group of clinicians* was 
asked to match the responses with the five 
defense mechanisms in accordance with the 
definitions provided. One clinician matched 
all but two responses with that given in the 


2 The authors want to express their thanks to Wil- 
liam Seeman, George Kisker, Virginia T. Graham, 
Kayla Springer, and Kenneth Kuntz. 

8The authors want to express their thanks to 
Michael Hirt, Richard Kurtz, and Leon I. Maizlish. 


scoring key; the second, all but four; and the 
third, all but 10 responses. At least two of the 
three clinicians agreed with the scoring key 
on each response. 

The effect of free- versus forced-choice in- 
structions on the spread and level of defense 
mechanism scores and on their generalizability 
to a universe of content was next investigated. 
The test was administered to two new samples 
of male and female undergraduate students at 
the University of Dayton and the University 
of Cincinnati. One group was given the 
forced-choice instructions; the other group 
was instructed to indicate true or false in 
front of each response and to put a plus sign 
in front of the one response that was most 
representative of the way the respondent 
would react. 

Figure 1 shows the average scores for males 
and females under free- and forced-choice in- 
structions. The profiles are strikingly similar 
despite the difference in raw score level. 
Generalizability of defense scores over a uni- 
verse of tests structured like this one was 
ascertained for each sample separately, using 
a mixed model analysis of variance to take 
into account the stratified nature of the test 
(Rajaratnam, Cronbach, & Gleser, 1965). 
Since the generalizability of scores obtained 
from a multiple-scored test using free-choice 
instructions is augmented by any general re- 
sponse tendency, as for example, a tendency 
to accept or reject relatively few alternatives 
(Bell, 1962), the coefficients of generalizabil- 
ity obtained under these instructions were 
corrected by removing the between-persons 
variance for the average score over all de- 
fenses. This correction resulted in more nearly 
comparable coefficients for the two types of 
instructions. These coefficients are presented 
in Table 1. The generalizability of the sepa- 
rate scores are very similar from one sample 
to another with two exceptions—the PRN 
scale in the male sample using free-choice 
instructions and the TAS scale in the female 
sample with forced-choice instructions. These 
coefficients are attenuated by the small vari- 
ance among persons in the samples tested. The 
error variances for these scales were no larger 
than for the remaining scales; hence the 
low coefficients are probably attributable to 
sampling fluctuations, Since there was no evi- 
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Fic. 1. Mean scores on defenses by sex and type of 
Tesponse. 


dence of any loss in differential information 
of the five defense mechanisms resulting from 
use of the forced-choice format and it had 
the advantage of eliminating any response 
bias not attributable to defenses per se, it 
was decided to use forced-choice in all further 
studies. 


Distribution of Scores 


The means and standard deviations of 
scores on the five defenses obtained on several 
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TABLE 1 


GENERALIZABILITY OF DMI Scores DERIVED 
FROM INTERNAL CONSISTENCY ESTIMATES 


Males Females 
PA Free re- | Forced | Free re- | Forced 
sponse* | choice | sponse* | choice 
TAO -78 60 83 
PRO .78 75 63 
PRN — 31 70 WD 
TAS 63 .73 21 
REV 17 56 65 
Total 
profile -70 .69 -71 


Note.—For Males, Free response, N = 15, Forced choice, 
N = 19; for Females, Free response, N = 16, Forced choice, 

= 27. 

a Coefficients are corrected for elevation factor. 


samples of Ss are presented in Table 2. 
Michigan University college students tended 
to obtain higher scores on TAO and TAS and 
lower scores on PRN and REV than did the 
general adult Ss. These latter were unsystem- 
atically selected Ss mainly of middle socio- 
economic status and employed as probation 
officers, social workers, vocational counselors, 
teachers, secretaries, college students, and 
housewives. Since the adult Ss were somewhat 
older than the college students it is possible 
that the observed trends are at least partially 
a function of age. Further evidence that TAO 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF THE Five DEFENSE MECHANISMS 
or THE DMI ror SEVERAL SAMPLES oF Ss 


Sophomore college students* 


General adult subjects 


Psychiatric outpatients” 


Defense Males Females 
Aa) SD 
pes 36.8 | 10.7 
ee 36.2 6.1 
F 45.6 6.2 
rae 39.6 7.4 
41.9 9.4 
Age 32.6 8.3 
Education 12.3 2.3 


Note.—For Sophomore college students, males, N = 226, 


226, females, x 
N = 71; for Paychiatric outpatients, males, N = 124, females, N = 1102 ~ 180% for General adult Ss, males, N = 43, females, 


a University of Michi; 
» Combined allo 


data for all outpatients to whom DMI was administered. 
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decreases with age while PRN and REV in- 
crease is indicated by the correlations between 
age and each of these scores in male and 
female outpatient samples. (See Table 3.) 

In both the college and the general adult 
samples and also in the psychiatric outpatient 
samples, males are significantly higher than 
females on TAO and significantly lower on 
TAS. Males are also consistently higher on 
PRO than females, and significantly so in the 
student sample. 

The male psychatric patients tend to have 
lower scores on TAO and PRO and higher 
scores on TAS and REV than do either of the 
nonpatient samples. 


Stability of Defenses 


The stability of defense scores over time 
was investigated in two small samples of 
male and female Ss. One group consisted of 
12 counselors who were administered the 
DMI before and after a T-group experience, 
an interval of a week. The product-moment 
correlations between the separate defense 
scores ranged from .85 for PRO to .93 for 
TAO. The average correlation was .89. The 
second sample consisted of 11 first-year psy- 
chology graduate students who were adminis- 
tered the DMI at the beginning and end of 
a course with 3 months intervening. The cor- 
relations for this sample ranged from .69 for 
PRN to .87 for TAO with an average of .76. 
These results are encouraging, but further 
studies of stability are needed. 


CONSTRUCT VALIDATION 


In order to provide a more stringent valida- 
tion of our classification system we provided 
a list of 15 defenses * in alphabetical order to 
three psychologists and seven social workers 
and asked them to match each of the 240 re- 
sponses of the DMI with one defense from 
the list. The judges were also allowed to mark 
in front of a response the name of a defense 
not mentioned in the list or note that in their 
opinion the response did not represent a de- 
fense mechanism. Analysis of the data indi- 


4 Avoidance, denial, displacement, identification 
with the aggressor, intellectualization, isolation, ne- 
gation, projection, rationalization, reaction forma- 
tion, repression, regression, reversal, turning against 
self, undoing. 


cated that there was satisfactory agreement 
with the key (more than 60%) on those re- 
sponses keyed TAS, REV, and PRN but not 
on responses keyed TAO and PRO. Almost a 
third of the judgments regarding TAO re- 
sponses were to the effect that these responses 
did not constitute a defense, while 19% of 
the judgments of responses keyed PRO indi- 
cated that they were not defenses and another 
14% were identified as defenses which we 
subsume under TAO. These results indicate 
that further revision is needed on some of 
the responses in these two defense categories. 


Relationship among Defenses 


It was expected that the five defense group- 
ings would interrelate systematically both in 
accordance with predictions derived from psy- 
choanalytic theory (Bellak, 1958; Fenichel, 
1945; Hartmann, 1952; Rapaport, 1951) and 
with previously published findings. From an 
empirical standpoint it was hoped, however, 
that these correlations would not be so high 
as to imply that only a single factor was 
being measured. 

The intercorrelations obtained for the pre- 
viously described samples of college students, 
unselected normal Ss, and outpatients, each 
stratified by sex, are presented in Table 3. 
The matrices are remarkably similar, indi- 
cating considerable stability in the relation- 
ship of defenses from sample to sample. 

TAO and PRO are positively correlated to 
a moderate degree (.29 to .63). This rela- 
tionship is in agreement with the findings of 
Caine (1960) and Peak, Muney, and Clay 
(1960). Lesser (1958) reports a positive cor- 
relation between hostility and anti-Semitism, 
presumably a form of projection. From a 
theoretical standpoint a positive relationship 
was expected since both modes of defense 
entail the expression of aggression. The only 
other positive correlation among the five de- 
fenses is between PRN and REV (.33 to .68). 
There is a paucity of pertinent research find- 
ings here. A related finding, however, is that 
of Chodoff, Friedman, and Hamburg (1964), 
who reported that parents of leukemia pa- 
tients tended to employ denial and isolation 
of affect as their major defenses. On a theo- 
retical basis, repression of affect is common to 
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TABLE 3 


INTERCORRELATIONS AMONG THE FivE DEFENSES OF THE DMI 
FOR THREE SAMPLES STRATIFIED BY SEX 


Sample TAO PRO | PRN | TAS | REV 
I. University of Michigan 
Shee to. 46 =.51 — 39 —.64 
PRO 45 —.48 —.37 —.58 
PRN —.60 —.34 —.24 40 
TAS — Al —.42 —.21 —.07 
REV —.67 —.66 45 06 
. General adult lation 
bee Tao .29 —.58 rs —.69 
PRO 31 — 44 -.17 —.56 
PRN —.66 —.61 —.29 33 
TAS —.31 = AS —.56 —.09 
REV -—.71 —.55 68 —.03 
TAO PRO | PRN | TAS | REV | Age | Education 
II. Outpatient 
Ei TAO p i 7" —02 
PRO ~.06 —.03 
PRN 21 «05 
TAS —.08 —.06 
REY, .23* <05 
Age -00 
Education —05 


Michigan sample, 


dult population, N71 Temas, Michlenn sample, N= 
a ul » N = 71 fer y 
triange, N = 67 males lower triangie w 


s 


mp 0l. 


both types of defenses, which would lead one 
to expect a positive relationship. 

Both TAO and PRO are substantially nega- 
tively correlated with PRN and with REV 
(—.44 to —.79). Negative correlations of hos- 
tility or projection with some of the specific 
defenses subsumed under reversal have been 
reported by Goldstein (1952), Peak et al, 
(1960), and Shipman and Marquette (1963). 
The negative correlations between hostility or 
projection and principalization have not been 
reported elsewhere to our knowledge but 
might be predicted on the basis that both 
hostility and projection are considered more 
primitive, immature responses to conflict than 
is principalization (Bellak, 1958; Fenichel, 
1945; Rapaport, 1951). TAS is negatively 
correlated to some slight degree with each of 
the other defenses except REV. TAS and 
REV are independent. Relevant research on 


fe 
43 males lower triangle; 


upper tangle, 
O] 


N = 226 males lower triangle; for General 
Outpatient sample, NV = 93 females upper 


the relationship of TAS to other defenses is 
meager and even conflicting. For example, 
Mussen and Naylor (1954) found that ag- 
gression directed outward and that directed 
inward tend to coexist to a large degree, while 
Purcell (1956) reports findings of a negative 
relationship between them, 


Relationship between Scores on the DMI and 
MMPI Scales 


MMPI data were obtained on 67 male and 
93 female psychiatric outpatients who had 
also been administered the DMJ. The MMPI 
responses were scored on all original scales 
as well as for some of the better-known ex- 
perimental scales. Complete matrices of inter- 
correlations were obtained for males and 
females separately. A portion of these results 
is shown in Table 4, A 


All of the DMZ scales except PRO have 4 
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sizable number of MMPI correlates. TAO is 
positively correlated with scores on F, Pd, 
Sc, and Ma and negatively correlated with L. 
Furthermore, TAO is positively correlated 
with Mf and A for males but not for females. 
PRO has a somewhat similar pattern of cor- 
relations with the MMPI scales but the corre- 
lations are smaller and generally not signifi- 
cant for one or the other or both sexes. PRN 
and REV, on the other hand, are negatively 
correlated with F, Pd, Pa, Pt, and Sc. 
Social introversion (Si) and anxiety (A) are 
negatively correlated with REV for males and 
with PRN for both sexes. Both these scales 
are positively correlated with TAS as is D. 
Scores on Barron’s (1953) ego-strength scale 
are negatively correlated with TAS. These 
patterns of correlations are for the most part 
consistent with theoretical predictions. 

The correlates of L and K are of particular 
interest since these scales are sometimes con- 
sidered to be measures of a set on the part 
of S to appear socially desirable. In general, 
Land K are negatively correlated with TAO 
and PRO, and positively correlated with REV, 
particularly in the male sample. 


If the correlations are attributed to a com- 
mon factor of social desirability it would 
imply that the more socially desirable re- 
sponses in the DMZ are the REV alternatives 
while the least desirable are the TAO and 
PRO responses. But then we have the peculiar 
situation that psychiatric outpatients obtain 
a more socially desirable profile than do 
general adults or college sophomores, (See 
Table 2.) A more likely explanation for the 
correlations is that they are due to a common 
factor of evasiveness or denial (versus forth- 
rightness) which is revealed both in S’s atti- 
tude about himself (MMPI) and his handling 
of conflict situations (DMI). 


Relationships between Defenses from the 
DMI and Haan’s Defense Scales 


Haan (1965) developed scales for eight 
defenses using MMPI items. Her definitions 
differed somewhat from ours and the scales 
have never been cross-validated but it seemed 
reasonable to explore their relationship to the 
DMI scales. Hence scores were obtained on 
these scales for the above-mentioned psychi- 
atric clinic patients and also for a sample of 


TABLE 4 


CORRELATIONS BETWEEN DMI anp MMPI SCALES FOR 67 MALE 
AND 93 FEMALE PSYCHIATRIC OUTPATIENTS 


MMPI 

variables 
L 
PF 25% 
K —21 
Hs -17 
D —03 
Hy —03 
Pd 19 
Mj 38** 
Pa 19 
Pi 18 
Se 34** 
Ma 20 
Si 16 
A (Welsh) 25* 
Es (Barron) 20 


Se points omitted for ease of reading. 


DMI Scales 
TAS REV 
M F M F 
05 02 sa 24* 
10 37%% | —36** | —36** 
-11 —26** 39** 09 
07 24* 23 —01 
42** 24* | —15 —06 
06 16 01 01 
08 18 —23 —23* 
—05 —14 —50** | —05 
06 46** | —25* | —21* 
22 368% | —25% |! = 12 
10 4i** | —34** | —28** 
—23 f| —01 =21 =i 
32** | 32%* | —35** | —04 
23 } 36** | —39%* | —06 
—34** | —35** | —03 —02 
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23 male and 24 females patients at another 
outpatient clinic. Those correlations which are 
consistent across samples and significant in 
at least one sample are the following: Haan’s 
Denial scale is positively correlated with REV 
and PRN and negatively correlated with 
TAO. Doubt is positively correlated with TAS 
and negatively with PRN. Repression is posi- 
tively correlated with REV and negatively 
correlated with TAO. Primitive Defense is 
positively correlated with PRN and REV. One 
possibly interesting sex difference is that TAS 
and Repression are positively correlated for 
males and negatively correlated for females. 
(See Table 5.) 


Other Validity Studies 


Responses of male alcoholics to the DMI. 
Michigan State University graduate students 
conducted a study of the DMI responses of 
54 male alcoholics (Aldridge, Baxter, Nop- 
ziger, Roggenbuck, Shimansky, & Wolthuis, 
1967). Relying on psychoanalytic theory to 
the effect that the alcoholic personality is 
rooted in a self-destructive, orally dependent 
orientation to life (Fenichel, 1945; Zwerling 
& Rosenbaum, 1959) and on the empirical 
findings that alcoholics demonstrate low ego 
strength (Borowitz, 1964), self-destructive- 
ness (White, 1966), and dependency (Witkin, 
Karp, & Goodenough, 1959), the investigators 
predicted that the alcoholics would be high 


TABLE 5 


CORRELATIONS BETWEEN SCORES ON THE DEFENSE 
Scares or THE DMI Anp Tuose or HAAN 


Females 


Scales 


Clinic 1) Clinic 2 |Clinic 1 |Clinic 2 


TAO X Displacement A x =. 
TAO X Denial — foes ~ 3s Tio 
TAO % Repression —33** | 33 | [20 
rojection ~. 
PRN X Intellectual- aE 
izing 09 =! 
PRN x Doubt “gN T33 -2 
en; 33 i 
PRN X Primitive ae MEE 
Defense 33% | 24 07 
TAS X Doubt 26" | 135 130% 
TAS X Repression 35% | —10 | — ose 
REV % Denki Tiaa | ee | T10 
jer ~ e 
REV X Primitive paa | ae 
efense 37% | ager | — og 


pm anak Ral Be a L E 
Note.—For Males, Clinic 1, M = 23, Clinic 2, V = 67: 
Females, Clinic 1.7 = 24, Clinic 2, N = 93. ss 

bad 335 


on TAS and REV and low on TAO and PRy 
relative to the normals. These predictions were 
all substantiated at an alpha level of 05 
with the exception of PRN, for which p < 10, 
The alcoholics also deviated more frequently 
from the intermediate range (2 SD above or 
below the mean of standardization group) on 
at least one defense mechanism relative to 
the normal sample (p < .001). 

The relationship of DMI to dream recall, 
The 180 University of Michigan female stu- 
dents on whom mean scores are reported in 
Table 3 were administered a questionnaire 
related to dream recall and were asked to 
keep a dream diary for 1 month, noting in 
it every morning upon awakening various 
aspects of their dreams.® Using the dream 
data drawn from the questionnaire, two pre- 
dictions were substantiated. The Ss (V = 29) 
who reported frequent dream recall (34 
times per week or more) were higher on TAO 
(p < .05) and lower on REV (p < .05) than 
were low dream recallers (i.e., eight Ss who 
reported they dream at most a few times per 
year). These findings are in line with those 
reported by Tart (1962) and others who at- 
tribute infrequent dream recall to the use of 
repression against threatening dream content. 
Significant correlations were not obtained, 
however, between defenses and frequency of 
dream recall when data from the dream diary 
were used. One possible interpretation of these 
results is that people who report low dream 
recall on a questionnaire are doing so as 4 
result of a continuous process of repression 
rather than as a consequence of immediate 
repression following the dream process. 

Psychotherapy and DMI scores. In an 
exploratory study on the effect of psycho- 
therapy on defense mechanisms, we asked 
each of five experienced psychotherapists to 
test two or three of his patients, a man and 
woman preferably, who had been in treat- 
ment with him for approximately 6 months. 
We compared these results with those ob- 
tained from a group of patients, matched for 
education, sex, age, and race (Caucasian) 
drawn from the waiting list. The results re- 
vealed (see Table 6) that patients on the 


"The authors wish to thahk David Cohen for 
permitting the use of this data. 
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waiting list were significantly high on REV 
(male p< .02; female p<.01) and low 
on PRO (male p < .05; female p < .01) rela- 
tive to patients in treatment. The difference 
on REV might indicate a reduction of denial, 
repression, and other avoidance mechanisms 
for patients in therapy, probably as the result 
of the therapists stressing and encouraging 
their patients to confront their feelings and 
face up to their problems. The other finding, 
that the patients in treatment were higher on 
PRO, was unexpected. One possible explana- 
tion is that the group of patients that stayed 
in treatment for 6 months was a more 
pathognomonic group relative to the waiting 
list patients (Grummon as quoted in Seeman, 
1965), many of whom may drop out before 
treatment begins or within a few sessions. 
The relationship of DMI to field articula- 
tion. Witkin and his colleagues (Witkin, Dyk, 
Faterson, Goodenough, & Karp, 1962), rely- 


TABLE 6 


MEANS AND STANDARD DEVIATIONS OF THE FIvE 
DEFENSE MECHANISMS OF THE DMI ror A Group 
OF PSYCHIATRIC OUTPATIENTS IN TREATMENT 
FOR 6 MONTHS AS COMPARED TO A 
GROUP OF PATIENTS ON THE 
Warrine List 


In therapy i On waiting list 


Scale | — t 
PE SD la ASD. 
Males 
Age 28.7 6.64 | 26.0 6.72 
Educa- 
tion 12.8 1.40 | 13.2 2.18 
TAO 44.2 | 9.88 | 38.8 7.65 | ns 
PRO 42.3 7.09 | 35.7 5.98 | <.05 
PRN 41.3 5.66 | 44.3 5.57 | ns 
TAS 40.7 | 10.06 | 40.5 5.50 | ns 
REV 31.6 8.07 | 40.6 7.57 | <.02 
Females 
Age 29.4 6.65 | 26.1 4.85 
Educa- 
tion 12.4 247 | 11.7 2.41 
TAO 39.3 | 11.34 | 30.8 | 11.02 | <.10 
PRO 39.3 7.44 | 32.5 3.47 | <.01 
PRN 44.4 7.87 | 48.0 9.79 | ns 
TAS 42.9 7.94 | 46.2 6.32 | ns 
REV 34.1 9.29 | 42.5 3.70 | <.01 
Note.—Patients on waiting list had been interviewed, 


diagnosed, and accepted for treatment. For Males in Therapy, 
aN List, N = 11; for Females in Therapy, 


ing on what they call the “differentiation 
hypothesis,” propose a network of association 
between degree of field articulation and vari- 
ous other measures of differentiation (e.g., 
body concept, sense of identity, cognitive 
styles, and defenses). Such defenses as denial, 
repression, and hostility-turned-inward reflect, 
according to Witkin, a lower degree of dif- 
ferentiation than do isolation, projection, and 
hostility-turned-outward. Lesser differentiation 
is assumed in the first group of defenses be- 
cause their operation involves a more primi- 
tive self-structure, as well as lesser separation 
of self from nonself, In a recently completed 
study, Ihilevich (1968) related the DMI de- 
fenses to the cognitive style of field articula- 
tion. A sample of 110 psychiatric outpatients 
(50 males and 60 females) were administered 
the DMI, the Embedded Figure Test, and the 
Figure Drawing Test. As predicted from 
Witkin’s “differentiation hypothesis,” Ss-who 
relied mainly on “global” defenses (TAS and 
REV) were more field dependent relative to 
Ss who relied excessively on “differentiated” 
defenses (TAO and PRO) (p < .01). 
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A battery of procedures for assessing ego functions was administered to 
normal elementary school children and institutionalized emotionally disturbed 
children, subdivided into 3 age levels. Orderly, predictable developmental 
differences in ego functioning were found within the normal sample, but there 
was little regularity or consistency within the abnormal sample. Comparison 
of the normal and abnormal groups revealed no differences in motor manipu- 
lation or level of aspiration, but the disturbed children were less able to in- 
hibit motor responsivity, less accurate in time estimation, and less willing to 
seek delayed gratification in preference to immediate reward. The findings are 
in keeping with expectations based on psychoanalytically oriented theories of 
ego development. Their implications for early diagnosis, prevention, and educa- 


tion were discussed. 


In recent years, an increasing amount of 
psychological research has been devoted to 
studies of ego functions. Singer (1955) pre- 
sented a comprehensive review of theories of 
ego development and indicated the implica- 
tions of these theories for clinical and experi- 
mental research. Singer and his colleagues 
(Singer, Wilensky, & McCraven, 1956) also 
reported empirical findings from studies of 
basic ego functions such as delaying capacity, 
fantasy, and planning ability in adult schizo- 
phrenics. Levine, Spivack, Fuschillo, and 
Tavernier (1959) and Spivak, Levine, and 
Sprigle (1959) published reports of research 
on ego functioning in which they focused on 
intelligence, motor inhibition, time sense, and 
self-control in disturbed adolescents. Similar 
phenomena have also been studied by Siegman 
(1961) in a group of young prisoners. Cross- 
cultural studies of delay of gratification in 
normal children have been conducted by 


1 This investigation was facilitated by small grant 
M-4327 (A) from the National Institute of Mental 
Health. The author wishes to acknowledge the as- 
sistance of Robert B. Carlin and Alexander W. 
Carnathan who collected the data for this study. The 
author is indebted, also, to Lewis P. Lipsitt who ar- 
ranged for the testing of normal children from the 
Seekonk, Massachusetts, public schools. The emo- 
tionally disturbed children were studied at the Emma 
Pendleton Bradley Hospital in Riverside, Rhode 
Island. 5, 

2 Also at the Emma Pendleton Bradley Hospital, 
Riverside, Rhode Island. 


Mischel (1959, 1961). Davids and his col- 
leagues (Davids, Kidder, & Reich, 1962; 
Davids & Parenti, 1958; Davids & Sidman, 
1962) studied time orientation in emotionally 
disturbed children and institutionalized juve- 
nile delinquents. Recently, Davids and Sid- 
man (1962) reported findings from a study of 
impulsivity, time orientation, and delayed 
gratification in underachieving and high 
achieving secondary school students. 

Most of these independent investigations 
show either an explicit or an implicit link 
with psychoanalytic theory (Freud, 1920, 
1946) and they provide empirical evidence 
pertinent to Rapaport’s thesis (1950, 1951) 
that development of thought processes in 
childhood results from inevitable delay of 
gratification and the necessity of learning to 
inhibit impulse expression. On the basis of 
these psychoanalytically oriented theories of 
ego development and previous empirical find- 
ings, the following predictions were formu- 
lated for examination in the present study: 
(a) basic ego functions are less well devel- 
oped in younger children than in older chil- 
dren, and (b) at each developmental level, 
emotionally disturbed children evidence less 
adequate ego functioning than do normal 
children. 

Although the present investigation is re- 
lated conceptually to several of the studies 
cited above and employs experimental pro- 


61 


62 


cedures similar to those utilized independently 
in some of these varied researches, it is de- 
signed to provide findings that have not been 
available previously. The experimental design 
is more comprehensive in that it includes both 
normal and disturbed Ss at younger develop- 
mental levels than those studied previously 
and includes a wider variety of procedures 
for assessing ego functions. More specifically, 
the present study includes normal children 
from public schools and children who were 
institutionalized for emotional disturbances. 
Within both samples, the children were sub- 
divided into three developmental levels and 
were administered a battery of experimental 
procedures designed to assess ego functions 
such as goal setting, motor performance, con- 
trol of impulsivity, time estimation, and delay 
of gratification. 


MerxHop 
Subjects 


This study employed two samples of children—34 
normal boys who were attending public school and 
38 emotionally disturbed boys who were undergoing 
residential psychiatric treatment. The normal sample 
was subdivided into the following three groups: 
thirteen 7-year-olds in the second grade, ten 9-year- 
olds in the fourth grade, and eleven 11-year-olds in 
the sixth grade, The average IQ of the children in 
each of these grade levels was between 100 and 110, 
with an overall mean of 105. The emotionally dis- 
turbed sample was subdivided into the following 
three groups: four 7-year-olds, thirteen 9-year-olds, 
and twenty-one 11-year-olds. In the disturbed sam- 
ple, the average IQ for each age group was between 
90 and 100, with an overall mean of 96. Many of 
these disturbed children do not perform on intelli- 
gence tests at a level in keeping with their estimated 
potential based on clinical evaluation (Davids, 1958), 
Therefore, the sample of disturbed children was also 
subgrouped on the basis of their mental ages as fol- 
lows: eleven 7-year-olds, thirteen 9-year-olds, and 
fourteen 11-year-olds, The mean mental age in both 
the overall group of normal children and the overall 
group of disturbed children was identical (10.0 
years). In analyzing findings obtained in this re- 
search, separate comparisons were made for sub- 
groupings based on chronological age and mental age. 
Since essentially identical results were obtained from 
the two methods, the results Presented in this report 
will be based on chronologi age subgroupings 
within the disturbed and normal samples. It should 
be emphasized, however, that the small number of 
younger children in the abnormal group may limit 
the generality of ‘findings obtained, 

The attempt was made to include all testable boys 
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within the residential treatment setting. This group 
of disturbed boys included four schizophrenic chil. 
dren and four who were classified as neurotic, but 
the vast majority (V=30) were classified as cases 
of severe behavior disorder (passive-aggressiye 
personality). These are the kind of boys who are 
termed “acting-out” children—hostile, antisocial, ex. 
plosive, and often unable to learn in the conven- 
tional school situation. 


Measures 


The children were seen individually, and the pro- 
cedures were administered in the order described 
below. 

Rate of manipulation and level of aspiration. The 
child was presented with the Minnesota Rate of 
Manipulation Form Board and was instructed to tum 
over as many of the blocks as he could before being 
told to stop. Following this 30-second trial, E re 
corded the number turned and announced it to the 
child. He then asked the child to state how many he 
thought he could turn over if he were given another 
trial. The level of aspiration was recorded, but the 
child was not permitted a second trial, as the pres- 
ent experimental design attempted to eliminate the 
effect of success or failure on the remainder of the 
experimental procedures. Since some of the children 
were unable or unwilling to state their level of 
aspiration in terms of a specific number, but gave 
responses such as “I'll do more more next time,” or 
“Tl do the same as (or less than) I just did,” the 
Tesponses were dichotomized into the categories of 
“better” and “same or worse.” The data from this 
Procedure were analyzed in terms of these two 
global measures of aspiration. 


| 
| 


Motor inhibition. The child was presented with & | 


mimeographed sheet containing a spiral. He was told 
to place his pencil at the center of the spiral and to 
draw his way out (in a manner similar to tracing & 
path through a maze) » going as slowly as he possi- 
bly could, but without stopping the movement of 
the pencil. 
path from the center of the spiral to the outside, and 
the child was supposed to traverse the path as 
slowly as possible but with continuous motor move- 
ment. The time to complete this task was recorded, 
and a maximum allowable time was set at 2 minutes. 

Time estimation. 
child and he was told that his task was to estimate & 
30-second passage of time. The E then started the 
stopwatch, hidden from the child’s view, and stopped 


the watch when the child indi, t he be- 
lieved to be 30 E ar 


provement or decrement, 

Delay of gratification. The child was asked the fol- 
lowing questions, First, “If I gave you 10¢, what 
would you do with it?” Then, “If I gave you $1, 
what would you do with it?” The answers were 
recorded verbatim and, for each of the two amounts, 
were later classified into the following three cate- 


In other words, there was a winding | 


A stopwatch was shown to the — 
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gories: (a) immediate gratification (e.g., buy candy, 
buy a coke, etc.), (b) short delay of gratification 
(eg, go to movies on the weekend, go to a skating 
rink within a few days, etc.), (c) longer delay of 
gratification (e.g., put it in the bank, save it toward 
a bicycle or a dog, etc.). 


RESULTS 
Rate of Manipulation and Level of Aspiration 


There is a consistent developmental pro- 
gression indicating increased efficiency in mo- 
tor manipulation. In the normal group, the 
three means (youngest = 20, middle = 30, 
oldest = 37) are significantly different. In the 
abnormal group, the youngest and middle sub- 
groups do not differ significantly (23 vs. 26), 
but both of them perform significantly less 
efficiently than the oldest subgroup (M = 
35). In comparing the normal and abnormal 
groups at each of the three developmental 
levels, no significant differences were obtained. 
The mean number of blocks turned in the 
overall abnormal group was 30, while the 
mean for the normal group was 28, indicating 
that psychopathology had no marked effect 
on this measure of motor performance. 

The level of aspiration findings reveal no 
developmental consistency. In the normal 
group, the youngest subgroup contained the 
largest percentage of Ss (54%) stating they 
would do “better” on a second trial. In the 
abnormal group, the oldest subgroup con- 
tained the largest percentage of Ss (57%) 
aspiring to improve on a second trial, Within 
both the normal and abnormal groups, none 
of the differences between percentages (rang- 
ing from 40 to 57%) were statistically sig- 
nificant. Comparing the total normal group 
with the total abnormal group reveals a non- 
Significant difference between the proportions 
(53% vs. 47%) aspiring to do “better” on 
the second attempt at motor manipulation. 
Thus, in the present experimental situation 
the aspirations of emotionally disturbed and 
normal children were very similar. 


Motor Inhibition 


The findings presented in Table 1 reveal 
Several noteworthy differences between the 
Normal and abnormal groups. Within the 
normal sample, there is an orderly develop- 


mental trend indicating a decrease in impul- 
sivity and increase in motor control with in- 
creasing age. The two younger subgroups do 
not differ significantly, but both of these 
subgroups obtain significantly lower scores 
than the oldest subgroup. Within the ab- 
normal sample, there is no orderly develop- 
mental progression. In fact, the youngest sub- 
group in this sample obtained the highest 
mean score on the motor inhibition task. None 
of the differences between means in this ab- 
normal sample, however, are statistically sig- 
nificant. Comparisons between the normal and 
abnormal samples reveal highly significant 
differences at both the middle and oldest age 
levels. Comparison of the scores in the two 


. overall samples reveals that the mean score in 


the normal sample was almost twice as high 
as the mean in the abnormal sample. These 
findings, then, provide definite evidence of 
greater impulsivity and less motor control in 
the emotionally disturbed children. 


Time Estimation 


The results obtained from the two attempts 
at time estimation reveal several similarities 
and some significant differences between the 
disturbed and normal groups of children, On 
the first trial, within the normal group, the 
youngest and the middle age groups make 
significantly higher error scores than do the 
children in the oldest subgroup. Within the 
abnormal group, there is a trend indicative of 
decreasing error scores on the first time-esti- 
mation trial as a function of increasing de- 
velopmental levels. However, there are no 
significant differences among the error scores 
in the abnormal sample. As shown in Table 1, 
at each of the developmental levels, the dis- 
turbed children receive higher error scores 
than do the normal children, but the only 
statistically significant difference is found at 
the oldest age level. At this developmental 
level, the mean error score for the disturbed 
children is 13.6, which is markedly higher than 
the mean error score of 7.7 in the normal 
children.® 


8 These mean error scores are in the direction of 
underestimation. That is, they indicate the average 
amount of time by which the children responded 
before the 30 seconds had elapsed. In both the dis- 
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TABLE 1 
COMPARISON OF SCORES (IN SECONDS) FOR NORMAL AND ABNORMAL Groups on 
MOTOR INHIBITION AND TIME ESTIMATION 
E Normal group Abnormal group 
Item F t 
M Variance M Variance 
Motor inhibition p 
Youngest 43.3 520.2 40.0 2846.0 5.47* ols 
Middle 46.6 393.4 24.2 499.7 1.27 Ae p 
Oldest 76.8 2079.9 29.3 1086.1 1.92 3. ra 
Combined 55.1 1160.6 28.3 1000.9 1.16 3.44 
Time-estimation error 

Trial 1 f 
Youngest 13.5 94.4 16.3 90.9 1.04 50 
Middle 13.7 69.1 14.2 102.4 1.48 À GSi) 

, Oldest 7.7 40.4 13.6 55.3 1.37 2.23 
Combined 11.6 73.1 14.1 711 1.03 1,22 

Trial 2 ; 
Youngest 13.8 60.7 15.5 123.0 2.03 4 
Middle 11.2 92.4 14.5 86.8 1.06 oe 
Oldest 4.2 18.8 9.4 72.1 3.84* pies 
Combined 9.8 70.3 11.8 84.4 1.20 96 


Note.—For the normal group, N = 34; for the abnormal group, N = 38, 
*p < 05 


ep < 101. 


On the second time-estimation trial, again 
there are significant differences within the 
normal sample, with the two younger sub- 
groups obtaining significantly higher error 
scores than the oldest subgroup. On this sec- 
ond trial, within the disturbed sample there 
is also a trend indicative of decreasing error 
scores with increasing age, but none of the 
differences within this sample are significant, 
Comparison of the disturbed and normal 
children, as shown in Table 1, reveals higher 
error scores for the disturbed children at each 
of the three developmental levels, although 
the difference is Statistically significant only 
at the oldest age level in this study. At this 
developmental level, the mean error score in 
the disturbed subgroup is over twice as high 
as the mean error score in the normal chil- 
dren. 

Comparison of each child’s error scores (in 
seconds) on the two trials provides a measure 


turbed and normal groups, very few children erred in 
the. direction of waiting beyond 30 seconds before 
responding. Thus, although errors could be made in 
either direction, almost every child in this study 
responded too soon. 


of improvement on the basis of this one-trial 
learning. That is, if a child obtains a lower 
error score on the second trial it is assumed 
that he learned something from his perform- 
ance on the first trial and that this one-trial 
learning leads to improvement on the second 
trial. Within both the disturbed and normal 
samples there is an orderly progression indi- 
cating increased improvement scores as a func- 
tion of increasing developmental levels. How- 
ever, none of the differences among the mean 
improvement scores (.7, 2.7, 4.5) are signifi- 
cant within the normal sample and the only 
Statistically significant difference within the 
abnormal sample (.8, 1.2, 4.2) is between the 
middle and oldest subgroups. Comparison of 
the disturbed and normal children at each 
developmental level reveals no statistically 
significant differences between the groups. In 
general, then, the findings indicate rather 
consistent tendencies for older children to be 
more accurate judges of the passage of brief 
intervals of time and for normal children to 
be more accurate than emotionally disturbed 
children. Although few of the differences 
within and between the groups are highly sig- 
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nificant statistically, there is consistency and 
orderliness within the findings, and in many 
instances the actual magnitude of the dif- 
ferences is noteworthy. 


Delay of Gratification 


As shown in Table 2, the normal group 
contained a significantly greater proportion 
who would delay gratification if they were 

given 10¢. Whereas 50% of these normal chil- 
dren would spend the money immediately, 
76% of the disturbed children would seek 
immediate gratification. At each of the three 
age levels, there were more disturbed children 
who would spend the 10¢ immediately, al- 
though the difference between proportions is 
Significant only in the comparison between 
the youngest subgroup in the disturbed and 
normal samples. Chi-square analyses within 
the disturbed group and within the normal 
group revealed no significant association be- 
tween the age groupings and the three cate- 
gories of delay of gratification. In fact, in the 
normal sample, the youngest subgroup con- 
tained the smallest proportion of children 
who would seek immediate gratification, while 
in the disturbed sample the youngest sub- 
group contained the highest percentage of 
children who would not delay gratification as 
Measured by this procedure. As mentioned 
Previously, however, findings from the young- 
ést group of abnormal children must be 
Viewed with caution, and may well be of lim- 


ited generality, due to the small number of 
Ss in this category in the present study. 

Analysis of findings utilizing the $1 value, 
instead of the 10¢ value, again reveal a higher 
proportion of disturbed children seeking im- 
mediate gratification at each of the three age 
levels. As shown in Table 2, for the total 
sample, 26% of the normal children would 
spend $1 immediately and 50% of the dis- 
turbed children would spend this amount im- 
mediately. Just as with the 10¢ value, chi- 
square analysis of within-group relations be- 
tween age levels and the three categories of 
gratification reveal no significant associations 
either within the normal sample or within 
the disturbed sample. 

It is evident from these findings that both 
groups contain a greater proportion of chil- 
dren who would delay gratification when the 
amount is $1 than when the amount is 10¢. 
The differences between the proportions show- 
ing immediate gratification with these two 
amounts is statistically significant in both 
samples. For the normal group, the difference 
between the proportions spending 10¢ and 
spending $1 is significant at the .04 level and 
for the disturbed group the difference is sig- 
nificant at the .02 level. 

Thus it can be concluded that there is no 
consistent developmental trend within either 
the disturbed group or the normal group, for 
either the 10¢ or the $1 value, but that for 
each of these two values a greater proportion 


TABLE 2 
COMPARISON OF NORMAL AND ABNORMAL GROUPS ON DELAY OF GRATIFICATION 
% normal group % abnormal group 
Critical 
Am z 
E Save Save Spend Save Save ratio 
Spend short long pea short long 
Small (10¢) 
Youngest 38 8 54 100 0 0 2.18" 
Middle 60 0 40 69 0 31 45 
Oldest 55 9 36 76 10 14 1.20 
Combined 50 6 44 76 5 19, 2.30* 
Larger ($1) 
Youngest 31 23 46 50 50 0 69 
Middle 10 30 60 54 i8 38 2.20* 
Oldest 36 18 46 48 14 38 65 
Combined 26 24 50 50 16 34 2.10* 
Bet? <.0s, 
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of the disturbed children would seek immedi- 
ate gratification and more of the normals 
would save the money for some future gratifi- 
cation. Also, for both emotionally disturbed 
and normal children there is a realistic per- 
ception of the difference in value between the 
amounts of 10¢ and $1, and increasing the 
amount of money involved has the effect of 
reducing the proportion of children who 
would seek immediate gratification in both 
the disturbed and normal samples. 


Associations among Measures 


As shown in Table 3, the only significant 
correlate of IQ in the group of normal chil- 
dren is the time-estimation error score on 
Trial 1. Motor manipulation is associated 
significantly with both motor inhibition and 
errors on the second time-estimation trial, 
indicating that those normal children who 
turned the most blocks tended to spend longer 
time drawing their way out of the spiral and 
were more accurate in their time estimations. 
For this normal group, the correlations be- 
tween time-estimation and motor inhibition 
scores seem particularly noteworthy. The 
correlation of .88 between the two trials of the 
time-estimation task indicates a high degree 
of reliability. Moreover, there are significant 
negative associations between time-estimation 
scores, on both trials, and the measure of 
motor inhibition. These coefficients indicate 
that, within the normal group, those children 
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who were most accurate in their judgments of 
the passage of time were better able to delay 
their motor responses in a situation calling for 
control and inhibition. It is evident from 
Table 3 that, for the normal Ss, the level of 
aspiration and delay of gratification measures 
are not significantly associated with the other 
measures. However, there is highly significant 
association between the two measures of delay 
of gratification, with those normal children 
who would save 10¢ tending to be those who 
would also save $1. 

As shown in Table 4, in the abnormal 
group, IQ is positively associated with motor 
manipulation, level of aspiration, and time- 
estimation improvement, and is negatively 
associated with time-estimation errors on the 
second trial. In other words, those disturbed 
children who did better on the IQ test, turned 
more blocks, evidenced a higher level of as- 
piration, were more accurate in estimating the 
passage of time, and showed greater improve- 
ment from one trial to the next. Motor manip- 
ulation is negatively associated with time-es- 
timation errors and positively associated with 
time-estimation improvement score. Thus, 
those disturbed children who obtained higher 
scores in the block-turning situation tended 
to be more accurate in the time-estimation 
situation, and also showed greatest improve- 
ment from the first to the second trial. 

There is a correlation of .78 between the 
two measures of time estimation, indicating a 


TABLE 3 
ASSOCIATIONS AMONG MEASURES WITHIN THE Norma Group 
Measure 2 3 4 | 5 | 6 | i | 8 | 9 
IQ (1) 3 } 9 -23 cet ai | —.32 12 00 1.14 
Motor manipulation (2) 35*  —14 + 135% 7 00 12 52 
Motor inhibition (3) Si ae Oren eh 12.95 
Time-estimation error 1 (4) .88** 27 2.95 ‘00 2.72 
Time-estimation error 2 (5) 21 12 ‘00 2.64 
Time-estimation improve- i è í 3 
ment (6) 7 
Level of aspiration (7) y n i K 
Delay of gratification (10¢) (8) i 9.54 


Delay of gratification ($1) (9) 


Note.—The values shown under Columns 2 through 6 x 
Columns 7 and 8 are chi-square tests of association, with df = 1, orenean ee poe ticients of correlation; those shown under 


squares with df = 2.N = 34 
*p <.05. 
+p <..01. 


inuity; those shown under Column 9 are chi- 


| 
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TABLE 4 
ASSOCIATIONS AMONG MEASURES WITHIN THE ABNORMAL GROUP 
Measure 2 3 4 | 5 6 7 | 8 | 9 

IQ (1) .64** -08 —.28 1E Se .38* 3.89* .02 67 
Motor manipulation (2) —.06 —.29.  —:46* 43 1.22 34 4.56 
Motor inhibition (3) -02  —.27 .38* 1.22 .03 1.96 
Time-estimation error 1 (4) Fal .08 ,00 90 .45 
Time-estimation error 2 (5) —.52** 10 58 13 
Time-estimation improve- 

ment (6) 00 03 69 
Level of aspiration (7) 03 3.09 
Delay of gratification (10¢) (8) 7 3.70 


Delay of gratification ($1) (9) 


Note.—The values shown under Columns 2 through 6 are product-moment coefficients of correlation; those shown. under 
Columns 7 and 8 are chi-square tests of association, with df = 1, corrected for continuity; those shown under Column 9 are chi- 


squares with df = 2. N = 38. 
*p <.05. 
*p <0. 


satisfactory degree of reliability. This finding 
is in keeping with the relationship found in 
the normal group. However, whereas signifi- 
cant correlations were found between the 
time-estimation error scores and the motor 
inhibition scores for the normal children, 
there is no significant association between 
time errors and motor inhibition in the ab- 
normal group. Interestingly, however, there is 
a significant correlation between time-estima- 
tion improvement score and motor inhibition 
score in the abnormal group. That is, those 
disturbed children who showed the greatest 
Improvement in their time estimations from 
one trial to the next tended to be the ones 
who were better able to comply with the 
instructions calling for motor inhibition and 
Control. As with the normal group, there is 
little evidence of associations between the 
level of aspiration and delay of gratification 
Measures and the other measures within the 
abnormal group of children. 

Tt may be that the simple goal-setting task 
employed in the present study is not suff- 
ciently refined to show consistently high rela- 
tions with various other measures designed to 
assess ego functions. Moreover, it should not 
be overlooked that the delay of gratification 
Procedures employed in this study were re- 
toe to two values, with the children be- 
i na to respond to a hypothetical situa- 
t f the procedure employed different 

ues to be either gained immediately or 


postponed to the future and/or if the children 
were confronted with actual choices, rather 
than hypothetical choices, then it might be 
that significant associations would be found 
among measures of preference for delayed 
gratification and other signs of ego strength. 
These possibilities remain to be explored in 
future studies. 


DISCUSSION 


The findings from this investigation reveal 
noteworthy similarities and differences in 
varied aspects of ego functioning in normal 
children and emotionally disturbed young- 
sters. In the area of motor manipulation these 
two groups of children do not differ and, 
within both samples, there is orderly progres- 
sion of better performance as a function of 
age. Previous research (Davids & Oliver, 
1960; Davids & White, 1958) has also demon- 
strated that on simple tasks requiring gross 
motor movements emotionally disturbed chil- 
dren perform as well as, or even better than, 
normal children. 

Interestingly, however, when the task calls 
for a voluntary slowing down of response, with 
inhibition of motor movements, the disturbed 
children function much less effectively than do 
the normal children. In other words, when 
the situation demands that they work rapidly 
and with abandon, the disturbed children are 
successful, but when they are asked to re- 
strain their energies and to inhibit their motor 
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responses they fare less well than do normal 
children. It is most impressive that a task as 
simple as drawing a path around a spiral 
differentiates to such an extent between chil- 
dren who function adequately in ordinary so- 
ciety and children whose behavior in everyday 
life is so socially unacceptable that they have 
to live in the special environment offered by 
a residential psychiatric treatment center. 
Since these disturbed children have so much 
difficulty in complying with the demands of 
this very simple uncomplicated task, it is 
obvious that they would experience great diffi- 
culty in the usual classroom situation. Many 
of the children in the disturbed sample would 
be characterized as suffering from the hyper- 
kinetic impulse disorder (Laufer, Denhoff, & 
Solomons, 1957) and most of them are at 
least 2 years retarded in scholastic attain- 
ment (Talmadge, Davids, & Laufer, 1963). 
They have low frustration tolerance and are 
extremely restless and active in an ordinary 
class situation. This sort of behavior makes 
it difficult to cope with the demands of the 
usual learning situation, and as these children 
proceed through the early developmental years 
they fall further behind in conventional 
learning and mastery of academic material. 
The present experiment demonstrates quite 
convincingly that motor inhibition is hard 
for them to master and this area of difficulty 
must be considered in any educational plan- 
ning for children like these. If one cannot 
even get them to slow down or to inhibit their 
responses in such a simple situation, how can 
they be expected to cope with complex phe- 
nomena like reading and writing as they are 
taught conventionally in the public schools? 
Although the level of aspiration situation 
constituted only a small facet of this experi- 
ment and the goal-setting task employed may 
seem overly simple, one might well be sur- 
prised to find that the disturbed children do 
not differ from the normal children. In an 
earlier study, using different groups of dis- 
turbed and normal children, Davids and 
White (1958) found that under neutral con- 
ditions disturbed children set goals for them- 
selves that were at least as high as those es- 
tablished by normal children. However, if 
they met failure or criticism, their aspiration 
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levels suffered to a much greater extent than 
did those evidenced by normal children. 

An interesting finding, in this regard, is the 
significant association between the level of 
aspiration measure and IQ scores for the 
disturbed children, with absolutely no rela- 
tionship between these measures in the normal 
group. Other studies (Davids, 1958) have 
shown that emotional disturbance often has 
a pronounced detrimental effect on intellec- 
tual functioning in children. It seems that 
many of the abilities called for on conven- 
tional intelligence tests are similar to those 
we have attempted to study in the present 
research. Within the abnormal group, those 
children who are highly motivated to turn the 
blocks in keeping with the examiner’s instruc- 
tions to work rapidly, who desire to do even 
better on a second performance as evidenced 
by their stated aspiration, and who have more 
accurate time orientation, are the ones who 
also tend to obtain higher scores on the 1Q 
test. With the normal children, however, there 
is noticeably less association between these 
measures of ego functions and IQ scores. 

The failure of regularity in development of 
time orientation is also demonstrated by the 
present study. Within the normal sample 
there are marked differences between the time- 
estimation accuracy of younger and older chil- 
dren, but the differences between younger and 
older emotionally disturbed children are 
much less striking and are nonsignificant. Ac- 
cording to psychoanalytic theory, a prime 
function of the ego is to integrate events from 
the past, present, and future of a person’s life 
space. The normal individual must be able to 
inhibit impulsivity and forsake immediate 
pleasures in order to attain future gratifica- 
tions that accompany the fulfillment of long- 
range goals. Since the disturbed children arè 
believed to possess weak ego structure, they 
should show impaired temporal orientation in 
comparison with their age mates who possess 
more adequate ego structures. This investiga- 
tion provides empirical evidence that this is, 
in fact, the case. 

The findings in regard to delay of gratifi- 
cation are also in keeping with theoretic 
expectations, with a greater proportion 0 
normal children, at each of the development 
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levels incorporated in the present experimental 
design, evidencing the ability to delay grati- 
fication. Most theories of child development 
propose that the ability to delay gratification 
has its origin in the early feeding situation 
between mother and child. The child learns to 
wait for his rewards and comes to learn that 
food (and love) will be forthcoming. This re- 
quires the development of a sense of basic 
trust in one’s environment (Erikson, 1950). 
In the microcosm of the present study we see 
that emotionally disturbed children, whose 
case histories attest to early infantile feeding 
difficulties and considerable hostility and in- 
security in the mother-child relationship 
(Davids & Lawton, 1961), are much more dis- 
posed to seek immediate gratification and are 
less oriented toward more distant goals. 

The problem for the future is how to pre- 
vent these ego disturbances from developing 
and, once developed, how to effect changes in 
a personally and socially enhancing direction. 
Studies of ego functions in adults in mental 
hospitals, prisons, and other institutional set- 
tings will undoubtedly shed further light on 
the differences between disturbed and normal 
individuals. However, such studies can pro- 
vide only retrospective evidence for patho- 
logical antecedents and, unfortunately, reveal 
little in the way of effecting basic changes. It 
is hoped that further studies which improve 
upon and extend the kind of information 
gathered from the present experimental ap- 
Proach will lead not only to increased under- 
standing of ego functions in children but also 
to improved methods for early modification of 
inadequate ego development before the dam- 
age becomes irreversible. 
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PRELIMINARY SOCIALIZATION FOR PSYCHOTHERAPY 
OF BEHAVIOR-DISORDERED ADOLESCENTS 


ROBERT L. SCHWITZGEBEL 1 


University of California, Los Angeles 


A common problem in treating adolescent delinquents is their failure to par- 
ticipate in traditional psychotherapeutic procedures. Small cash bonuses given 
contingent on participation in tape-recorded nondirective interviews were 
shown to be effective in shaping dependable and prompt attendance. Using a 
variable schedule of reinforcement, extrinsic rewards could be gradually reduced 
without corresponding decrement in attendance behavior. The rationale for 


using such a procedure is emphasized. 


A patient cannot be treated in absentia. A 
minimum expectation in traditional psycho- 
therapy is that the patient will come to a 
specified location at a predetermined time on 
a fairly regular basis for a person-to-person 
verbal exchange. And generally it is assumed 
that voluntary participation is highly desir- 
able, if not essential, for success of the treat- 
ment (cf, Hollingshead & Redlich, 1958). 

Unfortunately, a substantial number of 
persons whose behavior often represents some 
social threat (e.g., psychotics, psychopaths, 
adolescent delinquents) may be among the 
Teast likely to volunteer for therapy. In a 
follow-up study of 54 adolescents with behav- 
ior disorders referred to a psychiatric clinic, 
Hammar and Holterman (1965) found that 
21 would-be patients failed to appear, and 11 
More discontinued after the initial contact. 
Rosen, Bahn, Shellow, and Bower (1965) re- 
Ported that of an estimated 194,000 adoles- 
cents treated in outpatient clinics in the 
United States in 1962, approximately one- 

withdrew from treatment. Similar lack 
of adequate voluntary attendance is reflected 
ìn employment data cited by Gordon (1965). 
Of 234,000 underprivileged youths invited by 
letter to visit an employment counselor, only 
42,000 appeared for interviews. Of these, 
fewer than 13,000 were referred to jobs, and 
less than 7,000 were hired—some for only a 
few days. Similar to orthodox psychothera- 
Pists, the typical employer expects or demands 
a high rate of goal behavior (e.g., “After all, 
* Requests for reprints should be sent to Robert 


Schwitzgebel, Department of Psychology, University 


pg tornia, Los Angeles, Los Angeles, California 
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I’m running a business.”). Failure of the 
youth to meet this criterion usually results in 
threats and punishment which may antagonize 
the youth and cause him to devaluate himself 
further as a potential employee. 

The statistics which have been cited simply 
confirm a persistent and widely acknowledged 
difficulty confronting virtually every practi- 
tioner who deals with this population. Anna 
Freud (1958), for example, wrote that ado- 
lescents are very difficult to get into treatment 
because they do not cooperate, they miss ap- 
pointments, they are unpunctual, they cannot 
or will not introspect, and their rapidly chang- 
ing emotional patterns leave little energy 
available to invest in the analyst. Stieper and 
Wiener (1965) have claimed that developing 
motivation in behavior-disordered patients 
“is probably the single most crucial problem 
with which therapy must grapple [p. 123].” 
The purpose of this paper is to summarize 
briefly a strategy for dealing with adolescent 
delinquents and certain other groups of pro- 
spective patients who are often minimally 
motivated to participate in psychotherapy. 


RATIONALE 


The reasons usually given for poor partici- 
pation involve the adolescent’s desire for in- 
dependence, revolt against authority, a nar- 
cissistic belief in his ability to solve his own 
problems, reluctance to talk about hostile or 
sexual thoughts, etc. (e.g., Pelpz, 1957). In 
Aichhorn’s classic work, Wayward Youth 
(1955), he argued that one of the chief diffi- 
culties in treatment was the fact that a de- 
linquent “does not suffer discomfort from his 
symptoms [p. 27]” in the way which a neu- 
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rotic does. Alternately, one might hypothesize 
that traditional psychotherapeutic procedures 
simply appear more threatening to some po- 
tential patients than others. In this respect, it 
should be noted that many of the character- 
istics attributed here to delinquents which 
tend to make them unsuitable for traditional 
psychotherapy may be class-linked rather than 
related to age or pathology. That is, these 
predispositions characterize a large number of 
persons, especially in the lower socioeconomic 
class, who are neither adolescent nor delin- 
quent (cf. Riessman, Cohen, & Pearl, 1964). 
Therapy procedures which seemingly re- 
„quire adolescent delinquents (particularly 
males) to acknowledge feelings of weakness, 
ignorance, helplessness, or passivity may be 
extremely aversive. From the adolescent’s 
point of view, accepting help may imply that 
he is weak or sick or “queer.” He is likely to 
believe that psychotherapists are a strange 
kind of doctors who do mysterious things to 
crazy people. The delinquent knows that he is 
not like the crazy people he has seen in tele- 
vision movies. To the contrary, he may pride 
himself on being active, strong, and clever. 
This image is especially valued in his peer 
group subculture. In exchange for an adult’s 
promise of something better in the future, it 
seems to the delinquent that he must sacrifice, 
in advance, his social identity and his place 
on the street corner. These may not be much, 
but they are all he has. 

Should the delinquent happen to volunteer 
or be forced into an initial interview, the ex- 
perience may be embarrassing and punitive 
because traditional mental health procedures 
are not designed to treat patients with his 
characteristic reactions. For example, he may, 
having had no other experience, relate to the 
psychotherapist as he has to (other) medical 
doctors. He waits for questions, gives brief 
answers (not always accurate), and waits for 
the next question or for some specific advice. 
The therapist, on the other hand, who is often 
a trainee or resident in the clinic, may rigidly 
wait for the patient to verbally elaborate on 
feelings. The ensuing silences and awkward 
exchanges are likely to confirm the patient’s 
doubts regarding the possibility of getting 
“real help,” and the therapist may become 
more convinced that psychotherapy is appro- 


priately restricted to individuals who exhil 
what Schofield (1964, p. 133) has referred 
as the “Yavis” syndrome—persons who af 
youthful, attractive, verbal, intelligent, a 
successful. 

Unfortunately, the patient’s willingness { 
become involved has often been attributed 


which seem to lie outside the province 
treatment. Attendance at psychotherapeu| 
interviews is not, however, an intrinsic a 
tribute of prospective patients. It is the result 
of innumerable variables such as physical se 
ting and the nature of verbal and nonverb 
communications as well as the patient’s pr 
dispositions. Cooperation is more practicab 
viewed not as “something” which the pati 
brings to therapy but as a product of therap 

Since cooperative behavior among deli 
quents is likely to be minimal and of shë 
duration, the therapist may have only one! 
two opportunities to strengthen the desi 
responses. Therefore some relatively simp 
direct, but flexible strategies are mandatory 
Again, Aichhorn (1955) appropriately notet 


I consider this first moment of our coming togeth 
of the utmost importance. It is more than a “feelin 
out” of the situation; it must have the appearane 
of certainty and sureness and must be put throug 
quickly as possible because in most cases it forms 
foundation for our later relationship. The adolescell 
does the same thing when he comes in contact 
me. He wants to know right away what kini 
person he is dealing with [p. 99]. ; 


Passivity on the part of the therapist may bë 
misinterpreted as weakness, lack of con 
or hostile withholding of information. On 
other hand, gestures of friendship and att en- 
tion such as a sincere compliment abot 
clothes or appearance may actually pi 
threatening. This is particularly true in 
case of adolescent delinquents whose personal 
histories confirm that attention from elders i 
usually paired with punishment. The occasi 

on which parents, teachers, or public offici 
have given attention to our prospective 
tient without some form of punishment n 
be indeed rare. We can assume that fear 
counter-hostility will generalize to the therapi 
situation unless the therapist’s actions 4 
sufficiently unique to force the delinquent # 


n 


make new discriminations. Thus, one of th 
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primary tasks of the initial interview is to 
break up stereotyped response patterns with 
minimal aversiveness to the patient. 


STRATEGIES 


Despite theoretical differences, a survey of 
the techniques actually employed during the 
initial phases of treatment of behavior-dis- 
ordered adolescents by a number of well- 
known therapists shows remarkable conform- 
ity. Slavson (1956) noted, for example, that 
in group work, “elements of surprise and nov- 
elty were constantly being introduced into the 
group therapy situation in the form of varja- 
tions in activity and unexpected refreshments 
[p. 20].” Aichhorn (1955) reported that 


I usually begin with a friendly look or attitude, 
sometimes I say, ‘How do you do?’ or I may only 
shake hands in silence. I say that there is nothing 
here to be afraid of, that this is neither a police 
station nor a court. Sometimes I tell a joke by way 
of introduction [p. 99]. 


Schmideberg (1949), who will “make a point 
of shaking hands with some of these patients 
as often as possible” and accept an invitation 
to tea, has claimed that her “‘spontaneous and 
unprofessional manner helps to break down 
their depersonalization which is an important 
defense mechanism of the ‘callous’ criminal 
[p. 186].” 

Neill (1962) has provided an example of 
what might be considered positive reinforce- 
Ment of attendance behavior at his psycho- 
analytically oriented interviews known as 
private lessons”: 


“With, young children, the technique was more spon- 
taneous, I followed the child’s lead. Here is a typical 
first P.L, with a six-year-old girl named Margaret. 
She comes into my room and says, ‘I want a P.L? 
Righto,’ I say. 
She sits down in an easy chair. 
‘What is a P.L.? she asks, 
t isn’t anything to eat,’ I say, ‘but somewhere in 
his pocket I have a caramel. Ah, here it is.’ And I 
give her the sweet, 
ay do you want a PL? I ask. 
ae has one, and I want one too.’ 
eee: You begin it. What do you want to talk 
out?’ [p. 42]” 


aee (1949) wrote, “It is a general rule 
a ead of the delinquent never to act the 
y he expects, but unceasingly to introduce 

ew and unforeseen elements in order to keep 


alive his interest in the therapeutic situation 
[p. 19].” Specifically, he recommends the use 
of money, although he cautions that the situ- 
ation “is wrought with danger and therefore 
requires special skill and tact.” 

Gradually, as certain commonalities emerge, 
successful treatment should become less a 
matter of “special skill” and more a matter of 
definable technique. As a rather simple demon- 
stration of a way in which dependable attend- 
ance at psychotherapeutic sessions could be 
obtained, a program of instrumental condi- 
tioning was instituted among adolescent de- 
linquents by the present writer. No referral 
procedure was involved, Prospective Ss were 
contacted randomly in poolhalls, amusement 
centers, etc. Inasmuch as details of the ex- 
perimental results have been reported else- 
where (Schwitzgebel, 1967), this report will 
emphasize initial socialization procedures 
which have, over a period of years, become 
sufficiently systematized and successful to 
eliminate the problem of obtaining an ade- 
quate number of nonincarcerated delinquents 
for research and clinical training purposes. 


METHOD AND RESULTS 


Specification of Goal Behavior 


The terminal or goal behavior in the present 
demonstration was dependable and prompt arrival 
at a clinic office for tape-recorded interviews. 


Estimation of Goal Behavior Base Rate 


The Ss were selected from a population with a 
low probability of dependable attendance at a clinic, 
Twenty-one male adolescent offenders, ranging in 
age from 12 to 21 years and averaging 15 months of 
probation, were contacted directly on street corners 
or in amusement centers by Es and asked to par- 
ticipate in tape-recorded interviews. It was explained 
that S’s task would be to express his opinion and 
feelings about parents, teachers, policemen, schools, 
reformatories, etc.; they would be paid $1.00 an 
hour, but one of the “qualifications” for the job 
was a court record. (Records were verified later.) 

Eight members of the experimental group had his- 
tories of active refusal to participate in psychiatric 
treatment, and seven Ss had been incarcerated an 
average of 18 months. The base rate for attendance 
among Ss for the first four interviews showed a 
group mean of 38 minutes late. (Failure to arrive for 
the scheduled appointment was scored as 60 minutes; 
nine out of 21 Ss failed to appear for the first inter- 
view, and approximately 50% of those who did 
arrive were late.) Most Ss who missed an interview 
were able to be contacted a few days later at the 
original meeting place. 
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Selection of Reinforcing Consequences 


Eissler (1949) has claimed that “usually no suc- 
cessful therapy of a delinquent is possible without his 
obtaining money from his analyst during that in- 
troductory phase of treatment when all possibilities 
must be mobilized to make the analyst a valuable 
person in the delinquent’s eyes [p. 20].” Money is 
not only a readily-exchanged generalized reinforcer 
for most delinquents, but it frequently serves to 
“protect” the prospective patient from criticism by 
the peer group. Almost any activity is approved in 
the delinquent subculture if the participant can 
“make a fast buck”—this includes visits to a psycho- 
therapist. Other tangible “face-saving” consequences 
or valued activities will sometimes suffice. If money 
is used in the initial phase, it can be gradually re- 
placed by more personalized and idiosyncratic rein- 
forcers, These usually prove to be more effective and 
are well-received by the patient, 

Premack (1963, 1965) has suggested that any ac- 
tivity which has a higher probability of occurring can 
be used as a reinforcer for a response of lesser prob- 
ability. Thus, if at a given time, the independent 
response rates in a “free choice” situation for an 
adolescent would show the following rank order: 
Watching a movie (M) > playing pool (P) > eating 
candy (C) >reading a comic book (B), then P 
could reinforce C and B but not M. If true, this 
principle suggests that literally hundreds of poten- 
tial reinforcers are available for use by the therapist. 
An obvious difficulty arises, however, in estimating 
the respective probabilities of various activities un- 
less one has knowledge of previous deprivations. In 
work with retarded children, Addison and Homme 
(1966) have used what they term a “behavior menu.” 
The “menu” consists of a booklet of pictures and 
descriptions of brief activities (eg. a child being 
pushed down the hallway on a chair by E) which 
the child may choose when he becomes eligible for 
reinforcement. Another simple way of determining 
what tangible consequences might serve as reinforc- 
ers is to take a leisurely walk through a department 
store with, the patient and to note various items 
which receive favorable comment. 

In addition to administrative, ethical, and finan- 
cial considerations in selecting reinforcers, it is gen- 
erally held that the reinforcing activity should not 
be directly incompatible with the goal behavior. For 
example, dismissing a child from school as a rein- 
forcing consequence for regular attendance at school, 
even though such might have a relatively high prob- 
ability at a given moment, is not considered advis- 
able. Dismissal from school may be reinforcing (not 
punishing) for truants; detention is also likely to be 
ineffective since it is a direct pairing of a noxious 
event with presence at school, Counter-conditioning is 
probably the procedure of choice? 


2An unpublished study by Schwitzgebel, Faltico, 
and Feeney (1967) selected 10 junior high school stu- 
dents from an underprivileged urban area who had 
never attended all 5 days of school in a given week 
for a period of 2 years. Home visits and small gifts 
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Arranging Contingencies 


In our demonstration project, the basic wage of 
$1.00 an hour was supplemented by positive conse- 
quences contingent on dependable and prompt at- 
tendance. The first problem, of course, is just to get 
the delinquent into the office under what he views 
as favorable conditions. The following conversation 
is a sample street-corner contact. In this particular 
instance, two Es saw five prospective Ss standing in 
a row in front of a dime store. It was mid-winter, 
The boys were huddled in their jackets; there was no 
conversation. 


E: (approaching Ss) We’re looking for some real 
people. 

S1: That’s us! 

52: See, look we move! (Waves arms; all laugh.) 
E: Hey, you really are! (Exaggeratedly imitates 
arm waving, more laughing.) 

We're from college and trying to find out what 
kids think about cops and teachers and school and 
stuff like that. Instead of reading a bunch of books, 
we decided just to go out and ask the kids them- 
selves. We usually pay $1 an hour or a little more 
for kids to talk into a tape recorder. . . . We'll 
pay you a dollar if you want to come with us to 
some restaurant around here; you can get some 
food too. You don’t have to say anything you 
don’t want to—no names or places or stuff like 
that. We're straight, and we're not cops. 

53: There’s a pizza place down there (pointing). 
S2: You mean we get a buck and mooch some 
free food? 

E: Ya. 

S2: (Offering E some peanuts from a small 
crumpled bag) Want some of these? 

E: No thanks. 

S4: I can tell you that the cops in (names a town) 
are a lot worse than those in (names another near- 
by town). 5 

E: We want kids who know what they’re talking 
about—not just a bunch of bull. You've got to 
have a record or know what you’re talking about. 
52: Ha! I'm eligible all right! (Laughs) Here’s my 
probation card. (Pulls out a partly torn probation 
card from his pocket and hands it to the EZ. E 
reads some of the rules aloud, such as “not keeping 
bad company”; group laughs.) 

E: Anybody else got a record to qualify for the 


contingent on gradual improvement of attendance 
showed significant results over a 6-week period. Due 
to the ending of the school year, no data were ob- 
tained regarding the maintenance of such behavior 
following termination of reinforcement. It might be 
noted, however, that in contrast to school attendance 
officers, who typically apply aversive sanctions fol- 
lowing undesired behavior, the experimental pro- 
cedure produced at least temporary gain. (One parent 
of Spanish-American descent commented to a gradu- 
ate assistant that he was the first white person wh? 
had come to their house not to collect the rent oF 
“tell us we done something wrong.”) 
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restaurant? (Several boys point to a member of 
the group who reluctantly admits he does. Soon 
the Es and two boys go off to the pizza parlor. 
The E apologizes to the remaining three boys with- 
out records for not being able to take them along. 
Once the group gets to the restaurant, the boys ask 
if it would be all right just to order a coke and 
receive cash in place of food. The E agrees, Ap- 
proximately 20 minutes is spent in the restaurant 
talking in a general way about police. The boys 
are then taken to the project office, play briefly 
with the tape recorder, are given a small cash 
bonus, and an appointment for another meeting is 
scheduled for the same time the following day.) 


Obviously the delinquents are suspicious. They 
typically speculate that E is a gangster, policeman, 
homosexual, escaped mental patient, or—as one of 
them put it later—“just some new kind of nut.” The 
task of Æ is not to make guesses regarding the cog- 
nitions of the prospective patients, but to establish 
environmental conditions in which desired behavior 
will be prompted and reinforced. This is not ex- 
cessively difficult. Delinquents often stand in col- 
lective boredom waiting for something to happen. 
The offer of a little free food, adventure, and money 
under a situation in which Ss outnumber Es in a safe 
public place has resulted in acceptance by an esti- 
mated 70% of eligible adolescents. It might be noted 
that food is used in this situation not only as a 
generalized reinforcer contingent on cooperative be- 
havior but also as an unconditioned stimulus incom- 
patible with anxiety (cf. desensitization procedures of 
Wolpe & Lazarus, 1966). 

One group of nine Ss was then selected to receive 
teinforcers contingent on attendance, while a matched 
group of Ss served as controls by receiving bonuses 
contingent on an unrelated behavior. The Ss did not 
know when they would become eligible for a rein- 
forcement or the nature of the reinforcement (i.e., a 
variable interval-variable ratio schedule). 

In difficult cases when the patient does not have 
the Necessary repertoire of social skills, it may be 
Necessary to use a shaping procedure. In another 
Study (Schwitzgebel, 1964), two boys were met at 
successively closer geographical locations to the office. 
The E arranged to meet the boys on the first two 
Occasions at a downtown amusement center, the next 
Meeting outside a subway station near the amuse- 
ate center, then inside the subway station after the 
a gate, eventually outside the subway station near 
the office, and finally at the office. Arrival at the des- 
pease location was casually rewarded with a cig- 
T te or candy bar or perhaps money (in the form 
5 a token put into the toll gate for S). The shaping 
equired approximately six to eight meetings over a 
Period of 2 weeks. 


Recording Dependent Variable(s) 


fee demonstration project groups improved in 
Jh aes and dependability of attendance over the 
i Sessions; however, the contingent-reinforce- 

group had a discrepancy of 11.2 minutes be- 


tween appointment time and arrival time, while the 
noncontingent-reinforcement group showed a 15.6 
minute discrepancy (Fisher test, p< .025). Since the 
noncontingent-reinforcement group received bonuses 
for other socially desirable behavior (e.g., putting 
coke bottles away at the end of the interview), gen- 
eralization effects possibly obscured differences which 
might have occurred if more orthodox procedures had 
been used. Most Ss came to look upon the inter- 
views as an interesting and challenging game; an- 
ticipatory approach behaviors became stabilized at a 
fairly high rate. Skinner (1953) and others have 
pointed out that gambling establishments use varia- 
ble schedules of reinforcement. On one occasion, we 
noticed an S$ standing outside the office door, waiting 
for the chimes of a nearby church, to ring, so that 
he could enter just at the appointed moment. Using 
the VI-VR schedule, nonverbal consequences could 
be faded out between the fifteenth and twenty-fifth 
session without noticeable decrement in promptness 
of attendance. 


Discussion 


The strategy outlined here for introducing 
delinquents to psychotherapy is by no means 
entirely novel. Indeed, the implicit use of re- 
inforcement techniques by therapists of widely, 
divergent orientations has important implica- 
tions since it suggests that an empirically 
based technology may begin to be established. 
In this way, the potential replicability of 
clinical procedures should be improved, and, 
at the same time, alternative methods should 
become more obvious. For example, Redl and 
Wineman (1952), in a chapter entitled “Tech- 
niques for the antiseptic manipulation of sur- 
face behavior,” note the ineffectiveness of 
making promises to behavior-disordered chil- 
dren and recommend that a group leader give 
“gratification grants without strings attached 
[p. 229].” They assert further that to tie 
these gratification grants to a condition would 
be “like promising a child cough medicine 
provided he goes without coughing for a day.” 
That is, they recommend the use of noncon- 
tingent consequences since they believe, ap- 
parently, that it is impossible to set appropri- 
ate criteria for future performance. What is 
not explicitly considered by Redl and Wine- 
man, however, is making such grants con- 
tingent on modest past improvements of be- 
havior (i.e., shaping). 

Occasionally, clinicians will argue that “pay- 
ing delinquents” is a form of bribery or a dis- 
honest maneuver. Again, some ambiguity ap- 
pears to be involved regarding very basic de- 
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tails of the reinforcement procedure. “Brib- 
ery” is usually defined as a promise of pay- 
ment for a future performance. In instru- 
mental conditioning, consequences follow de- 
sired behavior. On the variable schedule used 
in the study reported here, Ss were explicitly 
told that while they could depend on the base 
wage, bonuses were entirely the decision of E. 
Furthermore, the amount and frequency of re- 
inforcers decreased over time in contrast to 
bribery situations where the victim usually 
demands increased payoffs as he becomes 
aware of the exploitation. Technically speak- 
ing, a promise to deliver a specified payoff 
following a specified performance is consid- 
ered a “prompt.” These may be useful in ini- 
tiating behavior which is then subsequently 
reinforced by an operant procedure. Kubie 
(1958), apparently referring to the technique 
of prompting, has argued that a therapist may 
be required occasionally to use a “symbolic 
bribe.” 

. We might also consider the frequently 
voiced objection to reinforcement procedures 
on the grounds that patients will become ex- 
cessively dependent. This possibility exists, 
but its actual occurrence would likely be a 
result of poor therapeutic management. As 
previously mentioned, in our study, nonverbal 
consequences for attendance behavior were 
faded out between the fifteenth and twenty- 
fifth session, Extinction can be minimized, of 
course, by using variable schedules and gradu- 
ally increasing the interval and ratio com- 
ponents, 

In addition, other aspects of the therapy 
situation may take on secondary reinforcing 
properties, including the therapist himself. 
Therefore extrinsic consequences are not re- 
quired, This situation may be particularly 
common in the case of therapists whom ado- 
lescents find “exciting” or “stimulating,” 
Physiologically, one might attempt to measure 
such attributes by the occurrence in the pa- 
tient of the “orienting reflex” (peripheral 
vasoconstriction, cephalic dilation, galvanic 
skin response, alpha desynchronization; see 
Sokolov, 1963) which seems to be both a 
prerequisite for learning and an intrinsically 
reinforcing event. The charismatic and color- 
ful personalities of well-known therapists who 
work with behavior-disordered patients (cf. 
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Federn, 1962; Josselyn, 1957; Levenson, 
1961) might be analyzed in terms of their 
ability to elicit orienting responses from pa- 
tients who typically have a high threshold for 
arousal, One might even speculate further that 
the percentage of orienting and defensive re- 
flexes during an initial interview with a given 
therapist could serve as a prognostic index. 

This writer believes that a minimal tech- 
nology now exists for introducing adolescent 
delinquents to psychotherapy or other “work” 
situations. Except for youths who persist- 
ently use narcotics, shaping dependable and 
prompt attendance of prospective patients has 
become for our experienced staff almost a rou- 
tine matter, But just as any continuing social 
relationship tends to move from initial stereo- 
typed patterns to those which are more subtle 
and unique, the preliminary socialization tech- 
nique suggested here neither encompasses nor 
excludes the complexities and excitement of 
therapeutic interaction. It, at best, increases 
the probability that a continuing dialogue will 
occur. 
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MEASUREMENT OF SOCIAL CAUTION: 
SELF-APPRAISAL, ROLE PLAYING, AND DISCUSSION BEHAVIOR 


JAY S. EFRAN anp PAUL R. KORN? 


University 
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3 experiments indicate some characteristics of socially cautious individuals, 
selected on the basis of a modified leaderless group discussion. As predicted 
from social learning theory, the cautious participants reported lower expecta- 
tions for social success, were more reluctant to engage in other social-verbal 
tasks, were uncomfortable and inhibited at role playing, and indicated lack of 
confidence on an inventory of social behavior. Their expectations for other 
kinds of tasks were not different and they did not perform differently on a 
concept attainment task or in a conformity situation. Their responses suggested 
that the “caution” measured in the discussion procedure relates to peer rather 


than authority approval. 


Some measure of the number of individuals 
who experience “shyness” as a major personal 
problem can be found in the sales records of 
books like How to Win Friends and Influence 
People, which promise relief from social dis- 
comfort. The distress such individuals experi- 
ence can be acute; many learn to avoid social 
interaction whenever possible and find them- 
selves trembling or “tongue-tied” when forced 
to participate. Because of the surplus mean- 
ings of terms that have been used to describe 
this pattern of behavior (e.g., introversion), 
the more neutral term “social caution” will be 
used in this analysis. 

Rotter’s (1954) social learning theory sug- 
gests that a person’s reluctance to participate 
in social situations can be understood in 
terms of (a) the importance he attaches to 
approval, and (b) his expectations that given 
behaviors will lead to it. An individual who 
needs approval but lacks confidence that 
his behavior will elicit it may adopt an 
“avoidant” strategy; that is, faced with un- 
certainty about outcome, with stakes too high 
to risk experimentation, he hesitates, fails 
to respond, or withdraws. Since avoidant be- 
havior is more probable when a situation 
involves negative reinforcement (e.g., disap- 


1 Now at the University of Connecticut. The au- 
thors wish to thank Edwin H. Lieberman for aid in 
the analysis of data and preparation of the manu- 
script. In addition to the authors, a number of indi- 
viduals served as Hs: Paul Stein and Robert Lewy 
in Experiment I, Harriet M. Braunstein in Experi- 
ment II, and R. Paul McNeal in Experiment III. 
Carl Spoto helped schedule S appointments. 


proval), it may be more precise to describe 
the “shy” individual as having strong ap- 
proval needs and reciprocally strong fears of 
disapproval. (His expectations for the former 
would be relatively low, and his expectations 
for the latter, relatively high). The distinc- 
tion between positive and negative reinforce- 
ment is of importance because an individual 


<—— a E, 


who desires approval but who only minimally ‘ 


fears disapproval may display alternatives to 
avoidant behavior, such as vociferous atten- 
tion-getting behavior. While this seems like 
an infrequent pattern, it does occur and one 
can envision a background of learning experi- 
ences to account for its development. 

When the socially cautious person does pat- 
ticipate, the content of his contributions is 
generally “safe.” He waits until he has learned 
what kind of comment his “audience” will 
appreciate, or restricts his remarks to the 
patently nonoffensive. Ironically, either of 
these styles may, in itself, irritate listeners 
and bring about the disapproval the person 
was trying so hard to avoid. 

This study investigates the generality of 
socially cautious behavior as measured by 
performance in a group discussion situation. 
Evidence is provided about the expectations 
and approval needs of cautious individuals. 

The behavior of discussion participants Was 
observed during a role-playing task, a con- 
cept attainment task, a conformity task, and 
a party-stunt choice task, In addition, 5s’ 
self-evaluations were obtained on an “expect: 
ancy” questionnaire and the Minnesota In 
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ventory of Social Behavior (Williamson & 
Darley, 1937). It was hypothesized that 
cautious discussion participants would be less 
willing to engage in other kinds of social- 
evaluative performance, and would report 
lower expectations for success in social situa- 
tions. In addition, they were expected to show 
signs of discomfort during role playing, to 
perform more poorly on the concept attain- 
ment task (especially with an “audience” 
present), and to agree more often with a 


confederate in the conformity situation. 


EXPERIMENT I? 
Subjects 


Ninety-nine male college students, enrolled in an 
introductory psychology course, served as Ss. They 
participated in discussion groups of four to seven 
members. Ninety-three Ss returned individually for 
a second session, 


Procedure 


The Ss were seated around a table and given copies 
of a list of 16 tasks. They were told that they would 
each have to do one task in front of the group, and 
were led to believe that, within the limits set by 
the experimental design, task assignments would be 
based on their personal preferences. The five state- 
ments on the preference scale ranged from “1— 
These tasks sound like they might be fun or inter- 
esting” to “S—These tasks are objectionable enough 
so that I would decline doing them.” The tasks were 
adaptations of party stunts, tests, and other common 
experiences involving various degrees of physical ef- 
fort, social-evaluative performance, and unpleasant 
sensations, Sample items are: “Swallow one teaspoon 
of soapy water.” “Talk to the group for approxi- 
mately five minutes about your goals in life.” “Blow 
up a balloon until it bursts.” “Spend about five 
minutes telling the group a dramatic story about a 
college student.” 8 

After Ss indicated their task preferences, one of the 
two Es collected the lists and left the room to “assign 
tasks,” The Ss were then told to spend 25 minutes 
trying to get to know each other.” This precipitated 


* Some of the data of this experiment were Te- 
Ported in a paper read at the Eastern Psychological 
Association Convention, New York, April 1967. 

p Copies of the instructions and forms used for 
TN experiments can be obtained without charge 
X m Jay S. Efran, University of Rochester, Roches- 
ie New York, 14627, or for a fee from the National 
uxiliary Publications Service. Order NAPS Docu- 
a 00122 from ASIS National Auxiliary Publica- 
ERR Service, c/o CCM Information Sciences, Inc., 

West 34th Street, New York, New York 10001; 


remitting $1.00 for microfiche or $3.00 for photo- 
copies, 


a leaderless group discussion, during which the second 
E served as an observer. When the time was up, 
group members evaluated each other’s discussion 
performance on a sociometric nomination form. 
They had an opportunity to nominate individuals 
as having “participated actively in the discussion,” 
having “participated comparatively little,” and 
having “seemed rather shy.” They could nominate as 
few or as many individuals in each category as they 
desired. Individuals identified each other by number 
cards fastened to the table in front of each place. 
The E, using a similar form, also evaluated the 
group members. 

When the sociometric forms were collected, the 
first E returned and assigned tasks which Ss then 
performed. Task performance was not evaluated; 
the purpose of this part of the procedure was to 
insure that later Ss did not learn, via the grape- 
vine, that task preferences were inconsequential. 
The Ss were always assigned preferred tasks. Before 
leaving, they were cautioned to maintain security 
about the procedures, and were invited to volunteer, 
on an individual basis, for “another experiment” on 
“concept formation.” Almost all Ss did volunteer 
and individual appointments were scheduled. 

When each S returned for his individual appoint- 
ment, a concept attainment task (CAT) developed 
by Bruner, Goodnow, and Austin (1956) and modi- 
fied by Weick (1964) was administered. A description 
of this task by Marcia (1966, p- 554), states that it 


requires the subject to arrive at a certain combina- 
tion of attributes of cards. The subject may 
eliminate certain attributes by asking whether a 
card is positive or negative for the concept and 
he may guess the concept at any time. He is 
penalized 5 points for every request, 10 points for 
every guess, and 5 points for every 30 seconds 
that passes before he attains the concept. 


Half the Ss worked on this task in the presence of 
a confederate * in addition to E. The purpose of this 
condition was to arouse additional evaluative stress. 
The confederate played the part of an S who was 
“running overtime” and had some forms to complete. 
To “keep things on schedule” E suggested that S 
get started while the confederate finished his forms. 
This provided an excuse for the confederate, who 
presumably had just completed the CAT, to stay in 
the room and watch S attempt the task. Shortly after 
S began, the confederate pretended to finish his forms 
and he then watched S’s performance with consider- 
able interest. After the administration of the CAT. 
the confederate was dismissed. The Ss under both 
conditions next completed an expectancy question- 


4 Bruce G. Birnbaum, Kenneth H. Cowan, Bernarc 
Demka, Eric L. Glazer, Robert Hegan, David C 
Hewitt, William A. Hirsh, George McNaughton 
Stephen F. Mendel, William R. Peters, Richard L 
Reiter, David Ross, Joseph P. Sherer, Lee L. Suskin 
and Daniel F. White served as confederates fo 
Experiment I or Experiment HI. 
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naire (EQ) listing 44 skills and abilities and requiring 
Ss to indicate how they compared in these areas 
with college students their own age. For each item 
«they circled the percentage (0, 1, 4, 11, 23, 40, 60, 
77, 89, 96, or 99) of students toward whom they felt 
superior, The items dealt with activities such as 
“singing,” “getting along with teachers,” “wood- 
working,” “mathematics,” etc. 

For most Ss, scores on the Minnesota Inventory 
of Social Behavior (MISB)—a self-report measure of 
social anxiety—were available from an earlier 
(unrelated) group testing. The Ss who had been 
absent for that testing were asked to complete this 
form before leaving. 


Results 


Twenty-eight Ss were designated as active 
participants (AP) because they were nomi- 
nated as “active” by all group members and 
the observer, and because no one considered 
them “shy” or having “participated little.” 
Twenty Ss were classified as “socially cau- 
tious” (SC) because they received 20% or 
fewer nominations as “active” and 60% or 
more nominations in the “participated little” 
category. Twelve of these 19 Ss were viewed 
as minimal participants by all the raters. 
Most were considered “shy” by a large pro- 
portion of the group, and none received fewer 
than 40% nominations in this category. A 
conservative estimate of the reliability of the 
nomination form was obtained by comparing 
the nominations of the group members with 
those of the observer. The biserial correlations 
(N = 93, r = .71, .80, 56) for the three ques- 
tions on which the classification of Ss was 
based indicate satisfactory reliability. 

There was a significant difference between 
AP and SC Ss’ preferences for the two tasks 
(combined) which directly involved verbal, 
social performance; telling the group a story 
and talking to them about goals in life 
(t = 2.69, df = 46, p < 01). The groups did 
not differ in their preferences for tasks which 
mainly required physical or intellectual effort 
or involved general aversiveness. 

On an intuitive basis, six EQ items were 
expected to differentiate between AP and SC 
Ss. Differences significant at the .05 level or 
better were found for four of these items: 
making conversation (t = 3.58), getting along 
easily with most people (f = 2.60), leading 
groups (t= 2.57), social skills, generally 
(t = 3.08). Near significant differences were 
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found for the other two: making male friends 
(t = 2.01), and making a good impression at 
social functions (¢ = 1.81). Two items in- 
volving social and/or verbal skills, but not 
as closely related to the discussion task as 
the above items, produced significant differ- 
ences: making female friends (¢ = 2.47) and 
directing plays (t = 2.12). In each case, SC 
Ss reported lower expectations for success 
than AP Ss. While the groups clearly dif- 
fered in expectations for social success with 
peers, items concerning parents and other au- 
thority adults did not produce differences, 
nor did items concerning a wide variety of 
skills and abilities not directly related to 
social and verbal competence. 

The scores of AP and SC Ss on the MISB, 
which has face validity for the measurement 
of lack of confidence in social situations, 
yielded a significant difference in the expected 
direction (£ = 2.50, df = 41, p < .05). 

CAT performance was unrelated to any of 
the other variables of the study and the 
presence or absence of the confederate had no 
discernible effect on adequacy of performance. 


EXPERIMENT II 
Subjects 


The AP and SC Ss of Experiment I were tele- 
phoned a semester later and invited to participate 
individually in this experiment. Because of scheduling 
difficulties, only 22 AP and 18 SC Ss participated. 


Procedure 
A female E read the following instructions: 


I am going to ask you to carry on several imagi- 
nary phone conversations using this prop tele- 
phone. As you can see, the phone isn’t connected. 
As you pretend to be talking to someone, I'll just 
tape-record your responses. As a kind of warm-up 
exercise, I’d like you to pretend to call the infor- 
mation operator to get a phone number for this 
person. [A card with the name “Bill Dudley” was 
shown to the subject.] Any questions? Now re- 
member, you are to engage in a conversation. 
Okay, you can begin now. 


The following additional role-play situations were 
then given to S: 

1. You have discovered a hole in a suit you just 
brought back from the cleaners—you are sure it 
wasn’t there when you took the suit in. Call the 
cleaners about it. 

2, Your friend’s grandmother died, Call to offer 
your condolences, 
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3, Call to make a blind date with a girl whose 
phone number was given to you by a friend. 

4, A friend of yours has divulged something to 
others which you told him in confidence. Call him 
about it. 

5. Today is the birthday of a friend who lives in 
California. Call him as a birthday surprise. 

6, You have made plans for a special date. Just 
before leaving to pick up your date, she calls and 
says that she can’t go because she has to stay home 
with her sick brother. You know that she does not 
have a brother! 

The Ss performed each situation once except for 
the conversation with the dry cleaner, which they 
were asked to do again as angrily as possible. After 
each role-play, Ss indicated how “comfortable” they 
had felt on a 7-point scale ranging from “completely 
at ease” to “very ill-at-ease.” At the same time, 
E rated S’s apparent anxiety level on a 5-point 
scale, Following the role-playing, Ss were asked to 
imitate the sounds of six animals: a cat, dog, cow, 
duck, horse, and lion, These imitations were also 
recorded. 


Results 


An independent rater, who did not know 
the identity of the Ss, rated the taped role 
playing and the imitations.’ A 4-point scale 
of “cautious” performance was used, ranging 
from “an inhibited, cautious, self-conscious 
attempt” to “an uninhibited, enthusiastic at- 
tempt.” In making these judgments, the rater 
tried to avoid evaluating role-playing skill 
per se—he evaluated the caution or freedom 
with which S approached the task. Obviously, 
skill and caution are not orthogonal dimen- 
sions, but the focus was on the latter. The 
reliability of ratings, assessed by having a 
second rater make the same judgments, was 
satisfactory for role playing (7 = .70), but 
low for the animal imitations (r = .41).° The 
Ss who were rated less inhibited in role 
playing reported greater subjective comfort 
(aS 38, p< .05) and were rated as less 
anxious by Æ (r = .53, p < .01). The Ss who 
seemed less inhibited in the animal imitation 
task reported greater comfort (7 = .46, P< 
01), but were not seen as significantly less 
anxious by E (r = .21, ns). The animal imita- 
tion and role-playing ratings correlated signifi- 
cantly with each other (7 = 34, P< 05), as 
did the comfort and anxiety ratings (7 = .47, 
P< 01). 


z Andrew Hoffman served as rater. 
y One S was dropped from this analysis because 
tape transcription difficulties. 


As predicted, the role playing of AP and 
SC Ss differed. Relevant biserial correlations 
and tests are presented in Table 1. Animal . 
imitation performance was also rated signifi- 
cantly different (t= 2.48, P< 05), but 
proved a less powerful discriminator. 

The interpretation of relationships between 
role playing and the self-report measures of 
Experiment I must be tempered by the fact 
that the S sample was not random, but had 
been preselected on the basis of discussion 
performance. However, it is of interest that 
role playing relates to MISB scores (r = .36, 
p < .05), to preference for social-verbal evalu- 
ative tasks (r = .49, p < .01), and to the 
sum of social-verbal EQ items (r= 559; 
p< 01) as well as the individual items. As 
before, no significant relationships were found 
with task and EQ items less directly social or 
verbal in nature. 


EXPERIMENT IIT 
Subjects 


A new group of 79 male Ss enrolled in an intro- 
ductory psychology course participated in discussion 
groups of four to seven members. The procedure 
for the selection session was the same as that used 
in Experiment I, except that slightly modified task 
preference and sociometric forms were used. On the 
basis of this procedure, 24 Ss were designated as 
AP and 18 were classified as SC. These Ss were 
then telephoned by a different E, and 16 AP and 
9 SC Ss agreed to participate in an experiment on 


TABLE 1 


Ratincs OF ROLE PLAYING FOR ACTIVE AND SOCIALLY 
Cautious Discussion PARTICIPANTS 


Rating M |SD| t Paa 
Anxiety observed by E 
Active participants | 29.00" | 4.38 h 
Cautious participants | 21.22 | 6.02 4.60%) .60 
Comfort reported by S 
Active participants 44.14» | 5.14 3.95*| .53# 
Cautious participants | 37.83 4.90 |>" 3 
Inhibition evaluated 
from tape recordings| 
Active participants 12.55° | 2.66 | 5 gox| 63+ 
Cautious participants | 7.94 3.01 |> : 
Note.—For active participants, N = 22; for cautious 
participants, N = 18. 
*p <.001. 


a A higher score indicates less anxiety. 
b A higher score indicates greater comfort. 
© A higher score indicates less inhibition. 
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“taste perception.” They were unaware of any con- 
nection between this request and their previous 
participation. 
Procedure 

The “taste perception” session used a conformity 
procedure developed by Rettig and Sinha (1966): 
A few drops from each of 16 bottles were placed 
on the tongues of an S and a confederate to test 
“taste thresholds.” Ss were told that each sample 
would be sweet, sour, salty, or bitter, but actually 
only four bottles contained flavoring (sugar, salt, 
lemon juice, and quinine sulphate). The rest con- 
tained water and food coloring. On some trials S$ 
responded first, and on others the confederate re- 
sponded first. The confederate’s responses were, of 
course, predetermined. After this taste test, the S 
and the confederate completed forms which asked 
about their confidence in making judgments, their 
feelings about the independence of their judg- 
ments, and their perception of the purpose of the 
experiment. 


Results 


As in Experiment I, the 24 AP and 18 SC 
Ss differed in their responses to the combined 
social-verbal evaluative items on the task 
preference form (¢ = 3.01, p< 01).7 Again, 
no other items were significantly related to 
the sociometric classification, 

It was predicted that SC Ss, compared with 
AP Ss, would be more influenced by the 
confederate’s responses. Neither the scores 
indicating conformity to the confederate’s 
responses nor those indicating taste accuracy 
reveal differences between the groups. On the 
posttest questionnaire there was a tendency 
for the SC Ss to report that the “other S’s” 
responses had a greater effect on them, but 
this finding did not reach significance with 
the small sample used (t = 1.94, df = 23, 
p< .10). 


Discussion 


Some characteristics of active and cautious 
discussion participants can now be described. 
In accord with social learning theory pre- 
dictions, cautious Participants acknowledge 
lower expectations for success in a variety 
of social and verbal skills. This does not re- 
flect a general questionnaire response bias 

7The revised task preference form used in this 
study contained three social-verbal evaluative items: 
telling the group a story, talking to them about 


goals in life, and portraying a master of ceremonies 
introducing a juggling act, 
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since their stated expectancies for other skills 
and abilities are the same as those of active 
participants. Their lack of social confidence 
is also seen in their response to the MISB, 
Moreover, the significant difference between 
the scores of active and cautious participants 
on this instrument serves as an indication 
of its validity. 

The generality of the behavior tapped by 
the sociometric classification used in these 
studies is evidenced by the replicated dif- 
ference in the willingness of active and cau- 
tious individuals to perform other social- 
verbal tasks and by the differences in their 
role playing. These latter differences are par- 
ticularly striking: Cautious discussants appar- 
ently look, feel, and sound more inhibited and 
anxious when role playing. Further, this rela- 
tionship between discussion behavior and role 
playing was found despite the fact that the 
tasks involved somewhat different skills, and 
were administered a semester apart in dif- 
ferent settings (one individually and one in 
a group) by Es of different sexes. 

The results just discussed indicate that the 
AP-SC classification measures a characteristic 
more general than discussion behavior in a 
male group. However, the experiments also 
provide data indicating some of the limits of 
this generality. The hypotheses that SC Ss 
would perform more poorly on the CAT, 
would be more sensitive to “audience” pres- 
sure on this task, and would go along with 
the confederate’s choices more often on the 
“taste test,” were all based on the assumption 
that a cautious style in the discussion situa- 
tion would have very general implications for 
performance on other kinds of tasks. None of 
these hypotheses was supported. The most 
patsimonious interpretation of this outcome 
is that the Ss we identified as “cautious” had 
in common only a lack of confidence concern- 
ing tasks construed as primarily social and/ot 
verbal. This is, of course, what the Ss told us 
on the EQ: they did not report lower expec- 
tations for success in intellectual, athletic, 
mechanical, or artistic endeavors. Therefore, 
the CAT task, which was presented as intel- 
lectual, and the “taste test,” which was prê- 
sented as a Psychophysical investigation, did 
not elicit “cautious” responses. Perhaps the 
difficulties investigators have had in finding 
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personality correlates of “conformity” behav- 
ior relates to the common practice of defining 
conformity tasks as some kind of “perceptual 
judgment” situation. 

An examination of the EQ items which do 
and do not differentiate between AP and SC 
Ss indicates that while the groups generally 
respond differently to items which describe 
social skills, they do not respond differently 
to items which focus on relationships with 
parents, teachers, employers, or other adults 
of authority. Thus it would seem that the 
kind of social caution measured in the dis- 
cussion situation relates primarily to the at- 
tainment of peer reinforcement rather than 
reinforcement from authority figures. These 
findings concerning generality suggest that the 
class of situation and the reinforcing agent 
must be specified to obtain reasonable pre- 
diction. Basing measurement techniques on 
broad concepts such as “self-esteem” or “self- 
concept,” where these factors are not specified, 
should continue to lead to low-level prediction. 

Theoretically, Ss classified as “cautious” in 
this series of experiments might have had 
low expectations for social success but at- 
tached little importance to obtaining social 
reinforcements. If this were so, the Ss selected 
would not be the type proposed in the intro- 
duction, where the “shy” person was de- 
scribed as attaching importance to the receipt 
of approval. However, Experiment II pro- 
vides evidence on this point. When forced to 
tole play (by the demands of the experi- 
mental situation), SC Ss stated that they 
experienced discomfort and Æ reported con- 
gtuent symptoms of the Ss’ distress (eg., 
sweating, stammering, tremor, etc.). Together 


with our other measurements, these manifes- 
tations of evaluative concern clearly indicate 
that SC Ss were not indifferent to the social 
reinforcements of the situation, and that they 
are more accurately described as “fearful of 
appearing foolish” than “disinterested in 
social approbation.” 

The leaderless group-discussion technique 
differed from those often used (e.g., Bass, 
1954) in that no specific topic was assigned. 
The Ss were merely instructed to “get to 
know one another.” It was felt that this 
definition of the task would facilitate its 
perception as “social” rather than “intel- 
lectual,” and would help prevent Ss from 
beginning at a disadvantage because they 
had not read recent newspapers or were not 
involved with a particular campus issue. 
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Self-esteem is defined within a context of self-other orientation, and an instru- 
ment is described which is designed to measure the evaluation of the self in 
relation to significant others using topological representations of self and others 
and involving limited verbal demands. The results of a program of research are 
described which emanate from an evolving theory of social self-esteem. The 
results suggest that self-acceptance and social acceptance are inextricably 
combined and raise serious doubts about the meaning of earlier results con- 
cerning self-esteem which were based upon verbal self-reports. 


Self-esteem is usually defined as the indi- 
vidual’s perception of his worth. In evaluat- 
ing the self, however, few physical cues are 
available which provide a reliable basis for an 
estimate. Still, the individual has recourse to 
paired comparisons of the self and significant 
others; that is, self-evaluation evolves in 
terms of social reality (Festinger, 1954). Self- 
evaluation, then, emerges largely within a so- 
cial frame of reference. 

According to this definition of self-esteem, 
if the social environment changes, a corres- 
ponding change in self-esteem may be antici- 
pated. It is now proposed that the person’s 
response to the social environment is a func- 
tion of self-esteem. Self-esteem mediates so- 
cial stimuli and response (Social stimuli > 
Self-esteem — Response). It is proposed here 
that self-esteem is a component of the self 
system which regulates the extent to which 
the self system is maintained under conditions 
of strain, such as during the processing of 
new information concerning the self. Thus, for 
example, evaluations of either a positive or 
negative nature do not evoke immediate, cor- 


1The research program from which this report 
emanated was supported in part by a grant to the 
senior author by the National Science Foundation 
and in part by the United States Office of Educa- 
tion through a contract to the Center for the Ad- 
vanced Study of Educational Administration, Uni- 
versity of Oregon. 

2 Requests for reprints should be sent to Robert 
C. Ziller, Department of Psychology, University of 
Oregon, Eugene, Oregon 97403. 


responding action by the individual with high 
self-esteem. New information is examined on 
the basis of its relevance and meaning for the 
self system and is disregarded if its meaning 
tends to be tangential. In this way the or- 
ganism is somewhat insulated from the en- 
vironment or is not completely subject to mo- 
mentary environmental contingencies. 

Persons with low self-esteem, on the other 
hand, do not possess a well-developed con- 
ceptual buffer for evaluative stimuli. In Wit- 
kin’s terms (Witkin, Dyk, Foterson, Good- 
enough, & Karp, 1962), the person with low 
self-esteem is field dependent; that is, he 
tends to passively conform to the influence of 
the prevailing field or context. Since the indi- 
vidual’s behavior is directly linked to immedi- 
ate environmental circumstances and is not 
mediated or differentiated and integrated, by 
the self-concept, he is thereby inclined toward 
inconsistency. 

Thus, the concept of self-esteem as de- 
scribed here is linked to the concept of person- 
ality integration as used by Lewin (1935): 
Development was described by Lewin as in- 
cluding an increase in the number of the rela- 
tively independent subparts of the person 
(differentiation) and increasing the unity of 
the person (integration or organization). Simi- 
larly, Piaget (1947) presents the concepts of 
assimilation and accommodation. It is antici- 
pated that the behavior of persons with high 
self-esteem is more integrated and that their 
Cognitive processes are characterized by 2 
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selective consideration of relevant social ele- 
ments (stimulus control). 


COMMUNICATION AND THE MEASUREMENT OF 
SELF-ESTEEM 


Previous research concerning self-esteem has 
not emphasized sufficiently the social nature of 
the self system. The failure to incorporate and 
weight social factors within the self-evalua- 
tion framework may have contributed, in part 
at least, to the disppointing state of the inves- 
tigation of self-esteem. 

‘A second shortcoming of earlier studies in 
this area is their descriptive nature, which, 
coupled with the serious shortcomings of the 
measurement techniques, have left the area 
at a low level of theoretical development. 

Finally, and most seriously, previous re- 
search has largely involved a verbal self- 
report measure of self-esteem. Kelly (1955) 
qualifies his assertions continuously by point- 
ing to the most tentative of his assumptions, 
that S’s word labels for his constructs mean 
what the examiner thinks they mean. Finally, 
Kelly suggests that if a test 
can be arranged to produce a kind of protocol which 
can be subjected to a meaningful analysis, indepen- 
dent of words, we shall have made progress toward 


a better understanding of the client’s personal 
constructs [p. 268]. 


The approach used here involves a method 
of communcation with limited verbal demands 
and relies primarily upon rudimentary forms 
of abstraction which are assumed to predate 
verbal communication systems. 

The approach is a confluence of the ap- 
proaches of DeSoto and Kuethe (1959), Kelly 
(1955), and an evolving theory of self-other 
orientation. The measurement approach as- 
sumes that the human organism finds it 
expedient to order and categorize or to 
structure generally the multitude of self- 
surrounding stimuli, The processes used by 
the respondents are expected to be somewhat 
idiosyncratic, but owning to commonality 
among human experience, sensory Processes, 
and classification systems, the evolving ab- 
straction systems possess sufficient common- 
ality that the basis of a communication 
system exists, Some of these processes include 
extent of separation between objects (Kuethe, 
1962), number of objects in a category, and 


ordering of objects (DeSoto, London, & 
Handel, 1965). 

The most relevant ordering process with 
regard to self-esteem is what DeSoto, London, 
and Handel refer to as “spatial paralogic” 
and “linear ordering.” It is observed that 
people are prone to place elements in a linear 
ordering to the exclusion of other structures, 
and that they handle linear ordering more 
easily than most other structures (Coombs, 
Raiffa, & Thrall, 1954; DeSoto, London, & 
Handel, 1965). Indeed, DeSoto, London, and 
Handel note that serial ordering proceeds 
more readily in a rightward direction than 
in a leftward direction. The tendency to 
attribute greater importance to the object 
placed at the extreme left position in a hori- 
zontal display has been noted by Morgan 
(1944). 

The measure of self-esteem developed here 
utilized the serial ordering predilection of Ss 
within a social context. (See Figure 1.) 

The measure involves presenting a horizon- 
tal array of circles and a list of significant 
others (including the self) such as those used 
by Kelly (1955). The task requires S to as- 
sign each person to a circle. The score is the 
weighted position of the self. In accordance 
with the cultural norm, positions to the left 
are assumed to be associated with higher 
self-esteem. 

The item in Figure 1 is one of six items in 
the student form of the instrument. The other 
five self-esteem items involve the following 
sets of significant others: (a) doctor, father, 
friend, mother, yourself, teacher; (b) some- 


Dmections. The circles below stand for people. 
Mark each circle with the letter standing for one of 
the people in the list. Do this in any way you like, 
but use each person only once and do not omit 
anyone. 

F-someone who is flunking 

H-the happiest person you know 
K-someone you know who is kind 
S-yourself 
Su-someone you know who is successful 
St-the strongest person you know 


gooo 


Fic. 1. Measure of self-esteem. 
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one you know who is a good athlete, someone 
you know who is a good dancer, someone you 
know who is funny, someone you-know who 
gets good grades, yourself, someone you know 
who is unhappy; (c) an actor, your brother 
or someone who is most like a brother, your 
best friend, a dean of students, yourself, a 
salesman; (d) someone you know who is 
cruel, your grandmother, a housewife, a po- 
liceman, yourself, your sister or someone who 
is most like a sister; (e) doctor, father, 
friend, nurse, yourself, someone you know 
who is unsuccessful. 

In a study involving 75 randomly selected 
students from Grades 7 through 12, the split- 
half reliability (odd-even) was .80 corrected 
for length (Long, Ziller, & Henderson, in 
press). Split-half reliability (odd-even) for 
the adult form* was .85, uncorrected for 
length, in a study (Mossman & Ziller, 1968) 
involving 60 neuropsychiatric patients. Test- 
retest reliability for 86 sixth and seventh 
graders was .54 for the student form. 


VALIDATION 


The measure of self-esteem proposed here 
is assumed to involve social reasoning and 
a norm of hierarchical ordering of social ob- 
jects in a horizontal line from left to right. 
This assumption was examined in a series of 
separate studies. 

In the first of these (Ziller, Megas, & 
DeCencio, 1964), 45 patients in an acute 
neuropsychiatric treatment ward were pre- 
sented with seven circular pieces of white 
felt cloth 2 inches in diameter. The circles 


The six sets of social objects included in the 
adult form of the instrument are: (a) doctor, 
father, a friend, a nurse, yourself, someone you know 
who is unsuccessful; (b) doctor, father, friend, poli- 
tician, yourself, an employer; (c) someone you know 
who is a good athlete, someone you know who is 
popular, someone you know who is funny, someone 
who knows a great deal, yourself, someone you 
know who is unhappy; (d) an actor, your brother or 
someone who is most like a brother, your best 
friend, yourself, a salesman, a politically active per- 
son; (e) someone you know who is cruel, a judge, 
a housewife, a policeman, yourself, your sister or 
someone who is most like a sister; (f) a defeated 
legislative candidate, the happiest person you know, 
someone you know who is kind, yourself, someone 
you know who is successful, the strongest person you 
know. 
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were marked by symbols indicating the person 
they represented. These persons included 
nurse (N), nurses’ aide (NA), other patients 
in the ward’ (O), psychiatrist (Pi), psycholo- 
gist (Po), social worker (SW), and yourself 
(Y). A list of the symbols and their referents 
were placed on a table in alphabetical order 
for the S’s information. These were also read 
to the Ss. The Ss were instructed to arrange 
the circles on a black felt board, 2 x 24 
yards, in any way that they wished. 

In support of the serial ordering tendencies 
proposed by DeSoto, London, and Handel, the 
majority of Ss arranged the symbolic circles 
in a straight line from left to right. By 
assigning weights to the left-right positions 
and calculating the mean weighting of the 
seven symbols, the resulting order of the 
symbolized positions was: psychiatrist, psy- 
chologist, social worker, nurse, nurses’ aide, 
other patients, and yourself. It is apparent 
that a left to right status hierarchy of the 
social objects emerges. 

A similar analysis was made of the left to 
right location of a low-status other person 
by college students using the student form 
(see Table 1). It was noted that the 
“unhappy” person was placed in the last 
position to the right 48% of the time; 
“someone you know who is unsuccessful,” 
56%; “someone you know who is cruel,” 
64%; and “someone you know who is flunk- 
ing,” 59%. 

Further support for the assumption of a 
left to right paralogic was found in the associ- 
ation of the self-esteem score as previously 
derived and a second technique of scoring. 


TABLE 1 


DISTRIBUTION oF THE Location or THE “NEGATIVE 
SIGNIFICANT OTHERS” IN Four Items OF THE SELF- 
EsTEEM MEASURES, HORIZONTAL ARRANGE- 
MENT POSITIONS 


Item 6 5 4 3 2 1 
pada pee 
Unhappy IGE S T P otushi15.1. 48 
Unsuccessful | 21 | 3 | 5 | 9 5 | 56 
Cruel E nin ee: A a 3] Oe 
Flunking A T mes) bg 4 | 59 


Note—For Unhappy, N = 150; fe ful, N = 1475 
for Cruel, N = 154; for Flunking, N = 172, Percentage signs 
are omitted. 
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This involved the identification of the most 
negative, significant other for each set of 
significant others and calculating the distance, 
in number of circles, between the low-status 
other and the self. (Only four items which 
involved a clearly differentiated low-status 
other were included in this analysis. These 
items were a, b, d, and e.) This method of 
scoring was suggested when it was noted in 
Table 1 that the low-status other was some- 
times located in the left position, indicating 
that arrangement of the self and others may 
be based on other than a left to right hier- 
archical ordering in some cases. 

The correlation between the scores derived 
by these methods was .33, p < .05 (N = 163 
male and female college students). Only the 
results with regard to males were statistically 
significant, however (r= .46, N = 61, p< 
001; females, r = .14, N = 102, p< 10). 

A similar analysis of the two scoring 
methods was made using the adult form with 
male neuropsychiatric patients (Mossman & 
Ziller, 1968). The results corroborate the 
findings with regard to the male college 
sample (r = .56, N = 60, p < .001). 

In yet another study of the left-right serial 
ordering phenomenon, a children’s form of the 
self-esteem measure was used (Henderson, 
Long, & Ziller, 1965). The analysis involved 
the responses of 48 boys and girls ranging in 
age from 7 to 14 who had applied for cor- 
tective training at a reading study center, plus 
48 controls matched for age, sex, and general 
intelligence. The Ss were given a paper with 
a long horizontal line. They were next pre- 
sented in random order six circles with pic- 
tures representing self, friend, and a “smart,” 
dumb,” “funny,” and “bad” classmate. The 
children were told to paste these symbolic 
circles in a row on the line. It was found that 
children placed the “smart” classmate to the 
left and a “bad” classmate to the right to a 
Significant degree. 

s Evidence that the left-right serial ordering 
is not a phenomenon limited to persons within 
the United States is found in the analysis of 
the location of “someone you know who is 
unsuccessful” in the student form of Item e. 
a regard to 92 boys and girls from Form I 

the M.V.D.M. High School in Vesakapat- 
nam, Andhra, South India, the frequency with 


which the “unsuccessful” person was located 
in the positions from right to left was 74, 9, 
4, 0, 1, and 4. With regard to an American 
sample of 94 boys and girls, the corresponding 
frequencies were 53, 17, 15, 3, 3, and 3. 
Another test of the left to right hierarchical 
ordering assumption was the association be- 
tween the weighted position of “yourself” 
among five others including “someone you 
know who is flunking;” “the happiest person 
you know,” “someone you know who is kind,” 
“someone you know who is successful,” and 
“the strongest person you know” under condi- 
tions where the social objects were to be ar- 
ranged horizontally as in Figure 1 and verti- 
cally. In the vertical display, the higher 
position of the self is assumed to represent 


higher self-esteem. The correlation between 
these two measures was .50 (N= 82, 
p < 05). 


An analysis of the location of the lowest 
status other, “someone you know who is 
flunking,” indicates (see Table 2) that the 
number of reversals in the placement of the 
low-status other is reduced in the vertical 
arrangement (9% vs. 31%). The , vertical 
arrangement may introduce greater item 
visibility, however. 

A third approach to the validation of the 
social self-esteem (SSE) measure was a cor- 
relational analysis of SSE with existing mea- 
sures of the construct. The measures selected 
for comparison were those most frequently 
referenced in the literature (Wylie, 1961) and 
a more recent device developed for research 
purposes by Cutick (1962) and used by 
Diggory and her collaborators (Diggory- 
Farnham, 1964). With the exception of the 


TABLE 2 


DISTRIBUTION OF THE LOCATION OF “SoMEONE WHO 
1s FLUNKING” IN Two IDENTICAL SELF-ESTEEM 
ITEMS (VERTICAL VS. HORIZONTAL 


ARRANGEMENTS) 
Location | 6 | 5 | 4 | 3 | 2 | 2 
arrangement 
Vertical 9 | .06 | .06 3 3 84 
Horizontal | 31 | 1 2 3 4 59 


Note—In the vertical arrangement location, “6” was the 
ary top position in the hierarchy. In the horizontal, sre 
rangement location “6” is the first position in the left-right 
hierarchy. Percentage signs are omitted. 
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SSE, all the measures were based on self- 
reports. Thus, the  Bills-Vance-McLean 
(1951) Index of Adjustment and Values 
required S to rate himself with reference to 
each of 49 adjectives as to how often he was 
“this sort of person.” Six-week test-retest 
reliability was .90 (V = 100). 

Diggory’s Self-Evaluation Questionnaire 
asks S the percentage of time that he ex- 
pected to succeed in eight given situations. 
The reliability is not reported. 

Coopersmith’s (1959) Self-Esteem Inven- 
tory contains 54 items concerned with S’s 
perceptions in four areas: peers, parents, 
school, and self, The form was modified 
slightly to make it more appropriate for a 
college population, The Self-Esteem score is 
twice the sum of the high self-esteem items 
(as agreed upon by five psychologists) marked 
“like me” and low self-esteem items marked 
“unlike me.” Reported test-retest reliability 
after 5 weeks was .88. 

In an earlier test of the relationship be- 
tween the SSE and a single item, overall self- 
evaluation (Ridgeway, 1965), a negative but 
not statistically significant relationship had 
been found (r= —.15, N= 100, p<.5). 
Given the different theoretical frameworks 
and method of communication upon which 
the measures are based, a significant relation- 
ship was not anticipated in the present study. 
The purpose of this study in the program 
was to establish the independence of the SSE 
more systematically. 

The correlation matrix for these three 
scales and the SSE for each sex is shown in 
Table 3. None of the correlations with SSE 
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were statistically significant. Once again, sex 
differences are quite apparent in the inter- 
correlations among the measures. For male 
Ss (N = 33), the highest correlation, r = 60, 
was between Diggory’s and Bills’ measures, 
Significant correlations with regard to male 
Ss were also found between Diggory’s and 
Coopersmith’s measures (r = .37), and Bills’ 
and Coopersmith’s (r = .46). The only signifi- 
cant correlation found for female Ss (N = 53) 
was between Diggory’s and Bills’ measures 
(r= .29). These results are worrisome, even 
though they were anticipated. Yet, the results 
may be interpreted to indicate that the SSE 
and the other measures of self-esteem are in 
different psychological domains. The SSE in 
contrast to the other devices is a nonverbal, 
“low visibility” instrument, and also incorpo- 
rates a social frame of reference. 

One of the universal criticisms of the most 
frequently used measures of self-acceptance 
is that they are about equally correlated with 
socially desirable responses as they are with 
each other (Crowne, Stephens, & Kelly, 
1961). For example, the greater the tendency 
to give socially desirable responses, the less 
the reported discrepancy between self and 
ideal self. 

Using the Crowne-Marlowe (1964) measure 
of socially desirable response tendencies, an 
relating it to the SSE as well as Diggory’s 
measure of self-evaluation, correlations 0 
—36 and 65 (N = 24, p< .05 for both) 
were found for sophomore female volunteers 
for an experiment. Higher self-esteem as mea- 
sured by the Diggory device was associated 
with a tendency to give socially desirable re- 


TABLE 3 
INTERCORRELATION MATRIX OF Four MEASURES OF SELF-ESTEEM 


(2) 3 4 
Measure 2 9 

Males Females Males Females Males Females 
Bills-Vance Mclean Index (1) 46** A7 60*** = zug 
Coopersmith’s Self-Esteem (2) 37* ai a 04 
Diggory’s Self-Evaluation (3) 7 —09 a 


Social Self-Esteem (4) 


Nizam males, N = 33; for females, N = 53. 


D> < 01, 
wee p < 001, 
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sponses. The opposite relationship was found 
using the SSE. 


SELF-ESTEEM AND SOCIAL ACCEPTANCE 


Turning now to construct validation pro- 
cedures, one of the earliest studies in the 
series examined the frequently hypothesized 
relationship between acceptance of self and 
acceptance by others (Mann, 1959; Wylie, 
1961). In one of the reported studies, Cooper- 
smith (1959) found that fourth, fifth, and 
sixth graders showed a significant positive 
correlation (.34) between self-esteem and 
popularity. The rationale for the relationship 
is often tautological (see Rogers, 1951, p. 
520) but the findings are nevertheless con- 
sistent. Within the present framework, self- 
acceptance and the acceptance by others are 
perceived as inextricable components of social 
self-esteem. 

The Ss in this study (Ziller, Alexander, & 
Long, 1964) were 321 sixth-grade students in 
11 classrooms from four elementary schools. 
The Ss were all white, and the composition 
of the classes remained unchanged throughout 
the school day. All Ss completed a socio- 
metric item asking them to name the five chil- 
dren with whom they would most like to play. 
Twenty-five children (17 boys and 8 girls) 
who were unchosen and 25 children (17 boys 
and 8 girls) who were most highly chosen 
from the same classes as the unchosen were 
administered one item of the social self-esteem 
measure. The social set included “doctor,” 
“father,” “friend,” “the person with whom 
you are most happy,” “mother,” “yourself,” 
ithe most successful person you know,” and 
the person with whom you are most comfort- 
able.” The directions were read to Ss. The 
mean position of the popular children (1 being 
the left position) was 3.8 and of the un- 
Popular 5.7 (¢ = 3.87, p < .005). 


Success Anp FAILURE OF POLITICAL 
CANDIDACY AND SELF-ESTEEM 


i One of the difficulties of studying changes 
n the self-concept is that conditions associ- 
ated with changes in the self-concept are not 
readily generated and are rarely encountered 
a circumstances amenable to statistical 
vid ysis. A political election, however, pro- 

les an exceptional opportunity to study 


TABLE 4 


CHANGES IN SELF-ESTEEM OF WINNING AND LOSING 
POLITICAL CANDIDATES 


Direction of change in self-esteem 


Candidates 
Increased | Decreased | No change 
Winners 15 4 + 
Losers 8 il 2 


x? = 6.01 (p < .05). 


changes in the: self-concept associated with 
winning as opposed to losing the election. 
One month prior to the 1967 state election 
in Oregon, 44 candidates for the state legis- 
lature were administered the first four items 
of the adult Form II of the SSE. The same 
items were again administered to the same 
candidates approximately 1 month after the 


. election. Each candidate was approached indi- 


vidually and completed the form in the pres- 
ence of the data collector. The results are 
provided in Table 4. Fifteen of the 23 winning 
candidates increased in self-esteem as opposed 
to 8 of the 21 losing candidates. Moreover, 
11 of the 21 losing candidates decreased in 
self-esteem as opposed to 4 of the 23 winners. 
The results are significant at the .05 level of 
confidence (x? = 5.99). 


SELF-ESTEEM AND CONSISTENCY OF 
SOCIAL PARTICIPATION 


In an experiment by Mossman and Ziller 
(1968), it was hypothesized that self-esteem 
is: (a) positively related to frequency of 
participation in group discussion, and (b) is 
associated with the organism’s consistency of 
response. Relative to the first hypothesis, it 
has already been demonstrated that high 
self-esteem is associated with social accept- 
ance. This finding suggests that in a group 
discussion the individual with high self-esteem 
will receive and expects to receive verbal and 
nonverbal cues from the other members which 
invite or support his bid for participation. 
In addition, the high self-esteem—high soci- 
ally accepted member may be expected to 
receive more social reinforcement for his par- 
ticipation. Thus, the high self-esteem—high 
socially accepted member is assumed to re- 
ceive more self-reinforcement and social rein- 
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forcement for participation in group discus- 
sion, which results in a higher level of 
participation. 

Here, high self-esteem is assumed to be 
associated with a higher potential for self- 
reinforcement and a higher probability of 
social reinforcement. To some extent self- 
reinforcement and social reinforcement may 
be complementary. When social reinforcement 
is withheld, the individual with high self- 
esteem has recourse to self-reinforcement. The 
individual with low self-esteem is more 
dependent upon social reinforcement, lead- 
ing, as we stated at the outset, to less stable 
participation. 

Borgatta (1962, p. 256) demonstrated a 
significant correlation between Cattell’s Guilt 
Proneness versus Confident Adequacy sub- 
tests and total activity in a group discussion 
(r= —.34, the more guilt, the less activity) 
and between Edwards’ Abasement subtest and 
total activity (r = —.30, the less abasement, 
the more activity). The Ss in the study were 
76 neuropsychiatric patients who were mem- 
bers of four “autonomous problem-solving 
groups” similar in purpose to those discussed 
by Fairweather (1964, p. 171), Each of the 
four groups was observed during one session 
a week over a 3-week period. The observer 
recorded the total amount of interaction units 
per individual. The adult forms of the SSE 
were administered at the end of the third 
session. 

The self-esteem scores were ordered as 
anticipated with regard to levels of verbal 
participation: high interactors (those who 
contributed more than 5% of the interaction 
units across the three sessions), 21.6 (the 
sum of six ratings); low interactors (those 
who contributed between 2% and 5% of the 
interaction units, 19.8; and the noninteractors 
(those who contributed 1% or less of the 
interaction units across the three sessions), 
17.4. Furthermore, the results were sta- 
tistically significant (F = 3.37, df = 1/57, 
p< 05). 

In order to test the second hypothesis, the 
variance of the relative frequency of inter- 
action units across the three sessions was used 
as the measure of consistency of social be- 
havior for each individual. A highly consistent 
interactor was defined as a group member 


whose relative frequency of interaction vari- 
ance was .03 or less. The .03 point of division 
created two equal-size categories of Ss with 
regard to consistency of verbal participation, 
However, only the consistency of high inter- 
actors was analyzed since consistency among 
low interactors and noninteractors would be 
a statistical artifact stemming from a ceiling 
effect. As hypothesized, there was a signif- 
cant difference (p < .05) in self-esteem scores 
between the high interactors—low consistency 
(SSE = 15.90, N = 10) and high interactors- 
high consistency (SSE = 26.73, N = 11) cate- 
gories of Ss. 


SELF-ESTEEM AND SOCIOECONOMIC STATUS 


Although Wylie’s review of the literature 
contains no reference to an analysis of a 
relationship between socioeconomic status 
and self-esteem, more recently two studies 
(Coopersmith, 1967; Rosenberg, 1965) indi- 
cate a positive relationship, although only the 
results of the first are statistically significant. 
The rationale for the relationship is that 
social status is one of the most striking 
indexes of prestige and success. Persons 
higher in the social system have more prestigi- 
ous occupations, have higher incomes, and 
tend to live in larger and more luxurious 
houses located in more desirable neighbor- 
hoods. These persons are perceived as more 
successful and tend to receive material and 
cultural benefits that might lead them to 
believe that they are generally more worthy 
than others. 

Coopersmith (1967) points out, however, 
that children’s social position emanates from 
experiences in school and the neighborhood 
rather than in an occupational context. These 
attenuating considerations notwithstanding, it 
is proposed children from higher status fami- 
lies are more apt to have ego enhancing 
material reinforcements and social reinforce- 
ments. Social self-esteem, then, is presumed to 
be a general evaluation of the self in relation 
to significant others, and socioeconomic status 
is but one component of social self-esteem: 

A reanalysis of the results of a study by 
Long, Ziller, and Henderson (in press) ™ 
volved an equal number of boys and girls of 
normative age for grade in each Grade 6 to 
12 in four schools in Queen Anne’s County; 
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Maryland. The Ss were white and lived in a 
rural area on the eastern shore of Maryland. 
The Ss had completed six items of the student 
form of the SSE. Hollingshead’s Occupational 


Scale based upon head of household’s occupa- 


tion (Hollingshead & Redlick, 1953) was 
used. Four classifications evolved which pro- 
vided classes with maximally equivalent 
numbers of Ss per class. Thus, 83 Ss whose 
fathers’ occupation was professional, business, 
office worker, or salesman constituted Class 1; 
66 Ss whose fathers were classified as skilled 
labor constituted Class 2; 77 Ss whose fathers 
were farmers constituted Class 3; and 69 Ss 
whose fathers were semiskilled or unskilled 
labor constituted Class 4. The mean self- 
esteem scores of these Ss were 23.0, 22.4, 20.4, 
and 20.5, respectively (F = 2.54, df = 3/292, 
p about .05). Self-esteem and socioeconomic 
status are positively associated in this Ameri- 
can sample of Ss. 


SELF-ESTEEM AND CULTURE 


In an analysis of cultural shaping of con- 
ception of the self, Hallowell (1955) assumes 
that the individual’s self-image and his in- 
terpretations of his own experience cannot be 
separated from the concept of the self that is 
normative in his society. 

A similar analysis to the foregoing was 
made possible by the availability of the caste 
of Indian Ss in reanalysis of a cross-cultural 
study (Long, Ziller, Ramana, & Reddy, 1966). 
The Ss consisted of 50 boys and 50 girls from 
Form I of the M.V.D.M. High School in 
Vesakapatnam, Andhra, South India. The 
children ranged in age from 10 to 14 with a 
Median age of 12. Six items of the student 
form of the SSE were used. The instructions 
were read aloud in Telugu, the native lan- 
guage of the children, by one of the Indian Es. 

The Ss were found to be members of four 
castes: (a) Caste 1 (Brahmin), N = 39, self- 
esteem = 27.1; (b) Caste 3 (Visya), N = 9, 
Self-esteem = 25.1; (c) Caste 4 (included 18 
types such as Satani and Najara which are all 
associated with crafts), M = 48, self-esteem 
= 29.3; (d) Caste 5 (Harijan and Relly), N 
= 4, self-esteem = 30.3. Because of the ex- 
tremely small number of Ss in two of the 
Cates, Castes 1 and 3 were combined and com- 
Pared with Castes 4 and 5. The results were 


statistically significant at. the .05 level of con- 
fidence (F = 5.6, df = 1/98). 

The results were in opposition to expecta- 
tions based upon the results of the previous 
study. In this sample of Indian Ss the highest 
self-esteem was expressed by students in the 
lowest caste (in the results above, the self- 
esteem scores were simply reversed; high 
scores represent high self-esteem). However, 
school was attended within this age group and 
in the region by only 14% of the population. 
The children from the lowest castes, then, may 
be using other children of their caste who do 
not attend school as points of reference, and 
within this frame of reference their own 
status appears extremely high. Thus, as indi- 
cated in the initial self-esteem framework, the 
field of comparison becomes crucial. Self- 
esteem is defined with regard to significant 
others. 

In this same study of Indian children it 
was possible to compare the self-esteem of a 
comparable sample of boys and girls from the 
Queen Anne’s County sample mentioned 
earlier. Here again the high score represents 
high self-esteem. The Indian and American 
means were 28.1 and 20.6 respectively (F= 
65.6, df = 1/196, p< 005). Again, however, 
the privilege of school attendance may be the 
crucial variable. The Indian student who per- 
ceives himself as being a member of a select 
group by virtue of school attendance may 
have higher self-esteem than American chil- 
dren, who all attend school as a matter of 
course. In terms of the discussion of the as- 
sociation between self-esteem and socioeco- 
nomic status, it is proffered that the school 
environment is the most salient status variable 
for the Indian children. 


SELF-ESTEEM AND CONFORMITY 


In a summary of the research concerning 
self-esteem and conformity, Wylie (1961) 
acknowledges that there is a trend indicating 
an inverse relationship; individuals with low 
self-esteem tend to be more persuasible. In 
the present study 41 high school seniors were 
administered the six-item student form of the 
SSE and then were placed in the classic Asch 
conformity situation (Asch, 1956). Under the 
nine extreme conditions where the unanimous 
majority (of four in this study) chose the line 
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vhich deviated most from the standard line, 
he biserial correlation between conformity 
and the SSE was .32, p < 05. Higher self- 
asteem was associated with higher conformity. 

As stated at the outset, previous research 
was equivocal. Nevertheless, it is compelling 
to view these results as negative. In view of 
the strong social component inherent in the 
present measure of self-esteem, however, it is 
still possible to interpret the results as indi- 
cating that the person with high self-esteem 
within a social context may not perceive con- 
forming behavior under low cost conditions as 
damaging to the self system. 


SELF-ESTEEM AND THE NEUROTIC 
PERSONALITY 


Ausubel (1952) regards self-esteem as the 
outcome of achieving a status commensurate 
with one’s conception of self-importance. He 
proposes that a devaluation of the self-con- 
cept is necessary in the face of reality and 
in order to avoid severe injury to self-esteem. 
Trauma to self-esteem may result if ego im- 
portance is devalued extremely. On the other 
hand, personality disorders may evolve from 
untenable notions of omnipotence to which 
the child is subject. 

Results of experiments which have investi- 
gated the relationship between adjustment and 
self-regard (Wylie, 1961) are equivocal. 
Again, however, the results may simply re- 
flect the shortcomings of verbal self-report 
measures of self-esteem, particularly when 
used with neuropsychiatric patients. 

In the first and second of three studies in- 
volving the SSE (Ziller, Megas, & DeCencio, 
1964), the felt circle approach described 
earlier was employed. In the first study in- 
volving a set of significant other members of 
an acute neuropsychiatric treatment ward 
(psychiatrist, psychologist, social worker, 
nurse, nurses? aide, other patients in the ward, 
and yourself), patients who had been admin- 
istered electroconvulsive shock therapy (N = 
10) from 1 to 6 weeks prior to completing the 
SSE as compared with those patients who had 
not received this treatment (W = 35) placed 
the circle representing themselves in the last 
position to the right more frequently (x*= 
12.34, p < 01). These results appear to vali- 
date the psychiatric screening techniques for 


depression since electroconvulsive shock treat- 
ment was only recommended for acutely de- 
pressed patients. 

In the second experiment in the series (Zil- 
ler, Megas, & DeCencio, 1964), 25 patients 
from the neuropsychiatric ward of an acute 
treatment center and 23 volunteers from the 
hospital staff, including personnel from several 
levels of the occupational hierarchy, arranged 
in a horizontal straight line on a black felt 
field 10 significant social elements represented 
by symbols on a piece of felt 2 inches in di- 
ameter. The elements included “mother,” 
“father,” “your wife or girl friend,” “the most 
successful person you know,” “the happiest 
person you know,” “the person with whom 
you are most uncomfortable,” ‘employer,’ 
“your doctor,” and “your friend.” The circles 
were placed in a random order on a table in 
front of the black felt field. On each circle 
there was a one-word description of the person 
whom the circle represented. A list of the s0- 
cial elements that the circles represented was 
presented in alphabetical order on the same 
table. The Ss were asked to “place them all 
on the board in a straight line according to 
some relationship that you decide upon.” 
Normals were found to have placed the “self” 
in a higher position than the patients in the 
assumed left-right hierarchy (t = 4.57, 2 < 
001). 

In the third study in the series (Ziller & 
Grossman, 1967), 90 male, acute neuropsychi- 
atric patients and 87 male employees of the 
same hospital served as Ss. Two self-esteem 
items were administered to the patients dur- 
ing the first week of admission to the hos- 
pital. In the first item, S was asked to choose 
a circle to represent himself from 10 circles 
arranged in a vertical column. Circles were 
weighted from 1 to 10, with a higher score 
associated with a higher position. 

The second measure of self-esteem Pre 
sented a horizontal array of circles. The Ss 
were also presented with the same list of 1 
significant other persons (including the self) 
used in the preceding study. Here the usual 
left-right hierarchy was assumed. L 

The results corroborate earlier findings ™ 
the series, Neuropsychiatric patients in com 
parison with normals show lower self-esteem 
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on both measures (horizontal, p < .10; verti- 
cal, P< 05). 
OVERVIEW 


The series of studies described here repre- 
sents the first phase in a program of research 
concerning self-other orientation. The program 
of research involves the integration of a theory 
of personality involving self-other perceptions 
and an instrument designed to measure the 
evaluation of the self in relation to significant 
others using topological representations of self 
and others. The outcome of the present ap- 
proach, although balanced with regard to the 
emphasis on theory, instruments, and re- 
search, rests largely upon the utility of the 
measures involved. Measurement remains the 
missing link in personality research. With re- 
gard to self-social constructs, measurement is 
dependent upon the method of communicating 
between S and the scientist. Here we have 
proposed that there are some distinct advan- 
tages to avoiding the usual verbal self-report 
approach and substituting for it a topological 
approach with limited verbal demands. 

The results tend to support the validity 
and utility of the approach to the measure of 
self-esteem, Social objects with greater value 
tend to be placed to the left in the horizontal 
display; the absolute difference between loca- 
tion of self and a low-status social object is 
significantly associated with the left to right 
location of the self; left-right location of the 
self is significantly associated with the up- 
down location of the self. Higher self-esteem 
was found to be associated with social ac- 
teptance, social participation, socioeconomic 
status (only in an American sample), identi- 
fication with parents, consistency of social be- 
havior, and the normal as opposed to the 
Neurotic personality. Finally, winning politi- 
cal candidates for state legislative offices rose 
in self-esteem, whereas losing candidates 
dropped in self-esteem. 
The results of three studies require recon- 
om. In one of these negative or very 
low correlations were found between the SSE 
and three frequently used measures of self- 
a The results are consistent with those 

an earlier study, however, and strongly 
Suggest that the SSE measures an aspect of 
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self-evaluation which is in a different factor 
region. 

The second result requiring reconsidera- 
tion concerns the tendency of low caste as 
compared with high caste Asian Indian stu- 
dents to place the symbol representing the 
self more to the left. By way of explanation, 
it was observed that only a select few of the 
low caste as opposed to a high percentage of 
the high caste attended public schools, and as 
a consequence, the high status of the selected 
low caste student in comparison with the non- 
selected low caste student was reflected in 
their self-evaluations on the SSE. Finally, 
persons with high social self-esteem were found 
to be more conforming in the Asch situation. 
It is difficult to discount this finding, even 
though previous studies concerning a relation- 
ship between self-esteem and conformity have 
been equivocal. Against the background of 
the directly supporting results, the three re- 
sults requiring qualification do not appear to 
require a reexamination of the validity of the 
concept of social self-esteem. 

The social context of self-esteem has been 
emphasized in the present approach, and, in- 
deed, the results of most of the studies are 
concerned with social behavior (i.e., popu- 
larity, frequency of participation in group 
discussions, parent-child relationships, con- 
formity). An attempt was made to describe 
the concept used here in terms of its social 
correlates. Thus, the term “social self-esteem,” 
was used. The meaning of social self-esteem 
as it evolves against the background of its 
social correlates suggests social acceptance or 
perhaps self-other confidence. The individual 
who is assured of his high self-evaluation 
within a social context is more consistent in 
social behavior and more accepted by others. 
As stated at the outset, self-acceptance and 
social acceptance are intrinsically interdepen- 
dent. 

Tf, then, the traditional self-report measure 
of self-esteem is unrelated to social self- 
esteem, a reanalysis of the meaning of the 
traditional measures is indicated. It is now 
suggested that the self-report measures indi- 
cate a socially desirable self-esteem, an evalu- 
ation of the self that the reporter is willing to 
reveal, or that he desires the other to accept. 

‘Aside from the question of the social con- 
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text of self-esteem, the limited verbal demands 
of the present approach recommend it as a 
most useful measure in cross-cultural research, 
developmental research, and research in gen- 
eral where there may be some question of the 
comparable verbal ability of the Ss in rela- 
tion to the experimental tasks. 

The fundamental assumption of the pro- 
posed measure of self-esteem is the proclivity, 
in the Western culture, for left to right linear 
ordering of objects. The findings of the pres- 
ent series of studies along with a long history 
of research appears to support this assump- 
tion. As has already been noted, the tendency 
to attribute greater importance to the object 
placed at the extreme left position in a hori- 
zontal display was recorded by Morgan in 
1944. Introspectionists such as Lashley (1961) 
and Inhelder and Piaget (1958, p. 252) have 
also noted the use of spatial imagery in think- 
ing about nonspatial orderings. 

The most systematic work in this area has 
been conducted by DeSoto (1960, 1961; De- 
Soto, London, & Handel, 1965). For example, 
in spatial imagery associated with syllogisms, 
the results suggest that the left end is the 
preferred starting point for a horizontal order- 
ing (DeSoto, London, & Handel, 1965). 

The studies presented here have attempted 
to extend this work to the development of a 
universal communication system; a sign lan- 
guage, if you will, for describing self-other 
orientations. On this assumption, a number of 
self-other configurations have been developed 
(see Ziller & Grossman, 1967; Ziller & Long, 
1966) using other spatial arrangements of 
symbols representing self and significant oth- 
ers which are designed to measure self-other 
power orientation, marginality, social interest, 
identification, identification with the major- 
ity, self-centeredness, social inclusion, and 
openness. 
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NEED FOR APPROVAL AND THREATENED NEGATIVE 
EVALUATION AS DETERMINANTS OF EXPRESSIVENESS 
IN A PROJECTIVE TEST 


HERBERT M. LEFCOURT + 
University of Waterloo, Ontario, Canada 


Constriction, or the lack of expressiveness in projective testing as measured by 
the number of human movement responses given to the Barron inkblot test, 
was predicted on the basis of Ss’ need for approval and expectancy of possible 
disapproval which derived from £’s directions. Results strongly supported the 
emphasis upon situation cues and personal motives as predictors of responses 
to projective testing. Additional findings with repression-sensitization and the 
California Psychological Inventory-Good Impression scale complimented the 
findings with need for approval. Constriction was most evident when tests 
offered personal threat but this was especially so for high need for approval 
persons seeking to make good impressions. 


Constriction, a lack of productivity, and 
stereotyped responding to projective tests 
have usually been identified as avoidant, de- 
fensive ways of coping with evaluation situa- 
tions. Defensiveness and constriction on such 
tests as the Rorschach commonly are said to 
indicate attempts on the examinees’ part to 
avoid criticism and to protect their self- 
images. From such interpretations it might be 
inferred that examinees would behave differ- 
ently in projective testing if they did or did 
not believe that the potential for criticism 
was implicit in the testing procedure. Indeed, 
there has been considerable research into the 
effects of situational cues, which could alter 
such beliefs, upon projective testing. Masling 
(1960) and Rotter (1960) have each re- 
viewed the research literature in this area 
and report substantial support for the hy- 
pothesized context effects upon projective test 
performance. Rotter (1963) has stated that S 
“attributes some purpose to the test, and the 
purpose he attributes to it may well hinge 
on odd bits of information, the setting of the 
test, differential emphasis on the words in the 
same instructions, and so forth [p. 805].” 
In other words, the ambiguity in projective 
testing produces a greater search for cues 
by Ss; therefore, Ss become more easily in- 
fluenced by situational cues than they would 
be if the tests purposes were more explicit. 


1The author wishes to express appreciation to 
Gary Eason, who helped at different stages of the 
analysis, 
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Despite this well-supported contention col 
cerning the effect of situational cues up 
projective testing, there has been little 
search reported which might further ill 
nate this finding. For instance, the questio 
could be asked: Who, or what sort of pe 
would respond to what sorts of cues in p 
jective testing? In essence, when we re 
a constricted and defensive production fron 
a patient on an inkblot test, can we reli 
attribute his defensiveness to a “person: 
style,’ to threats implicit in the evaluati 
outcomes for him, or to some combinatic 
of these? As of the present, most clinici 
would probably assume that defensiveni 
a trait exhibited by persons cross-situati¢ 
ally. Broad categorizations of defensivent 
such as repression-sensitization are develope 
with the assumpution that persons beha 
similarly across different situations. 
subjected to experimental variations, ho’ 
even predictions with this widely empl 
personality variable have been found to var 
given different cues in criterion situa 
such as projective testing (Lefcourt, 1966) 

The present investigation was designed 
test out one theoretical approach to the pr 
diction of defensiveness in projective tes 
With variables derived from social learnif 
theory (Rotter, 1954), we will attempt 
predict the degree of defensiveness in an 
blot test on the basis of an S’s needs 
expectancies as to how his test behavior maj 
reflect back upon himself, 
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In three recent investigations concerning 
productivity upon inkblot tests, the produc- 
tion of human movement (M) responses has 
been found to be a useful indicator of expres- 
siveness (the opposite of defensiveness). In 
astudy by Lefcourt and Steffy (1966) human 
movement responses were given more often 
to an examiner from whom there appeared 
to be less likelihood of censure. Since cognitive 
style investigators have reported that M re- 
sponses are positively correlated with ac- 
tive visual scanning (Luborsky, Blinder, & 
Schimek, 1965), negatively correlated with 
defensiveness in the report of conflict stimuli 
(Kagan & Moss, 1962), and negatively re- 
lated to repressive or denial type defenses 
(Gardner, Holzman, Klein, Linton, & Spence, 
1959; Levine & Spivack, 1960), these writers 
assumed that M responses reflect greater par- 
ticipation, effort, and consequently, expres- 
siveness or the opposite of defensiveness by 
Ss performing on inkblot tests. That more 
M responses were obtained from Ss who were 
less likely to anticipate censure gave some 
support to these hypotheses. 

In a microanalysis of M response produc- 
tion, Megargee (1966) found that the number 
of words used to define a percept is one of 
the factors necessary for producing M re- 
sponses. While he did not interpret his 
data in this fashion, Megargee’s findings sug- 
gest that M production requires greater verbal 
expressiveness, the “willingness to talk or 
write” which, in turn, indicates a lack of 
perceived threat in the examination situation. 

A third study of immediate relevance is 
an unpublished dissertation by Tutko (1962) 
reported by Crowne and Marlowe (1964). 
Th this study, psychotic patients were sub- 
Jected to a battery of projective tests under 
threatening versus supportive conditions. In 
the former condition the tests were described 
as Measures of “mental sickness.” The sup- 
Portive directions emphasized a concern with 
the tests per se assuring anonymity to the 
Patient, Prior to testing the patients had 
completed the Marlowe-Crowne (M-C) scale 
of need for approval which is a Lie scale type 
y measure in which Ss can endorse low 
Wobability but highly desirable self-descrip- 
've comments. A person scoring high on this 
Measure is said to need or value the approval 


of the examiner. The patients were then 
tested on the Rorschach, TAT, and Incom- 
plete Sentences tests. Responses on these tests 
were judged for “revealingness,” “pathology,” 
and “defensiveness.” While the results vary 
by test and rating, generally the threat condi- 
tion produced less revealingness and pathol- 
ogy, and greater defensiveness, especially 
among patients who had scored high in the 
need for approval. In regard to M responses 
which were not subjected to the detailed 
analyses as were the ratings, high need for 
approval patients produced less M than those 
who scored low on the M-C scale. 

From these previous studies, then, we may 
hypothesize that when Ss are confronted with 
inkblot tests, they can be readily influenced 
by Æ’s directions as to the test’s purposes. 
Since M responses reflect the degree of ex- 
pressiveness engaged in by S, then this type 
of response should vary as a function of E’s 
directions, especially when those directions 
suggest that negative evaluations can be de- 
rived from S’s expressions, This hypothesis is 
not novel and simply reiterates previous find- 
ings as to the effects of threat upon Ss en- 
gaged in projective testing. In addition, we 
are hypothesizing that Ss who have expressed 
a need for approval from the examiner, as 
indicated by their performance on the M-C 
scale, will exhibit considerably more constric- 
tion and defensiveness in the face of threat- 
ened negative evaluation than those who have 
not expressed such a need for approval. 
Specifically, we are predicting that constric- 
tion and lack of productivity, as measured 
by the amount of M responses given to an 
inkblot test, will be a function both of S's 
needs (approval) and the specific expectancy 
derived from directions that his behavior in 
the testing situation may cause him grief in 
regard to the fulfillment of that need. 

A supplementary hypothesis was added late 
in the investigation. Boor and Schill (1967) 
report that Ss who describe themselves as 
low in anxiety but who are high in the need 
for approval exhibit considerably more task- 
anxiety than their low need for approval 
peers. Since scores on Byrne’s (1961) 
Repression-Sensitization (R-S) scale were 
available for all Ss, supplementary hypotheses 
were constructed concerning the interaction 
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of the M-C and R-S scales upon M responses 
under different conditions. It was predicted 
that low-anxious (repression)-defensive (high 
M-C) Ss would be more responsive to threat 
directions than other Ss, that they should 
produce less M than any other of the three 
groups (low M-C-repressor, low M-C-sensi- 
tizer, high M-C-sensitizer) when anticipating 
negative evaluation possibilities. 


METHOD 
Subjects 


The sample for this investigation was 133 Ss, 71 
males and 62 females, who were members of the 
author’s third-year undergraduate-level course in 
personality at the University of Waterloo. 


Procedure 


During the semester, the entire class completed 
several test measures which were used to illustrate 
given theoretical orientations. Among these measures 
was the M-C scale of need for approval and Byrne’s 
R-S scale, The overall class mean for the former 
was 14.97, SD=5.46, which is comparable to 
Crowne and Marlowe’s (1964) norms. The overall 
mean was 56.76, SD =15.39 for the R-S variable, 
In later class sessions the group was divided in half 
alphabetically. One group was informed that they 
were to take an inkblot test which was not the 
Rorschach. On this test, they were instructed to be 
as imaginative as possible. They were told that 
there would be 26 inkblots shown for a minute 
apiece. Barron’s M inkblot test (Barron, 1955) was 
then presented by slide projector, The second group 
was told that they were to take an inkblot test 
which was not the Rorschach, but which had derived 
from that test. It was explained that while the ink- 
blot type tests had originally been devised to mea- 
sure such attributes as mental illness, this should 
not really be relevant to the present group. The 
students were then instructed to be as imaginative 
as they could be. Again Barron’s M measure was 
presented by slide projector. 

The specific predictions were that Ss who were 
high in the need for approval would give less M 
responses than would Ss who were low in the need 
for approval, especially when a threat of negative 
evaluation was present. Since the label of mental 
illness is commonly associated with disapprobation, 
these directions would seem to be particularly 
arousing to those Ss with a high need for approval. 

Secondarily, it was predicted that among the high 
need for approval Ss, those who described themselves 
as low in anxiety (repressors) would produce even 
less M responses than their high need for approval 
peers who admit to anxiety, While the R-S scale 
employed here is not identical to the Taylor Mani- 
fest Anxiety scale used in the Boor and Schill (1967) 
article, it has been found to be strongly correlated 
(r= 91, Joy, 1963) with that scale. 
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The first and more benign condition described 
above will be referred to as the creative imagination 
(CI) condition and the threat session as the mental 
illness (MI) condition. 


RESULTS 


The number of M responses were scored 
by two raters who established a high rate 
of agreement (r= .92, p< .01) using the 
criteria developed by Spivack, Levine, and 
Sprigle (1958). With a median split of scores 
on the M-C scales (Mdn = 15) Ss were clas- 
sified as low if their scores fell in the 0 to 14 
range, and high in the need for approval if 
their scores were 15 and above. Sixty-three 
Ss were classified as low, 70 as high in the 
need for approval; 73 Ss were run in the CI 
and 60 in the MI conditions. The analyses of 
variance reported, therefore, are based on an 
unweighted mean method for unequal N 
analyses (Winer, 1962), and the design is 
that of a 2 X 2 factorial including two levels 
of need for approval and two conditions (CI 
vs. MI). 

Two dependent measures are derived from 
Barron’s inkblot test: the number of M re 
sponses and the M threshold score. The latter 
measure is defined by the number of the 
card when S produces his first M. Since the 
cards are ordered for their pull for M, 4 
high threshold score would indicate that 5 
produces his first M response late in the series 
and can be said to require highly explicit 
stimuli to evoke an M. A low threshold would 
indicate a readiness to produce M responses 
early in the series when only minimal cues 
suggestive of M are present. While the two 
measures, number of and threshold for M 
must of necessity be related, there is some 
evidence to suggest that they are different 
enough to require separate attention (Nyman, 
1961; Spivack, Levine, & Sprigle, 1958). 

Table 1 presents the means and standard 
deviations of the number of M responses and 
of M threshold scores. 

As indicated in Table 2, there is a highly 
significant main effect. The Ss who are high 
give less M than those who are low in the 
need for approval. This effect is largely a 
counted for by the very evident constriction 
of high need for approval Ss in the MI con 
dition as is indicated by the significant inter- 
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TABLE 1 


= Means AND STANDARD Deviations or Torar M 
RESPONSES AND M THRESHOLD IN CI AND 
MI Conpitions BY Ss Low anp HIGH 
IN THE NEED FOR APPROVAL 


ae Low need for | High need for 
Condition Approval Approval 
Total M 
CI 
M 11,31 12.21 
SD 2.35 3.38 
MI 
M 11.14 8.94 
SD 2.97 2.47 
Threshold for M 
Er 
M 6.97 8.14 
SD 3.35 4.51 
MI 
M 8.73 10.28 
SD 4.95 5.50 


Note.—For low need for approval, N = 63; for high need 
for approval, N = 70. CI = creative imagination; MI = 
mental illness, 


action term. Subsequent simple effects analy- 
ses reveal that the interaction largely derives 
from differences between high need for ap- 
proval Ss across the two conditions ($ < .01), 
and between high and low need for approval 
groups in the MI condition (p < .001). It is 
evident that the hypotheses regarding need 
for approval receive strong support from the 
first analysis. 

Tn the analysis of variance for M threshold 
Presented in Table 3, the pattern of results 
appears similar to that for the number of 
M responses. The main effect for need for 
approval again reaches statistical significance. 
The Ss with a high need for approval emit 


TABLE 2 


ANALysis or VARIANCE or EFFECTS or NEED 
FoR APPROVAL AND CI versus MI 
CONDITIONS UPON THE NUMBER 
or M ResponsEs PRODUCED 


Source df MS F 
(eed for approval (A) 1} 83.05 | 9.08" 
A Vs. MI conditions (B) 1 15.97 1.86 
hata 1| 6620 | 7.7% 
of 129 8.58 


FETA = creative imagination; MI = mental illness. 


their first M later than Ss low in the need for 
approval. A trend is also found indicating 
that threshold scores tend to be higher in the 
MI condition for both groups while no inter- 
action is obtained between condition and 
need for approval. In general, the results 
obtained are less impressive with the thresh- 
old measure. The large variance for this as 
opposed to the number of M responses mea- 
sure may help to explain the lesser power 
of threshold as a dependent variable. 

In the supplementary analysis which in- 
cludes the R-S variable, low-anxious Ss or 
repressors were defined as those whose R-S 
score was below the median (0-55), sensi- 
tizers above (56 and above). The means, 
standard deviations, and Ws for this three- 
factor design are presented in Table 4. The 
analysis conducted was a 2X2X2 wn- 
weighted mean analysis for unequal Ns. While 
the results obtained for the need for approval 
variable were the same as that reported 
above, the predicted interaction between 
need for approval, R-S, and CI versus MI 
was obtained (F= 6.59, df= 1/123, p 
<.05). In the subsequent simple effects 
analyses, no differences were obtained 
among groups in the CI condition. In 
the MI condition, however, significant differ- 
ences were obtained (p< .01). To begin 
with, the low need for approval-repressor Ss 
produced more M responses than all of 
the other groups while the high need for 
approval-repressor produced less M than the 
other groups. These differences were all sig- 
nificant at least at the .05 level except for 
that between the high need for approval- — 


TABLE 3 


ANALYSIS OF VARIANCE OF EFFECTS OF NEED 
FOR AppRovAL AND CI versus MI 
CONDITIONS UPON M THRESHOLD 


Scores 
Source df MS F 

Need for approval (A) 1 | 125.7106] 5.57* 
CI vs. MI conditions (B) 1 64.1926 | 2.84** 
AXB 1 1.1935] 1< 
Error 129 | 22.5660 

Note.—CI = creative imagination; MI = mental illness. 

*p <.05. 

++p <10. 
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TABLE 4 


Means AND STANDARD DEVIATIONS or TOTAL HUMAN 
Movement Responses IN CI anp MI CONDITIONS 
BY SS CLASSIFIED BY REPRESSION-SENSITIZA- 
TION AND NEED FOR APPROVAL 


Ss cI MI 
Low need for approval 
Repressors 
M 10.73 13.05 
SD 1.60 3.61 
N 11 19 
Sensitizers 
M 11.67 10.81 
SD 2.64 2.90 
N 18 16 
High need for approval 
Repressors 
M 11.29 8.37 
SD 3.59 2.32 
N 17 19 
Sensitizers 
M 10.83 10.00 
SD 2.34 2.32 
N 18 13 


Note.—CI = creative imagination; MI = mental illness. 


repressors and sensitizers. Nevertheless, this 
difference was in the predicted direction. In 
short, the repressors or low anxious persons 
were both the lowest and highest in M pro- 
duction with need for approval being the vari- 
able that predicts the direction. When the 
groups are compared across conditions, the 
low need for approval-repressors are found to 
increase their M responding in the MI condi- 
tion (p<.05) while the high need for 
approval-repressors decrease in the number of 
M given in the MI as compared to the CI 
condition (p < .01). 

That the conditions play such an important 
role in the interaction with the personality 
variables in predicting M responses receives 
further corroboration from results with the 
California Psychological Inventory (CPI; 
Gough, 1964) which had been administered 
to Ss earlier in the year. The variable Good 
Impression (GI) on the CPI would seem to 
be similar to the need for approval. The Ss 
scoring high on GI are said to be concerned 
about how others react to them and wish to 
make a good impression, With the total 
sample of 129, the M-C need for approval 
scale was found to be related to GI, 7 = .41, 
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< .01. In the CI condition, GI and the 
number of M responses fall short of signifi- 
cance (r = —.19, ns). However, in the MI 
condition, GI and M are significantly related 
(r = —.34, p < .01). In other words, when it 
becomes clear that there is a possibility of 
negative evaluation, the wish to create a good 
impression becomes more salient as a predictor 
to the manner in which one will be expressive, 


DISCUSSION 


The results of the present investigation 
give strong support to the original contention 
that small cues present in a projective testing 
situation can dramatically affect Ss’ perform- 
ances. That we have been able to predict the 
interaction between a personality variable 
such as need for approval and the response 
to a particular kind of situational cue (threat 
of negative self-evaluation) suggests that 
more exacting methods for interpreting test 
behavior may be possible. Rather than rely- 
ing on a trait assumption in interpreting test 
results (e.g., he was defensive on the test so 
that he is henceforth described as being 
defensive), it may become possible to inter- 
pret test behavior as a function of the situa- 
tional cues in testing which can be controlled 
in interaction with personality variables which 
are more articulated in given personality theo- 
ries and supported with empirical research. 

The results of this study indicate that the 
absence of M responses can be viewed as 
indicators of constriction and defensiveness: 
We are not arguing that M is the only or 
even the best determinant for inferring de- 
fensiveness, but that it has proven useful as 4 
measure of defensiveness in this case as well as 
in previous studies (Lefcourt & Steffy, 1966): 
M responses apparently reflect the kind of 
participation and expressiveness which $$ 
might fear would reveal them more and conse- 
quently make them more vulnerable to ai 
examiner’s judgment. This is not to say that 
Ss are conscious of a relationship between 
and self-disclosure. Rather, it is assumed that 
M responses require a somewhat greater ef- 
fort and participation than a simple form 
response, and that this effort will not be 
expended when Ss are on the defensive. 

The data obtained with the GI measure 
from the CPI lends support to that obtained 
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with the need for approval scale, that the 
incidence of M responses varies as a function 
of Ss’ intentions. The Ss who desire to create 
a good impression are less expressive, or more 
constricted, when expression is perceived as 
constituting a risk to the impression they 
wish to maintain. 

It is interesting that the M threshold mea- 
sure does not appear to generate the sorts 
of results obtainable with the total number 
of M responses. Barron (1955) originally de- 
vised his test to create this threshold measure. 
Other investigators such as Spivack et al. 
(1958) have noted that very few Ss produce 
an M on every card following the threshold 
card. Consequently, there tends to be a less 
than perfect congruence between the two M 
Measures. Nyman (1961) also found that 
number of M and M threshold were related 
in different ways to a series of measures re- 
lated to the tolerance of delay. In the present 
study, M threshold is found to be a less sensi- 
tive measure than the total M score. The 
data, though in the same directions as the 
number of M, do not reach statistical signifi- 
cance possibly because of the larger variance 
secured with the threshold measure. 

The additional findings obtained with the 
Tepression-sensitization measure lead to inter- 
esting hypotheses. Since the present result 
Supports the findings of Boor and Schill 
(1967) in singling out the high need for 
approval-low anxious Ss (repressors) as a 
deviant group, further conjecturing seems 
Justifiable. It is possible to ask why persons 
who report that they are unanxious, or less 
uncertain about who and what they are, 
should at the same time be desirous of ex- 
ternal reassurance. A possible answer is that 
these persons are describing themselves as 
Nonanxious because that too is seen as neces- 
sary for the securing of approval. In other 
Words, such persons would find the admission 
of anxiety as a threat to their maintenance 
of social approval. In effect, the very feelings 
of Uncertainty or anxiety might prove, ironi- 
cally, to be further anxiety arousing for such 
Petsons. Consequently, when these persons 
ae being evaluated on a dimension relevant 
5 anxiety, their first predisposition would be 
0 avoid self-disclosure as would be evidenced 
™ constriction and withdrawal. The fact that 
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the low need for approval-repressor group, 
on the other hand, significantly increases their 
tendency to give M responses with threatened 
negative evalution may reflect the fact that 
this group feels they have nothing to hide. 
If they do not seek approval and yet deny 
anxiety, conceivably these are less troubled 
individuals who have confidence in their own 
well-being. Consequently, when being evalu- 
ated their predisposition is to be as open 
or self-disclosing as possible. In other words, 
if viewed enough, they will in all probability 
be judged favorably. However, it might also 
be argued that this group simply does not 
care what the evaluator believes since they 
are low in the need for approval. Neverthe- 
less, the unpredicted increase in M respond- 
ing with the anticipation of negative evalua- 
tion which this group exhibits would tend to 
belie this assertion. 

In conclusion, we have obtained significant 
support for our original hypotheses. Situa- 
tional cues, as slight as they were in the 
present study, have a large but differential 
impact upon Ss undergoing projective testing. 
The results likewise reflect the usefulness of 
social learning theory in predicting the kinds 
of behavior that often come to the clinician’s 
attention. 
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The purpose of this research was to demonstrate the predictive efficiency of 
self-concept measurement for the posthospital vocational adjustment of psy- 
chiatric patients. A total of 104 Ss—vocationally unplaceables, failures, suc- 
cesses, and normals—were administered the Miskimins Self-Goal-Other Dis- 
crepancy Scale, There was a clear difference as to psychopathology, i.e., dis- 
crepancies among the three aspects of self measured by the MSGO, between nor- 
mals and psychiatric patients. Unplaceable patients were shown to have high 
Self-Other minus discrepancies, indicating, generally, psychotic symptomatology, 
and specifically, difficulties in relating to other persons. The failure group’s 
notable distinguishing characteristic was high Self-Goal minus discrepancy, 
this suggesting extensive use of manipulative behavior, while successes were 
most easily identified by high Self-Goal plus discrepancy, this related most 


often to psychoneurotic diagnoses. 


Many forms of clinical data have been 
Utilized for the prediction of posthospital 
psychiatric adjustment; however, none of 
these variables has been impressively effective. 
The present study will attempt to demon- 
strate that a person’s concept of self is a 
highly effective indicator of one key aspect 
of psychiatric rehabilitation—posthospital 
vocational adjustment. 

There are various general theoretical 
formulations relating the “self-concept” to 
human behavior, psychotherapy, and psycho- 


logical research; however, if this particular 


type of theorizing is to prove genuinely fruit- 
ful, it must deal in detail with psychopathol- 
ogy. That is, self-concept theory should offer 
4 framework in which all the various devia- 
tions from normal behavior may be clearly 
Understood. A review of the literature will 
show that traditional phenomenal self-concept 
variables allow only limited differentiation 
between normals and the major psycho- 
diagnostic groups, neurotic, psychotic, and 
character disorder (e.g, Miskimins & Sim- 
mons, 1966; Rogers & Dymond, 1954; Sarbin 
& Rosenberg, 1955), and among these three 
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diagnostic groups (e.g., Friedman, 1955; Hill- 
son & Worchel, 1957). Ibelle (1960) has 
demonstrated that it is not possible, using 
typical-self-concept measurement, to distin- 
guish most forms of schizophrenia from 
normals, and the same problem arises with 
differentiation of normals and character dis- 
orders. Miskimins (1967), however, intro- 
duced a novel measurement technique, the 
Miskimins Self-Goal-Other Discrepancy Scale 
(MSGO) which, in addition to allowing dupli- 
cation of variables typically measured, permits 
the measurement of other self-concept vari- 
ables which serve to discriminate heretofore 
undistinguishable psychopathological catego- 
ries. For example, it provides a self-goal 
factor which duplicates the typical self-ideal 
Q sort and correlates highly with neurotic 
psychopathology, and a self and other’s evalu- 
ation of self factor which clearly distinguishes 
schizophrenia. 

The MSGO is a technique for measuring 
the discrepancies in a person’s alignment of: 
(a) His self-concept, (b) his goal self-concept, 
and (c) his perception of how others evaluate 
him, on a given area of concern. Miskimins 
(1967) has postulated that the measurement 
of the incongruencies among the three self- 
constellations will reflect internal tension, 
that is, anxiety and the individual’s general 
level of adjustment. Specifically, there are 
available two kinds of discrepancy scores, 
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self-goal and self-other. When an individual’s 
aspired goals are those typical to the society 
(e.g., Friendly and Warm) a plus discrepancy 
obtains, whereas culturally atypical goal 
choice (e.g., Unfriendly and Cold) results in 
a self-goal minus discrepancy. Self-other dis- 
crepancy is also bidirectional; that is, when 
others are seen as viewing one’s self very 
favorably this produces plus discrepancy, as 
opposed to the situation where others are per- 
ceived as hypercritical which produces minus 
discrepancy. The above four kinds of scores— 
Self-Goal plus, Self-Goal minus, Self-Other 
plus, Self-Other minus—represent the first 
four factors of the MSGO (see Table 1 for a 
summary of the MSGO factors). Factor 5 of 
the MSGO is similar to Factor 4 in that it is 
based upon self-other minus discrepancy, but 
unique in that it involves only very general, 
global self-related constructs (e.g., Successful 
in Life). The last factor considered in this 
paper is that labeled “Negativism” and op- 
erationally denotes those Ss who claim to de- 
sire to be more tense, less self-confident, and 
the like. For further information regarding 
the MSGO, see Miskimins (1967, 1968). 

For the purpose of this study vocational 
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adjustment has been operationalized in terms 
of “placeability” and “placement durability,” 
that is, a patient has or has not obtained a 
job, and he has kept or lost an obtained job, 
The specific predictions hypothesized for the 
relationship of the self-concepts of psychiatric 
patients and their future vocational adjust- 
ments are: (a) The placeable group will ex. 
hibit significantly higher discrepancy on the 
Self-Goal plus and minus factors, and signifi- 
cantly lower discrepancy on the remaining 
four MSGO factors (all Self-Other discrep- 
ancies and Negativism) than will unplaceable 
patients. (b) Among those placed, the voca- 
tional success group will manifest signif. 
cantly higher discrepancy on the Self-Goal 
plus and Self-Other plus factors and will score 
significantly lower on all other factors (all 
Self-Other minus factors and Negativism) 
than will the vocational failures. In addition 
to the above, it was hypothesized that since 
the MSGO is a measure of psychopathology 
the total patient group will score significantly 
higher than a matched normal group on all 
six MSGO factors. 

The rationale for the first hypothesis was 
threefold. First, it was assumed that patients 


(relates to basic overall evaluations, and he feels that 


TABLE 1 
Summary or Sıx MSGO Factors 
yee Interpretation 
Self-Goal SC-GSC_ Discrepancy: Culturally typical goals (similar to the commonly used “Self-Ideal”’ discrep- 
plus ancy, i.e., Factor 1 is related to all self-goal positive discrepancies). High scores on this factor arè 
significantly correlated with neurotic diagnoses. 
Self-Goal SC-GSC Discrepancy: Culturally deviant goals (based on rejection of usual or typical goals, i.e., Factor 
minus 2 is based on self-goal negative discrepancies). High scores on this factor are significantly correlated 
with personality disorder and psychotic diagnoses. 
Self-Other SC-PRO Discrepancy: Accepting others (others generally overvalue him, do not know him as he 
plus really” ae Tari is related to self-others’ evaluation of self positive discrepancies). High 
Scores on this factor correlate significantly with the neurotic and psychotic depressive diagnoses: 
Self-Other SC-PRO Discrepancy: Critical others—Specific, interpersonal, affective aspects of self (others genet- 
minus ally criticize him as regards his social skills, his emotional stability, his empathy for others, ety 
(Specific)| i.e., Factor 4 is based upon self-others’ evaluation of self negative discrepancies). High scores 0t 
this factor correlate significantly with most psychotic diagnoses. 
Self-Other | SC-PRO Discrepancy: Critical others—Global 
minus others regard him as “completely” deficient, ie., Factor 5 is related to self-others’ evalua 
(Global) tion of self negative discrepancies for the “General” items). High scores on this factor correlate 
Ly significantly with delusional psychotic diagnoses. 
Negativism | Megativistic Response Set: SC-GSC minus on emotional items (Ss who maintain that they wish to H 


more tense, less self-confident, etc.). High scores on this factor correlat significantly with the 
psychotic and personality disorder diagnoses. ar 


Note.—SC = self-concept; GSC = goal self-concept; PRO = perceived responses of others. 
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exhibiting high Self-Goal plus discrepancy, 
which correlates highly with neurotic psycho- 
pathology, would be the group most likely to 
benefit from psychiatric treatment, and, there- 
fore, would be a group with high likelihood of 
job obtainment. Second, Miskimins (1967) 
theorized that persons exhibiting high Self- 
Goal minus discrepancy would be character- 
ized by the ability to manipulate others, and 
this manipulative ability can be conceived of 
as a positive, although not necessarily desir- 
able, factor in obtaining placement. Finally, 
patients exhibiting severe depression (high 
Self-Other plus scores) and other psychotic 
behavior were assumed to be the most diff- 
cult to successfully treat, and therefore it was 


_ thought that placement would be unlikely. 


The basis for the second hypothesis is simi- 
lar to the first with respect to patients pro- 
viding high Self-Goal plus discrepancies; that 
is, of patients obtaining placement, those ex- 


_hibiting neurotic symptomatology are most 


likely to achieve vocational success. How- 
ever, it was thought that high Self-Goal minus 
patients, although successful in obtaining 
placement, would fail once their manipulative 
behaviors became obvious to their employers. 
The prediction that patients scoring high on 
the Self-Other plus factor would succeed, once 
placed, was based on the premise that if de- 
Pressed patients met with success (place- 
Ment), this would serve to provide reduction 
of the discrepancy for this factor (i.e., reduce 
depression). Related to the placeability pre- 
dictions, it was assumed that patients who 
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were negativistic and had considerable diffi- 
culty in relating to others (the Self-Other 
minus factors) would fail if they should hap- 
pen to obtain placement. 


METHOD 
Scale 


The hypotheses being tested involve the predictive 
abilities of six “self” factors measured by the MSGO, 
This scale provides, in addition to information typi- 
cally gathered in self-concept research, other perti- 
nent self-related data; for this reason, it appears to 
be the most comprehensive and detailed measurement 
technique readily available in this area. 


Subjects 


One hundred and four Ss were divided equally 
into the four study groups—vocational successes, vo- 
cational failures, vocationally unplaceable, and nor- 
mals. The patient samples were drawn from psychi- 
atric patients referred to the Placement Department 
at the Fort Logan Mental Health Center, Denver, 
Colorado. The 26 successes were randomly selected 
from placement referrals who, at the time of this 
research, were holding and had held the same job 
for at least 3 months (averaging over 6 months). 
The 26 failures were randomly selected from place- 
ment referrals who, at the time of this research, 
were not holding a job and had failed on one or 
more placements (never remaining on any job over 
3 months). The 26 patients classified as unplaceable 
were randomly selected from persons who, at the 
time of this research, had no job and had not held a 
job for over 6 months, during which time Fort 
Logan’s placement services had been available to 
them, The 26 normals were successfully employed 
people taken from the community and were matched 
with the other three groups on age, sex, and level of 
education. Four age groups were utilized for the 
matching: 20-30, 31-40, 41-50, and 51 and over. 
Patient and normal group composition did not differ 


TABLE 2 
Summary or Mean Scores on Six MSGO Factors 
Mean scores 
MSGO factor 
Normals [Total patients*} Placeable> | Unplaceable | Successes Failures 
Seli-Goal plus 23.731 37.038 39.827 31.462 44.077 35.577 
Self-Goal minus 0.423 1.948 2.615 0.615 1.154 4.077 
Self-Other plus 7.731 10.846 10.846 10.846 12.577 9.115 
Self-Other minus 3.692 6.205 5.827 6.962 3.577 8.077 
5 (Specific) 
elf-Other minus 1.385 2.076 1.808 2.615 0.962 2.654 
(Global) 
Ssativism 0.154 0.858 0.654 1.269 0.500 0.800 


+ Consists of ients. 
b Consists of Some dnra p unplaceable patient 
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over 1% for any of the four age groups. With re- 
spect to sex, both patient and normal groups were 
42% male and 58% female. Three ranges of educa- 
tion were used for the matching: Less than 9 years, 
9-12 years, and over 12 years, Patients and normals 
were within 2% of each other for all three ranges. 


Procedure 


As part of a routinely administered battery of 
vocational interest tests, all Ss in the patient group 
were administered the MSGO soon after admission to 
Fort Logan (upon referral to Placement). The 
period of time between MSGO administration and 
the assessment of vocational adjustment varied from 
1 to 12 months. For all patient groups the average 
elapsed time was approximately 5 months. Normals 
were tested by a research assistant who went out in 
the community, The instructions were read to all Ss 
and example items were carefully completed. The 
data used in the matching processes described above 
were obtained from self-report, and in the case of 
the patient groups, these data were verified through 
the hospital records. 


RESULTS 


The mean scores for the four study groups 
and relevant combinations of these groups on 
all factors are reported in Table 2. A 1 X 4 
analysis of variance was utilized to test for 
differences among the means of the four 
groups on each factor. Five of the six 
ANOVAs resulted in significant F ratios; 
Self-Other plus was the only factor which did 
not differentiate the four groups significantly. 
Table 3 presents a summary of these results. 

In order to test the specific hypotheses it 
was necessary to run mutually orthogonal 
comparisons on the presumed relationships, 
utilizing the within sum of squares from the 
calculated ANOVAs (see Edwards, 1960). 
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TABLE 3 
SUMMARY OF 1 X 4 ANALYSES OF VARIANCE 


MSGO factor F $< 
Self-Goal plus 4.736 .005 
Self-Goal minus 4.600 005 
Self-Other plus 1.674 ns 
Self-Other minus 4.357 .010 
(Specific) 

Self-Other minus 3.300 025 
(Global) 

Negativism 5.068 001 


Note.—df = 3/100, 


Although the above analysis of the Self-Other 
plus factor was not significant, it was decided 
to test the orthogonal comparisons on this 
factor in order to verify rejection of the hy- 
potheses involving this form of discrepancy. 
The results of these comparisons supported 
the hypotheses in all but four instances (see 
Table 4). Patients and normals were signifi- 
cantly different on all factors except Self- 
Other minus (Global). Vocationally place- 
able patients were significantly different from 
unplaceable patients on the Self-Goal plus and 


while a statistical “tendency” (p < .100) sup- 
porting the placeability hypothesis was found 
for the Self-Other minus (Global) discrep- 
ancy. The Self-Other plus and minus (Spe- 
cific) factors did not differentiate the place- 
able and unplaceable patient groups. For the 
success versus failure comparisons, significant 
differences in the predicted direction were 
found for the Self-Goal minus and both Self- 
Other minus factors, while the analyses of the 


TABLE 4 
Summary or MUTUALLY ORTHOGONAL Comparisons 


Normal ys. patient 


Unplaceable vs. placeable 


Success vs. failure 


MSGO factor 
t $ t $ t p 

Self-Goal plus 2.958 .001 1.754 .050 1.540 
Self-Goal minus 1.666 .050 2.060 025 2.605 
Self-Other plus 1.659 050 .001 ns 1,506 
Self-Other minus 1.985 .025 1.014 ns 2.901 

(Specific) 
Self-Other minus 1.264 ns 1.389 -100 2.522 

(Global) 
Negativism 2.901 001 2.384 .010 1,037 


minus factors and on the Negativism factor, | 


= 


| SelfGoal plus and Self-Other plus discrep- 
ancies showed supporting statistical tenden- 

cies, Successes and failures were not differen- 
tiated by the Negativism factor. 


DISCUSSION 


The results of this study strongly support 
the hypothesis that normals would exhibit less 
internal tension, that is, greater adjustment, 
than patients on all self variables measured 
by the MSGO. The only exception to the 
postulation was the lack of differentiation by 
Self-Other minus (Global). This lack of dif- 
ference, however, does not indicate, as other 
studies have (e.g., Friedman, 1955; Hillson 
& Worchel, 1957; Ibelle, 1960), that self- 
concept measures cannot separate normals 
from delusional patients. Rather, the data in 
Table 2 suggest that the nondifferentiation 
Was not a product of high discrepancy scores 
. for normals, but that the vocational successes 
low score lowered the total patient mean to a 
‘point where no difference between patients 
and normals was indicated, Indeed, a sup- 
plementary orthogonal comparison of normals 
With the combined vocational failures and un- 
Placeables significantly demonstrates that 
normals and the two “failure” groups of pa- 
tients can be differentiated by this aspect of 
the MSGO (¢ = 2.151, p < .025), while nor- 
mals and vocational successes cannot be sepa- 
ated on the same factor (t = 0.630, p = ns). 
‘With regard to the posthospital vocational 
“Aspect of this study, the results support the 
sized relationships for nine of the 12 
parisons tested. First, the findings of the 
laceable-unplaceable comparisons will be dis- 
Cussed. The hypothesis that the placeable 
Koup would exhibit significantly higher Self- 
Soal plus discrepancy upon admission than 
üld the unplaceable group was significantly 
ong . As was mentioned earlier, the rea- 
‘on for this confirmation would seem to be 
tat patients exhibiting this form of psycho- 
Pathology are more likely to be successfully 
treated than those patients manifesting the 
Mdiscrepancies characteristic of more severe 
of psychiatric disability, that is, the 
TeS and personality disorders. The sig- 
t differentiation between these two 
Eoups by the Self-Goal minus factor was not 


À 
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hypothesized on the basis of ease of symptom 
remission, rather in this case it was thought 
that the patient’s pathology would contribute 
to job placement. That is, this discrepancy 
(self-concept-goal self-concept minus) is 
highly correlated with the general diagnostic 
category of personality disorder and more 
specifically with sociopathic personality dis- 
turbance (Miskimins, 1967), a diagnosis 
characterized, in many cases, by manipula- 
tion of others; and it was assumed that this 
manipulative ability would be an asset dur- 
ing the job-seeking process. The third factor, 
Self-Other plus, failed to differentiate the 
placed and not placed groups; in fact, the 
means on this factor for these two groups are 
identical. The failure to support the hypothe- 
sis that unplaceable patients would manifest 
significantly higher self-concept—perceived re- 
sponses of others (SC-PRO) plus discrepancy 
(depression) than would placeable patients 
suggests that this discrepancy is not a deter- 
mining factor in obtaining vocational place- 
ment; however, it has been shown to be of 
some value in predicting vocational success or 
failure (see Table 4). Self-Other minus (Spe- 
cific) also failed to separate the placed and 
not placed. This finding did not support the 
hypothesis that the unplaceable patient would 
be characterized by a significantly higher 
degree of hypersensitivity to criticism of in- 
terpersonal and affective aspects of self than 
would a placeable patient. A further discus- 
sion of this finding will be presented with the 
success-failure comparisons. The fifth MSGO 
factor, which measures global feelings of 
criticism from others, did not separate the 
placed from not placed as sharply as had been 
expected (p< .100), but there was some 
predictive value indicated. The presence of a 
statistical tendency for this comparison, 
rather than significance, will also be illumi- 
nated by the discussion of success-failure 
comparisons. As was expected, the two groups 
were clearly differentiated by the Negativism 
factor, that is, the unplaceable patient ex- 
hibits significantly more Negativism (a corre- 
late of psychotic behavior) than does the 
placeable patient. This differentiation sug- 
gests that the presence of high discrepancy 
on this factor is contributory to unplaceabil- 
ity, that is, a person whose emotional makeup 
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is “negative” will also feel negative toward 
placement. 

The Self-Goal plus factor tends to differen- 
tiate vocational successes from failures in the 
direction hypothesized, that is, there is here 
indication that the potential success’s pathol- 
ogy is characterized by anxiety over attaining 
acceptable goals, while the failure’s illness 
manifests itself in other forms. The failure 
group, as hypothesized, scored significantly 
higher on the Self-Goal minus factor; thereby 
suggesting that although the presence of this 
discrepancy is useful for job obtainment, the 
potential failure’s manipulation of others soon 
becomes obvious and job termination results. 
The third factor, Self-Other plus, also tends 
to separate these two groups in the expected 
direction, thus indicating that this factor is of 
some predictive value once the patient is 
placed. As was previously postulated, a possi- 
ble reason for this success-failure separation 
would seem to be that if the depressed patient 
should meet with some success (placement), 
this would raise his self-concept and thereby 
reduce the SC-PRO plus discrepancy; and 
once this partial remission occurs the patient 
has a higher probability of job success. The 
two groups’ difference on both of the Self- 
Other minus factors were highly significant, 
and these differences not only demonstrate 
the predictive potential of these two factors 
for job success and failure, but also explain 
the lack of clear-cut distinction on these 
factors for the placed-not placed groups. On 
both of these factors the failures are closely 
aligned with the not placed in their patho- 
logical perception of criticism from others (see 
Table 2), thereby obscuring any differences 
between the placed and not placed groups. 
This lack of distinction between failures and 
the unplaceable patients is not surprising, as 
both groups form a larger category of failure. 
The success-failure comparison for Factor 6, 
Negativism, does not support the hypothesis, 
that is, emotional negativism, although pre- 
dictive of placeability, does not seem to be 
an indicator of success potential. 
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The results of this study have clearly dem. 
onstrated that the vocational adjustment of 
psychiatric patients is indicated by their con. 
cept of self, as measured by the MSGO. The 
unplaceable patient is characterized by se. 
vere Self-Other minus discrepancies (psy- 
chotic disability) and a generally negative 
emotional set, while the vocationally place- 
able patient exhibits high Self-Goal plus and 
minus discrepancies. Within the placeable 
group, the vocational successes manifest 
higher Self-Goal plus discrepancy (neurotic 
psychopathology), while the failures are 
characterized by high Self-Goal minus (ma- 
nipulative behavior) and a considerable ' 
amount of Self-Other minus tension. 
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PERSONALITY AND PERFORMANCE CHARACTERISTICS 
ASSOCIATED WITH CIGARETTE SMOKING AMONG 
COLLEGE FRESHMEN * 


DONALD J. VELDMAN ? ann OLIVER H. BOWN 
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Habitual cigarette smokers were located in a group of 2321 university freshmen. 
Smoking was related to both sex and social class. Matched samples of 401 
smokers and nonsmokers were compared on a series of 70 dependent variables. 
Smokers were significantly lower in cognitive ability, high school and college 


grades, two self-report health 


indices, self-esteem, 


work-motivation, and 


attitudes toward men. The smokers scored higher on semiprojective measures 
of orality, social confidence, and acceptance of ambiguity. The data were most 
consistent with the stimulus hunger theory of smoking motivation. 


In 1964 the Office of the United States 
Surgeon General issued an extensive report, 
Smoking and Health, which included correl- 
ative evidence linking cigarette smoking with 
a number of physical diseases. The report 
also considered the question of personality 
traits associated with the smoking habit. The 
authors summarized the available literature 
as follows: 


Despite the individual deficiencies of many of the 
studies, despite the great diversity in conceptualiza- 
tion and research methods used, and despite certain 
discrepancies in reported findings, the presence of 
some comparability between them and the relative 
consistency of findings lend support to the existence 
of a relationship between the smoking habit and a 
Personality configuration that is vaguely described 
el [United States Public Health Service, 


In a review of the literature on personality 
and smoking, Matarazzo and Saslow (1960) 
Mentioned large-scale statistical summaries in 
the 1950s which indicated that about 46% 
of the males and 31% of the females in the 
age range 18-24 smoked cigarettes. Their own 
data suggested that smoking was related to 
icome, anxiety, psychological tension, and 
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self-reports of psychological symptoms, but 
not to social class level. Among the earlier 
studies they reviewed was one by McArthur, 
Waldron, and Dickinson (1958) where 
smoking was more common among lower-class 
public school students. 

In 1961 Salber and MacMahon studied all 
high school students in Newton, Massachu- 
setts, and found that the percentages of 
smokers were greater at the lower social class 
levels. Horn (Horn, Courts, Taylor, & Solo- 
mon (1959) also found that smoking was 
more common among students from lower 
social class levels (as measured by fathers’ 
education); the association was strongest in 
Grade 9, but decreased steadily to become 
negligible in Grade 12. The authors suggested 
that this phenomenon was due to lower-class 
students beginning to smoke earlier, but the 
same effect would be produced if lower-class 
smokers were more likely to drop out of school 
before graduation. 

Another major topic of many studies of 
personality and smoking has been a hypo- 
thetical syndrome of personality traits sug- 
gesting character disorder rather than neu- 
rosis as the “pathology” underlying the 
smoking habit. Straits and Sechrest (1963) 
found that male smokers scored significantly 
higher on the Psychopathic Deviate (Pd) 
scale and lower on the Social Responsibility 
(Sr) scale of the MMPI; other data indicated 
that smokers were more “chance oriented.” 
Schubert (1959) also found a Pd scale dif- 
ference with college freshmen. Stewart and 
Livson (1966) noted that smokers scored 
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lower than nonsmokers on the Socialization 
scale of the CPI. Their other data strongly 
suggested that the “rebelliousness” of the 
smokers antedated the habit by many years. 
Carney (1967) obtained similar results with 
the same measure, and also found a positive 
correlation between smoking and the Achieve- 
ment Orientation (Ao) scale of the CPI. 
Jacobs (Jacobs, Knapp, Anderson, Karush, 
Meissner, & Richman, 1965) described the 
personality trait syndrome associated with 
smoking in terms of “oral frustration, oral 
impulsivity, defiance of authority, and living 
more in the present than in the future.” 

The evidence from self-report personality 
measures is not entirely uniform. Weatherly 
(1965) found no differences at all between 
smokers and nonsmokers on the scales of 
the Edwards Personal Preference Schedule. 
Since this instrument was designed to control 
the influence of social desirability response 
set, he suggested that the effects reported in 
other studies may only indicate that smokers 
are simply more willing to acknowledge soci- 
ally undesirable characteristics. When Jacobs 
and his colleagues (Jacobs, Knapp, Anderson, 
Karush, Meissner, & Richman, 1966) at- 
tempted a direct test of the response-set hy- 
pothesis, they found clear attitudinal differ- 
ences between smokers and nonsmokers, but 
no evidence of a generalized response set 
that could account for them. Particularly in- 
teresting is their summary of the smoker syn- 
drome as reflecting “. . . a neurotic image 
depreciatory to both self and family.” 

The characterization of the smoker as 
“neurotic” appears now and then in the litera- 
ture. Lilienfeld (1959), for instance, found 
smokers to be more neurotic, and to have 
been hospitalized more frequently than non- 
smokers. Eysenck (1960, 1963), beginning 
with the notion that both smoking and carci- 
noma are resultants of a third genotypic 
factor, hypothesized that smokers would be 
more extraverted, less rigid, and more neu- 
rotic. Data from very large samples of males 
clearly confirmed the smokers’ extraversion, 
mildly supported their flexibility, but failed 
entirely to justify the term “neurotic.” 
Eysenck has suggested that both smoking 
and overeating are symptoms of “stimulus 
hunger,” and that when smoking is stopped, 
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it is replaced by overeating—a well-known 
phenomenon. Carney (1967), in the study 
mentioned earlier, found relationships between 
sex chromatin, masculinity of body type, and 
smoking among college students, and sug. 
gested the same “third factor” link between 
smoking and cancer as a possibility. A recent 
study by Pumroy (1967) of male college 
freshmen enrolled in physical education indi- 
cated that amount of cigarette smoking was 
negatively related to grade-point average and 
positively to the probability of dropping out 
of school. These relationships were interpreted 
as due to the smokers’ supposed antagonism 
toward parents and others in authority. 

Finally, two studies by Baer (1966a, 
1966b) with college males produced rather 
surprising results that deserve further investi- 
gation. He found that cigarette smokers 
(former and current) obtained significantly 
lower scores on the mathematical portion of 
the College Entrance Examination Board 
Scholastic Aptitude Test. He also failed to 
discriminate smokers from nonsmokers on the 
basis of height, weight, or a ponderal index 
derived from these two variables. 

The analyses to be reported in the re- 
mainder of this paper were designed to re- 
solve certain ambiguities in many of the 
earlier studies, and to confirm the results of 
previous investigations where possible within 
the limits of our data. In particular, the 
variables of sex and social class were con- 
sidered important enough to enter into the 
selection of the Ss, and were treated subse- 
quently as independent variables in the design 
so that their interactions with smoking behav- 
ior could be assessed. 


INDEPENDENT VARIABLES 


The basic data from which the present 
samples were drawn were collected from 4 
total of 2321 freshmen at the University of 
Texas as part of an institutional reseat' 
project. This sample constituted the majority 
of the incoming freshman class during the fall, 
1963, semester. 

Three paper/pencil instruments were ad- 
ministered: (a) the Bown Self-Report Inver 
tory, (b) an Autobiographical Information 
(AT) form, and (c) the Veldman-Peck On 
Word Sentence Completion (Form 63). 1 


| 
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tion of a “smoker” was based on a single 
from the autobiographical form; at the 
f a health status checklist was the 


E smoke cigarettes every day 


e who checked this item was classi- 
a smoker. Although considerable varia- 
n the intensity of the habit is possible 
e, habituation can be safely assumed from 
ositive response. 
ther item on the autobiographical form 
'S to indicate the extent of his father’s 
tion. This variable was used as an opera- 
definition of socioeconomic class. Three 
ls were defined: (a) high school only, 
(b) college only, and (c) graduate degree. 

Of the original 2321 Ss, 100 failed to indi- 

fe their fathers’ educational levels. Table 1 
the distribution of the remaining 2221 
‘smoking, sex, and social class level. By 
the three sets of marginal frequencies, 
values for each of these 12 cells 


=11, p= .0005). Two-way chi-squares 
then computed for all combinations of 
ree variables. 

e chi-square value for the association 
sex and smoking was statistically sig- 
pant (x? = 4.58, df= 1, p= .03); males 
5%) were more likely to smoke than 
females (16%). The chi-square value for 
Ihe association between social class and 
4 was also significant (x? = 13.98, 


TABLE 1 


DISTRIBUTION or STUDENTS BY SMOKING, SEX, 
AND SOCIAL CLASS LEVEL 


Social class 


Totals 


‘Glass estimated from father's education: HS = high 
, G = postgraduate degree. 
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TABLE 2 

DISTRIBUTION OF SELECTED SAMPLES BY SEX 

AND FATHERS’ EDUCATION 

Fathers? Subject sex 

education 
Males Females Totals 

High school 90 41 131 
College 83 58 141 
Postgraduate 81 48 129 
Totals 254 147 401 


df = 2, p= .001); smoking was much more 
common at the highest social class level 
(23.6%), and roughly equal (16.2%) at the 
lower levels. The chi-square value for the as- 
sociation of sex and social class level was 
also significant (x= 14.99, df=2, p= 
.0009); the proportion of males at the lowest 
social class level (63.7%) exceeded those at 
the higher levels (55.4% and 55.9%). The 
residual chi-square value (Sex X Smoking x 
Social Class) did not approach statistical sig- 
nificance. 

In order to avoid biases in cell variance due 
to very disproportionate subsample sizes, 
further § selection was carried out to equate 
the groups of smokers and nonsmokers. For 
each of the 401 smokers, a nonsmoker of the 
same sex and social class level was randomly 
selected. The resulting distributions of Ss by 
sex and social class are shown in Table 2. All 
statistical analyses reported subsequently were 
based on this selected group of 802 students. 

All but two of the 70 dependent variables 
studied were continuously distributed, and 
were subjected to a three-factor analysis of 
variance design (Veldman, 1967), in which 
the three main effects (smoking, sex, and so- 
cial class) and four interactions were all tested 
against an averaged within-cells error term. 
Two categorical variables, birth-order posi- 
tion and future vocational preference, were 
analyzed by means of the chi-square tech- 
nique. 

DEPENDENT VARIABLES 
Cognitive, Somatic, and Social Variables 


Data for these variables were obtained from 
the AI form and from central records of the 


University. 
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Verbal ability and quantitative ability. 
These scores were from the Scholastic Apti- 
tude Test. 

High school quarter. This information was 
obtained from the files of the Testing and 
Counseling Center. Although a low number 
signifies a high rank, subsequent discussions 
will reflect the level of performance implied. 

Freshman grade-point average (GPA). This 
score was determined from university rec- 
ords, and covers only the initial semester dur- 
ing which enrollment and testing took place. 

Confidence in college. This score was based 
on a three-choice item from the AI form which 
asked for an estimate of certainty about even- 
tually graduating from college. Again, a low 
score signifies certainty, but the discussion will 
reflect degree of confidence. 

Weight and height. These data were ob- 
tained from the AT form. 

Health index. A six-item checklist was pro- 
vided on the AI to register various types of 
physical disorders. Since the sum of items 
checked was used as the score, a low value 
signifies health, and subsequent discussions 
will reflect this interpretation. 

High school social activity. The AI in- 
cluded space for $ to list extracurricular ac- 
tivities. These were simply counted; no at- 
tempt was made to weight various types of 
activity. 

Number of siblings. This score was based on 
information from the AI form, as was the next 
variable. 

Birth order position, Three categories were 
employed: (a) only child, (b) firstborn, and 
(c) later born. 

Future occupational choice. Data were ob- 
tained from the Testing and Counseling Cen- 
ter, and the most frequently named categories 
were used (see Tables 4 and 5). 


Self-Report Inventory Variables 


The Self-Report Inventory (Bown, 1961; 
Bown & Richek, 1967) consists of 48 state- 
ments, which are scored for eight subscales and 
a total of all items. The item statements are 
relatively straightforward self-reports of the 
respondent’s feelings toward various aspects 
of his phenomenal world. Internal consistency 
coefficients (a) are included with the scale 
descriptions that follow. 
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Self. (a = .78) Acceptance, liking, valuing 
of oneself. 

Others. (a = .65) Attitude toward peers; 
the importance of satisfactory interpersonal 
relationships to own sense of well-being. 

Children. (a = 85) Attitude toward chil- 
dren; satisfaction derived from relationships 
with them. 

Authority. (a = .53) Attitude toward non- 
familial authority figures. 

Work. (a = .70) Valuing of work and ac- 
complishment in terms of intrinsic satisfaction 
or self-enhancement. 

Reality. (a = .28) Acceptance of life as a 
process (including death); positive existential 
adjustment. 

Parents. (a = .84) Attitude toward parents; 
importance of relationships with them to own 
sense of well-being. 

Hope. (a = .66) Optimistic anticipation of 
the future. 

Total. (a = .87) Simple sum of the eight 
scales: the general positiveness of the respon- 
dent’s view of his phenomenological world. 

Detailed explication of the scoring proce- 
dures and norms are available from the au- 
thors (Bown & Veldman, 1967). In order to 
obtain more sensitive measures of attitudes 
toward Others, Children, Authority, and 
Parents, the Self score was subtracted from 
each of these scale scores to yield four addi- 
tional measures. Both the Self scores and the 
scores for attitudes toward others reflect two 
components, the “true” attitude and a gen- 
eral response set. By subtracting the Self 
score from those for other significant figures, 
the common response set is canceled and the 
“true” attitude is also made relative to S’s 
own level of self-esteem. 


Sentence Completion Variables 


As part of a research program concerning 
computer-based psychological assessment,” 4 
36-item sentence completion form was in- 
cluded in the test battery. The Ss were re- 
stricted to single-word responses, which weré 
transferred to punch cards. An experimental 
scoring procedure employing preclassified word 


8Grant 06823 from the National Institute of 
Mental Health, titled “Computer Analysis of Per- 
sonality,” codirectors Robert F. Peck and Donald J: 
Veldman. 


was then used to obtain quantitative 
n 40 variables. Most of the basic pro- 
are described elsewhere (Veldman, 


Response length. Average number of 

cters per response. 

Response variation. Number of generic 

ponse types used. 

. Consistency index. Number of identical 

onses to four paired stems, 

Pronoun responses. (e.g., “He makes me 
” 

”) 

. Evasive responses. Stem-repetitive words 

ther noninformatives. 

Proper names. (e.g., “John makes me 

sad.”) 

Nonresponses. Unanswered items. 

. Self references. (e.g., “The average per- 
i me.”) 

|, Age responses. (e.g., “I am 18.”) 

Orality. References to food, eating, 

nking, and smoking. 


Commonality. Ordinariness (high) 

j uniqueness (low) in response to a 

of self-descriptive stems. 

Somatic self-esteem. Health, strength, 

tractiveness. 

5. Social self-esteem. Extraversion versus 

shy or withdrawn. 

Cognitive self-esteem. Intellectual depth 

acuity. 

Performance self-esteem. Success or im- 
ment. 

8. General self-esteem. Mostly the sum of 

14-17, 

Emotional stability. (“My emotions are 

a?) Controlled, mild versus erratic, 


Character. Morality, stability, responsi- 


it is ______.”) Approval, acceptance 
ear, rejection. 
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24. Certainty. References to decisiveness 
versus doubt. 

25. Self-confidence. Level of conscious anxi- 
ety. 

26. Ambiguity acceptance. (“Darkness is 
.”) Acceptance, pleasure versus re- 
jection, fear, depression. 

27. Stress resistance. (“I when put 
under pressure.”) Active coping versus anxi- 
ety, quitting, or rebellion. 

28. Academic attitude. Cathexis versus re- 
jection of school and studies. 

29. General motivation, Ambition, interest, 
and effort. 

30. Attitude toward mother. (“My mother 
» 
31. Attitude toward father. (“My father is 
” 

32. Attitude toward family. Mostly the sum 
of scores 30 and 31. 

33. Attitude toward men, (“Most men are 

” 

34. Men toward women. (“Men often 
women,”’) 

35. Attitude toward women. (“Most women 
are spew diy 2 9 
36. Women toward men. (“Women often 

men,”’) 

37. Heterosexuality. Cathexis of opposite 
sex and marriage. Derived largely from scores 
33-36, considering sex of S. 

38. Attitude toward average person. (“The 
average person is 7) 

39, General interpersonality. Friendliness, 
kindness, courtesy. 

40. Unique responses. Count of responses 
that could not be located by the word-root 
system. (Less than 2% of the total.) 


As noted in the definitions above, a number 
of the scores are based on responses to single 
stems, while others are derived from particu- 
lar aspects of the entire protocol, No reliabil- 
ity data are available at present on this sys- 
tem, but Peck, Menaker, and Veldman (1966) 
reported very satisfactory agreement between 
ratings by human judges and scores from an 
earlier computer-based system. 


is 


RESULTS 


For ease of communication, the results of 
the statistical analyses will be presented in 
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TABLE 3 
RESULTS OF ANALYSES OF COGNITIVE, SOMATIC, AND SOCIAL VARIABLES 


Smoking 
Dependent variable 
$ level Direction 
Verbal ability 02 NS>S 
Quantitative ability .0001 NS>S 
High school quarter <.0001 NS>S 
Freshman GPA <.0001 NS>S 
Confidence in college 01 NS>S 
Weight = — 
Height — — 
Health index .001 NS>S 
High school social activity* .003 NS>S 
Number of siblings* — — 


= 


Sex Social class 

p level Direction p level Direction 

— — .03 C>G>HS 
<,0001 M>F -004 C>G>HS 

.003 F>M — = 

— — .003 G>C>HS 
<.0001 M>F 0006 G>C>HS 
<.0001 M>F — = 
<.0001 M>F =. = 
<.0001 M>F a = 
<.0001 F>M — = 

.02 M>F => = 


Note.—Social class estimated trom father's education: HS = high school, C = college, G = postgraduate degree. 


* Also yielded a significant Sex X Smoking interaction. 


three groups: (a) the cognitive, social, and 
somatic variables, (5) the scale scores from 
the Self-Report Inventory, and (c) the One- 
Word Sentence Completion variables. Rather 
than attempt to report these analyses in full, 
we have summarized the significant (p < 
.05) results in Tables 3, 7, and 8. Although 
seven F ratios were computed in each analy- 
sis, only nine significant interactions occurred. 
Since they all involved the smoking variable, 
they will be described below. 


Cognitive, Somatic, and Social Variables 


Nonsmokers scored higher on both the 
verbal and quantitative sections of the college 
entrance exams, but the differences were much 
greater for the quantitative variable (543 vs. 
515) than for the verbal (527 vs. 512). Non- 
smokers ranked higher in their high school 


classes (1.32 vs. 1.74), and also obtained 
higher grades during their first college se- 
mester (1.32 vs. .90), As might be expected 
from the performance differences, nonsmokers 
indicated more confidence (1.83) that they 
would eventually graduate than did smokers 
(1.97). 

Smoking was not significantly related either 
to height or to weight, but smokers reported 
more physical disorders (.55) than did the 
nonsmokers (.36). Among the males, smokers 
reported fewer (2.9) social activities in high 
school than did nonsmokers (3.8), but the 
difference between the female groups was 
negligible. With regard to family size, non- 
smokers of both sexes were equivalent; male 
smokers, however, came from somewhat larger 
families (2.0) than did female smokers (1.6): 

Table 4 contains the frequency distribution 


TABLE 4 
DISTRIBUTION OF SAMPLE BY SMOKING, SEX, SOCIAL CLASS, AND BIRTH-ORDER POSITION 
High school social class College social class Postgraduate social class 
Birth-order position 
Nonsmokers Smokers Nonsmokers Smokers Nonsmokers Smokers 
Males 
Only child 21 8 7 9 10 6 
Firstborn 30 31 45 31 45 40 
Later born 39 51 31 43 26 35 
Females 
Only child 7 9 7 3 5 9 
Firstborn 15 16 25 19 17 22 
Later born 19 16 26 36 26 17 


— 
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of the sample by sex, social class, smoking, 
and birth-order position. A three-way chi- 
square analysis was computed separately for 
males and females. Birth-order was related to 
smoking among males (x? = 9.99, df=2,p 
= 007); the proportion of smokers was much 
smaller among firstborn and only children 
than among the later born. The three-way 
interaction of smoking, social class, and birth 
order was not significant. Among females vir- 
tually no differences in proportions of smokers 
appeared among the birth-order groups. 
Tables 5 and 6 contain the frequency dis- 
tributions among the most commonly desig- 
nated occupational preferences in the male 
and female samples. Although the chi-square 
values for these tables did not approach sta- 
tistical significance, some rather striking con- 
sistencies appear. In both sex samples the Fine 
Arts group contains twice as many smokers as 
nonsmokers. In the male sample the Engineer- 
ing group contains a predominance of non- 
smokers, which is consistent with the higher 
quantitative ability of the nonsmoking Ss. 


Self-Report Inventory Scales 


The results summarized in Table 7 clearly 
evidence substantial differences in attitudes 
between the groups of smokers and nonsmok- 
ets, Smokers indicated a lower level of self- 
esteem than nonsmokers. Although the abso- 
lute levels of attitudes toward others did not 
differ, when these scores were made relative 
to self-esteem, the smokers appeared to be 
more positive than nonsmokers. Attitudes to- 
ward children were equivalent in the two 


TABLE 5 


DISTRIBUTION OF MALE SAMPLE BY SMOKING 
AND OCCUPATIONAL PREFERENCE 


Occupation Nonsmokers | Smokers | Percentage 
Fine arts 9 17 65% 
Business 32 32 50% 
Medical-dental 49 42 46% 
Engineering 64 49 | 43% 

W, government 38 39 51% 

esearch, college 

teaching 18 26 59% 
Other and un- 

decided 44 49 53% 

Totals 254 254 
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TABLE 6 


DISTRIBUTION OF FEMALE SAMPLE BY SMOKING 
AND OCCUPATIONAL PREFERENCE 


Occupation Nonsmokers | Smokers | Percentage 
Fine arts 11 22 67% 
Business 10 10 50% 
Medical—dental 20 21 51% 
Social sciences 22 25 53% 
Elementary edu- 

cation 17 15 41% 
Secondary edu- 
cation 28 20 42% 
Other and un- 
decided 39 34 47% 
Totals 147 147 


groups. In absolute terms, male smokers indi- 
cated more negative attitudes toward non- 
parental authority figures, but this effect dis- 
appeared when the scores were adjusted for 
level of self-esteem. Smokers indicated more 
negative attitudes toward work and reality. 
Smokers reported more negative attitudes to- 
ward parents, but the three-way interaction 
indicated that this was due largely to the 
extremely low scores of the lower social class, 
female smokers. When adjusted for level of 
self-esteem, no significant effects were found. 
No significant differences in optimism about 
the future appeared. Smokers obtained a 
lower average score on the total of all items, 
but a significant Sex X Smoking interaction 
revealed much larger differences between 
smokers and nonsmokers among males (M = 
131 vs. 123) than among females (M = 135 
vs. 133). The male smokers exhibited a strong 
general tendency toward negative characteri- 
zation of their own attitudes. 


Sentence Completion Variables 


A number of intriguing results appear in 
Table 8, but due to the experimental status of 
the scoring system that was used to quantify 
5’s verbal behavior, the interpretations placed 
upon the observed effects must be considered 
as tentative, and in need of confirmation from 
other sources. In some cases we will be able 
to offer such confirmatory evidence from other 
data analyzed in the present investigation, but 
other findings must await support from 
further research. 
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TABLE 7 
RESULTS oF ANALYSES OF Raw AND ADJUSTED SEL¥-REPORT INVENTORY SCALE SCORES 
Smoking Sex Social class 
SRI variable 
(Rawseotes) p level Direction Pp level Direction p level Direction 
Self 01 NS>S 
Others = = -0004 F>M — — 
Children — = <.0001 F>M -= — 
Authority* .03 NS>S .002 F>M — = 
Work <.0001 NS>S 01 F>M = — 
Reality 03 NS>S 004 F>M =? = 
Parents» 0004 NS>S — — — 
Hope = so Sat nso = = 
Total* .0009 NS>S <.0001 F>M = 
Others—Self .002 S>NS — — — — 
Children—Self = — <.0001 F>M .03 HS>G>C 
Authority—Self — — — 
Parents—Self E = — — — — 
“Sex X Smoking interaction. 


» Triple interaction. 


Nonsmokers used significantly longer re- 
sponse words than did smokers. This finding 
is consistent with the greater verbal ability of 
nonsmokers reported earlier. 

The significant interaction of Smoking X 
Social Class with regard to the use of proper 
names is difficult to explain. Among the low- 
est and highest social class levels, smokers 
used more proper names, while the reverse 
was true for the middle social class level. The 
triple interaction for nonresponses (blank) 
was even less interpretable. 

Age responses were more common among 
nonsmokers, Although this could reflect con- 
cern with a “maturity image,” such an in- 
terpretation is speculative at best. 

The highly significant tendency for smokers 
to give more “oral” responses cannot be ex- 
plained by the fact that the term “smoking” 
was included in this category; the frequency 
of this response was far too small. Unfortu- 
nately, we can offer no corroborative evidence 
from the present data for this classic, face- 
valid theoretical motivation for smoking. 

The significantly more frequent ideological 
references by nonsmokers, and their higher 
scores on the “character” variable, suggest 
that moral considerations may play a stronger 
part in the inhibition of smoking than has 
been indicated in previous studies, 


The higher scores of nonsmokers with re- 
gard to somatic self-esteem are consistent with 
their higher scores on the health index de- 
scribed earlier. 

Although the main effect for social self- 
esteem appears to favor smokers in general, 
inspection of the means for the Sex X Smok- 
ing interaction indicates that this is due al- 
most entirely to extremely low scores for fe- 
male nonsmokers on this variable. Essentially 
the same pattern characterizes the analysis 
of the general self-esteem scores. 

The significantly higher scores on the Op- 
timism scale for nonsmokers did not appear 
with the Hope scale of the Self-Report In- 
ventory, although both measures supposedly 
concern expectations for the future. 

A very interesting difference between smok- 
ers and nonsmokers appears on the Ambiguity 
Acceptance scale, which is derived entirely 
from the item, “Darkness is .” Smok- 
ers apparently feel less threatened, or more 
positive toward this stimulus. 

Nonsmokers of both sexes indicate more 
positive attitudes toward men, more positive 
expectations for the treatment of women by 
men, and a more positive attitude towar 
marriage in particular. The complementary 
scales concerning attitudes toward women, 
however, do not differentiate the smokers and 
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i TABLE 8 
RESULTS OF ANALYSES OF SENTENCE COMPLETION VARIABLES 
E Smoking Sex Social class 
One-word sentence completion 
variable 
p Direction p Direction p Direction 

1, Response length 03 NS>S = — — — 

2. Response variation tit =< .006 F>M — = 

3. Consistency index i= — .04 M>F — — 

4. Pronoun responses 

5. Evasive responses — — -004 M>F — — 

6. Proper names? 

7. Nonresponses® — — — — — = 

8. Self references = _ san w = — 

9, Age responses 02 NS>S 

10, Orality -0001 S>NS .005 M>F — — 

11. Money TE = <.0001 M>F — ae 

12. Ideology 05 NS>S == — = farts 

13. Commonality — = 02 M>F — = 

14. Somatic self-esteem .04 NS>S — i ar a 

15. Social self-esteem* 002 S>NS 04 M>F 05 G>HS>C 
16. Cognitive self-esteem — = .003 M>F = e 

17. Performance self-esteem = — .05 F>M — = 

18. General self-esteem* — — .01 M>F — ~ 

19. Emotional stability 25 — .006 M>F 04 G>C>HS 
20. Character .0008 NS>S .005 F>M — — 
21. Impulse acceptance = 

22. General mood = ra = = aan = 

23. Optimism .005 NS>S — = m: ‘=a 
24. Certainty — — — = 02 G>HS>C 
25. Self-confidence — — .006 M>F .02 C=G>HS 
26. Ambiguity acceptance .03 S>NS — e .002 G=C>HS 
27. Stress resistance — — .008 M>F — = 
28. Academic attitude = A = er a = 
29. General motivation — T = = Frond re 
30. Attitude to mother FA 
31, Attitude to father oy Gi 
32. Attitude to family — ra y> aa Ta 
33. Attitude to men .02 NS>S 003 F>M .001 C=HS>G 
34. Men toward women 04 NS>S 02 M>F STR TY 
35. Attitude to women = = 0001 M>F ia pa 
36. Women toward men = en al EA FA S 
37. Heterosexuality 01 NS>S -0003 F>M i 
38. Attitude to average person = Re a p 57 T 
39. General interpersonality — E <.0001 F>M — 
40. Unique responses — ni 7 mF aa coe 


{ Smoking X Sex interaction. 
Smoking X Social Class interaction. 
° Triple interaction. 
nonsmokers. The implications of these results 
are not at all apparent. 


CONCLUSIONS 


Some of the conclusions stated below are 
clearly supported by the present data, while 
others are offered as speculations deserving 
further research. In any event, these generali- 


zations must be limited by consideration of 
the population sampled—freshmen at a large 
university. It should also be noted that any 
inferences to be drawn from these findings re- 
garding the etiology of the smoking habit 
must be tempered by the fact that a major in- 
fluence—the smoking habits of the parents— 
could not be included in the analyses reported 
here. With these reservations, the results of 
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the present investigation may be summarized 
as follows. 


1. Smoking was more common among males 
than among females. Although relatively few 
of the differences between smokers and non- 
smokers needed qualification due to a Sex X 
Smoking interaction, the differences due to 
smoking were generally greater among males 
than among females. 

2. Smoking was more common among stu- 
dents whose fathers held graduate degrees 
than at the lower social class levels. Generally 
speaking, very few interaction effects between 
smoking and social class were observed, how- 
ever. 

3. Although smokers did not tend to come 
from larger families, among the males only the 
smokers were much more likely to be later- 
born than firstborn or only children. 

4. Smokers reported more physical disorders 
and described themselves more negatively 
with regard to somatic characteristics. They 
did not differ from nonsmokers in either 
height or weight, however. 

5. Smokers obtained lower academic apti- 
tude scores than did nonsmokers, particularly 
in the quantitative area. This may explain the 
fact that they also obtained lower grades in 
high school and in their first semester at the 
university. 

6. Although male smokers reported involve- 
ment in fewer high school social activities, 
female nonsmokers indicated less social in- 
volvement and self-assurance at the time of 
testing. Various measures of attitudes toward 
peers yielded equivocal results. 

7. The negative attitudes toward parents 
and other authority figures that are often 
assumed to characterize the smoker did not 
emerge clearly in our data. One analysis did 
suggest that the lower social class girls who 
smoked held very negative attitudes toward 
their parents. 

8. Nonsmokers revealed a strong general 
tendency to respond more positively to the 
self-report attitude measures, and also indi- 
cated a certain pride in their own moral char- 
acter. 

9. A few results taken together seem to sup- 
port the “stimulus hunger” theory of smoking 
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motivation. The “orality” of the smoking 
habit is supported by more frequent responses 
by smokers with content concerning eating 
and drinking. They also reacted more posi- 
tively to the stimulus “darkness,” when pre- 
sented as an incomplete sentence. 

10. In general, our results do not appear to 
justify the term “neurotic” as a characteriza- 
tion of the smokers’ personality syndrome— 
if indeed such a pattern exists anywhere but in 
the minds of those who seek to discourage the 
habit by means of propaganda. 
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NOTES AND COMMENTS 


INTERRELATIONSHIPS OF SEVERAL PROCESS-REACTIVE MEASURES: 


CHARLES G. WATSON 
Veterans Administration Hospital, 
St. Cloud, Minnesota 


AND 


PATRICK E. LOGUE 
Veterans Administration Hospital, 
Coral Gables, Florida 


8 previously used and potential process-reactive measures were intercorrelated 
and factored by the principal-factor solution, 2 factors were rotated. The 1st 
factor, consisting of marital status and the Phillips, Elgin, and Ullmann- 
Giovannoni scales, consisted of traditional process-reactive measures. The 
variables making up the 2nd factor do not appear to be appropriate process- 
reactive indices. Implications of the findings are briefly discussed. 


Research on the process-reactive dimension has 
proliferated in recent years as psychologists have 
used the concept in an effort to bring order to 
the schizophrenia literature. A questionable by- 
product of this activity has been the disorderly 
development of a rash of process-reactive scales 
and definitions, The operational definitions which 
have appeared in the literature include the Elgin 
Prognostic Scale (Becker, 1955, 1956, 1959; 
Wittman, 1941), the Phillips Premorbid Scale 
(Phillips, 1953), the Kantor Scale (Kantor, 
Wallner, & Winder, 1953), the MMPI Ego 
Strength (Zs) scale (Herron, 1962), the PR 
questionnaire (Ullmann & Giovannoni, 1964), 
Rorschach scores (Kantor & Herron, 1966), an 
unsuccessful MMPI scale (Johnson & Holmes, 
1967), and marital status. Moreover, other mea- 
sures, such as length of previous psychiatric hos- 
pitalization, might also be proposed as appropriate 
process-reactive indices. 

To the extent that these measures may be inde- 
pendent, psychologists run the risk of developing 
separate—and possibly conflicting—bodies of 
process-reactive literature. The purpose of the 
present project was to study the interrelation- 
ships of eight previously used and potential 
process-reactive measures by factor analysis to 
determine whether they reflect the same dimen- 
sion. 


METHOD 


Ninety-four schizophrenic patients at the St. Cloud 
Veterans Administration Hospital served as Ss. All 


1 Carried out within the framework of activities 
supported by the Veterans Administration Program 
Evaluation Staff. The authors are indebted to Ray- 
mond Collier, Theodore W. Lorei, and Jack Collier 
for help with the statistical analyses and to Vi Ko- 
horst, Catherine Tidd, and Kathleen Schelonka for 
clerical support. 


| 
| 
were nonlobotomized males under 60 with no evi- 
dence of brain damage in their clinical files. One 
hundred and one patients meeting these qualifications 
were originally chosen for the study but, because 
complete test data could not be obtained for seven 
Ss, the final N was 94. The mean age, education, and 
total months of previous neuropsychiatric hospitali- 
zation(s) for these men were 41.7 years (SD =8.), 
108 years (SD=3.3), and 118.8 months (SD= 
89.6), respectively. 

Measures taken on Ss were Beckers? 10- 
item revision of the Elgin Prognostic Scale, the 
Phillips Premorbid Scale, the Ullmann-Giovannoni 
PR questionnaire, the Barron Es Scale, months of 
previous psychiatric hospitalization, a quantified 
marital status measure (2 = married, 1 = separated 
or divorced, 0=never married), education, and age. 

Phillips Premorbid Scale and Elgin Prognostic 
Scale (10-item form) ratings had previously 
obtained on all Ss as part of another study (Watson 
& Logue, 1968). Each author independently rated 
every S on both scales and the Elgin and Phillips 
scores used in the present study were totals over the 
two judges. 

Product-moment correlations between each of the 
28 pairs of variables were calculated. Subsequently 
the matrix was factored by the _principal-factot 
solution (Harman, 1960, Ch. 9) using squared multi- 
ple correlation communalities in the diagonal. Tr 
factors were then rotated orthogonally using Kaiser’ 
(1958) normal varimax method. 


RESULTS AND DISCUSSION 


The correlation matrix is displayed in Table ! 
and the composition of the factors in Table 2 
The nature of the factors is discussed below. i 
defining the factors, only those variables with 
loadings of .40 or greater are considered. 


2W. C. Becker, Modified Elgin Prognostic Sal 
1959. (Mimeo available from W. C. Becker, Univ? 
sity of Tlinois.) 
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TABLE 1 
INTERCORRELATIONS OF Process-REACTIVE MEASURES 
Scale 2 3 4 5 6 7 8 
E a) 67 goto 360 5 7 03 
Marital status (2) —.51%* 43t — 32%" 17 AS .04 
Ullmann-Giovannoni (3) 45** wore 08 —.07 10 
Elgin (4) 45 soot —.06 17 
Length of hospitalization (5) OE 19. 5.20 
Age (6) -.07 13* 
Ego Strength (7) 26" 
Education (8) 
The first factor consisted of social adjustment loaded on it and it appears to be of no particular 
Measures and was defined largely by marital value as a process-reactive measure. 
status and the Phillips, Elgin, and Ullmann-Gio- It is particularly interesting to note that Es 
yannoni scales. More than the other factor, it reflects quite a different dimension from that 
seems to reflect conventional process-reactive defined by marital status and the Phillips, Elgin, 
Measures. It accounted for 64% of the common and Ullmann-Giovannoni scales, Even though 
factor variance. the name “Ego Strength” suggests the scale as 
A relatively small loading on this factor also an appropriate process-reactive definition and it 
appeared for length of psychiatric hospitalization, has been used as such in the past, the present 
Suggesting that the latter is not a good opera- data do not recommend it for that purpose. 
tional definition of the term although it is related The data suggest that marital status and the 
: 


to the process-reactive dimension. This finding 
also supports the contention made elsewhere 
(Lorei, 1966) that a surprisingly large share of 
the variance in length of neuropsychiatric hos- 
Pitalization is attributable to nonpatient variables 
Such as staff attitudes. 

The second factor, which accounted for 36% 
of the common factor variance, consists essen- 
tially of age and length of hospitalization, al- 
though small loadings also appeared for educa- 
tion, Zs, and Elgin scores. None of the more 
“common process-reactive measures were highly 


TABLE 2 


ROTATED VARIMAX ORTHOGONAL MATRIX 


or Factor LOADINGS 
Ueda ar E os 


Factor 
Scale M SD 

T(r 
i li 44.5 | 14.5 88 | —.14 
Marital status 40} .68|—.69| 14 
mann-Giovannoni | 13.5 | 4.4 66 | —.12 
Bein 59.9 | 13.4 67 | —.30 
ospitalization 118.8 | 89.6 31 | —.74 
te 417 | 8.2 | 07 | —65 
pe Strength 40.8 | 80 | —.11] .22 
ducation 108 | 3.3 | —.04| 34 


Phillips, Elgin, and Ullmann-Giovannoni scales 
are probably appropriate process-reactive defini- 
tions, but that neither Zs nor length of previous 
hospitalization, age, or education would serve as 
an appropriate measure. Since the Elgin and 
Phillips scores used were composites based on 
the ratings of two psychologists, a comparison of 
the loadings recorded in Table 2 cannot be used 
as a basis for selecting a single process-reactive 
measure for future use. However, since the inter- 
judge reliability of Phillips and Elgin scales is 
questionable (Watson & Logue, 1968), the data 
now available suggest that marital status and the 
Ullmann-Giovannoni scale may be the most ap- 
propriate operational definitions available, at 
least for male patients. 
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INTERNAL VERSUS EXTERNAL CONTROL AND ETHNIC 
GROUP MEMBERSHIP: 


A CROSS-CULTURAL COMPARISON + 


THEODORE TIN-YEE HSIEH, JOHN SHYBUT, anb ERWIN J. LOTSOF 
Northern Illinois University 


To investigate a relationship between Internal versus External control and 
ethnic group membership three groups of high school students (Chinese, Ameri- 
can-born Chinese, and Anglo-American) were compared in their responses to the 
Internal-External Control Scale. A hypothesis that a belief in Internal versus 
External control is significantly related to ethnic group membership was con- 


firmed. 


Internal-External (I-E) control has been de- 
scribed as a major personality variable (Rotter, 
1966) and its relationship with ethnic group 
membership has been reported (Battle & Rotter, 
1963; Graves, 1961). In contrast to previous 
studies which compared different ethnic groups 
within the United States culture, the present 
study compares Chinese, American-born Chinese, 
and Anglo-American Ss in their responses to the 
Tnternal-External Control Scale. 

The I-E may be defined as: “the extent to 
which an individual feels that his own actions 
determine what happens to him or what happens 
to him is determined by fate, luck, chance, or 
powerful forces outside his control [Rotter, 
Seeman, & Liverant, 1962].” A similar distinction 
in terms of the locus of control is also observed 
in the description of certain culture-specific per- 
sonalities. Thus, the “individual-centered” Ameri- 
can personality is associated with a culture that 


1 This study is based on an M.A. thesis submitted 
by the first author to the Graduate School, North- 
ern Illinois University. 


emphasizes the uniqueness, independence, and 
self-reliance of each individual (Hsu, 1963a). It, 
among other things, places a high value on per 
sonal output of energy for solving all problems; 
pragmatic ingenuity; individualism, that is, self- 
reliance and status achieved through one’s own 
efforts; power or ability to influence or control 
others, things, animals, and forces of nature; and 
efficiency (Gillin, 1955). Life experiences appea! 
to be largely a consequence of one’s actions. 

In contrast, the “situation-centered” Chines 
personality is associated with a culture where 
kinship and emphasis on status quo are stressed. 
The individual in this culture tends to view lis 
life as being relatively fixed. Luck, chance, and 
fate are taken for granted in life, which is Com 
sidered to be full of ambiguity, complexity, and 
unpredictability (Hsu, 1953, 1963b). Life situ: 
tions may be viewed as being largely determin? 
by circumstances outside one’s control (Eber 
hard, 1965). i 

Consequently, it may be assumed that the 1" 
ternalized cultural orientation, other things beié 


—— 


bi 
( 


be the discriminating variable in one’s 
ce for the items stressing a belief in 
yersus External control. A hypothesis 
comparison with American Ss, the Chin- 
show higher preference for items em- 
external control, was tested in this 


METHOD 


original Internal-External (I-E) scale (Rot- 
1966) was translated from English to Chinese by 
‘the first author. The scale includes 29 forced-choice 
Fy ich as: “No matter how hard you try some 
P just don’t like you vs. People who can’t get 
others to like them don’t understand how to get 
with others.” In translating the scale all items 
translated literally. Popular Chinese sayings 
‘whenever applicable were not used to avoid social 
irability. In general, the principle of translation 
comparable to that of Gough and Sandhu 
), who translated the socialization scale of 
rnia Personality Inventory for cross-validation 
. Recently the I-E scale has been translated 
yedish and used with Swedish Ss.” 
"To test the Chinese translation, three judges who 
‘know both English and Chinese were asked to trans- 
late independently the Chinese version into English. 
versions were very similar to each other and 
original English version. Subsequently, the 
se version of the I-E scale was administered to 
Chinese students at the local university. Three 
weeks later, the 16 students were given the English 
version of the scale. A highly significant correlation 
(r= 83) between the two sets of scores supports 
reliability of the translated version of the I-E 


The Ss for this study consisted of three groups of 
school students. Group I included 131 male and 
emale Anglo-American high school seniors from 
hern Illinois community near Chicago. 
oup II was composed of 38 male and 42 female 
ican-born Chinese high school seniors attending 
$ public schools in Chicago. At least one 
nt of each S was born in China. 
e Ss in Group III were drawn from three upper 
at Pui-Ching Middle School in Hong Kong. 
school is a Baptist institution with an enroll- 
of over 5,000, Generally, it is attended by stu- 
from families of the middle or middle-upper 
s, the majority of whom are not church-related 
hurch-going. Of the 343 Ss in this group, 241 
males. 
Ss were closely matched for age (17.5, 17.2, 
‘Were the means for Groups I, II, and Ill re- 
tively) and checked for socioeconomic back- 
Ind in terms of family income and father’s edu- 


Seeman, unpublished manuscript, 1964. Cited 
B. Rotter (1966). 
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Procedure 
The English version of the I-E scale was admin- 
istered to Groups I and II while the Chinese version 
was administered to Group III with the following 
instructions: 
Try to answer these items as carefully and hon- 
estly as you can, Do not spend too much time on 
any one item, but be sure to find an answer for 
every choice. Find the number of the item on the 
answer sheet and blacken the letter a or b— 
whichever you choose as the more true statement. 
In some instances you may discover that you be- 
lieve both statements or that you believe neither 
one to be true. In such cases, be sure to select 
the one you more strongly believe to be true as 
far as you are concerned, Also, try to answer each 
item independently when making your choice; do 
not be influenced by your previous choices. 
REMEMBER! 
Make the choice which you personally believe to be 
more true. 


The forms were then identified and additional 
variables such as sex, grade, father’s education, and 
family income were also collected. 


RESULTS 


The mean Anglo-American score was 8.58, SD 
= 3.89, The mean Hong-Kong Chinese score 
was 12.07, SD = 3.96. The highly significant dif- 
ference between these two groups (¢ = 10.57, df 
= 508; p < .001) strongly supports the hypothe- 
sis (the scale is scored such that a higher score 
= more external). 

The fact that the American-born Chinese Ss 
(mean score 9.79, SD =3.07) are significantly 
more internally oriented than the Hong-Kong 
Chinese but less than the Anglo-American Ss 
further supports the acculturation hypothesis. The 
differences between these groups attained the .01 
level of significance. 

To check for possible interaction effect be- 
tween I-E, ethnic group membership, father’s 
education and family income, an analysis of 
variance was performed, The results of the 
analysis showed that only the among groups 
difference was significant (F = 45.3, df = 2, p < 
005). 

Discussion 


The findings of this study are relatively clear- 
cut, They support the postulated relationship 
between Internal versus External control and 
ethnic group membership. Heeding the warning 
of previous investigators, that socioeconomic 
variables are also related to Internal versus Ex- 
ternal control and should not be overlooked, two 
demographic indexes were used in this study to 
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account for socioeconomic level: father’s educa- 
tion and family income. The main differences on 
the Internal versus External control dimension 
among groups remained when the groups were 
controlled for socioeconomic level. No attempt 
was made to control for intellectual functioning 
since most of the studies reported in the litera- 
ture indicate very low and nonsignificant correla- 
tions (Rotter, 1966). 

It appears that a cultural orientation may be 
closely linked with a personal belief in Internal 
versus External control. Individuals raised in a 
culture that values self-reliant individualism, 
pragmatic ingenuity, and personal output of en- 
ergy are likely to be more internally oriented 
than individuals from a culture that tends to 
emphasize a different set of values. 
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TREATMENT OF A PHOBIA BY PARTIAL SELF-DESENSITIZATION: 
A CASE STUDY 


MAX W. RARDIN 
University of Wyoming 


The treatment of a disabling case of hemophobia by partial self-desensitization 
is reported. The client completed the last nine items of a 16-item fear hierarchy 
by herself at home, Relaxation was self-induced and she presented the desensi- 
tization monologue to herself. At a 1-year follow-up she functioned as a 
student nurse on an obstetrics ward with no indication of recurrence of the 
phobia. The use of subvocal cues in controlling phobic behavior is discussed. 
The importance of the discrimination process in phobic reactions is suggested. 


This paper presents the treatment of an in- 
" capacitating fear of blood and personal injury by 
a variation of Wolpe’s (1966) desensitization 
technique. The purpose of the report is not to 
add to the list of exotic fears treated by desensi- 
tization but, rather, to illustrate a variation in 
technique, which to, the author’s knowledge has 
hot appeared in the literature. 


History AND CLINICAL DATA 


The client was an 18-year-old single female in 

the first year of a nursing program and had been 
a student in an introductory psychology class 
taught by the author. At the encouragement of 
her nursing instructors, she contacted the author 
about the possibility of controlling her fear of 
blood. An interview indicated that she had a 
limited phobia. She reported no dissatisfaction in 
other areas of her life, was achieving above 
average grades, and had good relations with fam- 
ily, peers, and faculty. 
The following history of the problem was gath- 
ered during the first scheduled session. The client 
indicated that she had been fearful of blood and 
generally squeamish for several years but her 
fears had not been a serious concern until she 
entered nursing—a career goal for her since 
childhood, 

Her reaction to blood and possible physical in- 
jury varied from moderate discomfort to dizzi- 
ness and nausea depending on the topic and cir- 
cumstances. The immediate concern was her reac- 
tion to the films shown in nursing classes which 
vividly depicted various medical conditions. On a 
number of occasions, she had to put her head 
down or leave the room, She felt she would faint 
or vomit if she continued to observe the film. 
This reaction was interfering with her perform- 


*Requests for reprints should be sent to Max 
rdin, Department of Psychology, University of 
yoming, Laramie, Wyoming 82070. 


ance in the classes in which the films were shown, 
and the nursing faculty was beginning to question 
her suitability for the profession. Both S and her 
instructors felt it was imperative that she gain 
control over her reactions before the start of 
clinical classes in the hospital or she would not 
be able to continue in the program. 


‘TREATMENT PROCEDURE 


During the latter part of the first session, S 
was given a detailed description of desensitization 
as it would apply to her situation. The next three 
sessions were devoted to training in relaxation 
and construction of a fear hierarchy. The client 
was capable of deep relaxation and apparently 
experienced vivid imagery. The 16 items in the 
fear hierarchy involved increasing amounts of 
blood due to injury, surgery, and childbirth. 

Except for changes in the placement of some 
items in the hierarchy, desensitization had pro- 
ceeded routinely through seven items during the 
next four sessions when the school term ended. 
The seven items completed were: a scraped el- 
bow; a torn hangnail; squeezing out one drop of 
blood; a cut in the sole of the foot; compound 
fracture of the leg; needle in skin for a stitch; 
and a gash in arm with flowing blood. 

The remaining nine items which had not been 
dealt with were: bleeding from nose and mouth 
due to internal injury; a sucking chest wound; 
seeing a blood sample drawn; blood foaming 
from mouth; waters breaking for childbirth; head 
emerging and effect on mother; blood flowing 
after birth; delivery of placenta; and stitching 
after delivery. At this point, in order to continue 
treatment, the client was faced with the alterna- 
tive of commuting 100 miles or attempting an 
experimental self-desensitization procedure sug- 
gested by the author. 

The experimental procedures were explained as 
follows. The relaxation technique and the general 
style of the monologue accompanying the presen- 
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tation of each item was reviewed. She was re- 
minded that the goal was to imagine increasingly 
vivid and personally relevant scenes. The impor- 
tance of not proceeding if anxiety interrupted her 
relaxation was stressed, She was given general 
examples of monologue to accompany the rest of 
the items. On request, the client was able to 
produce additional appropriate examples. The 
possibility of arranging an observation of child- 
birth at a hospital was discussed. After further 
assurance that contact with the author would be 
available if there was difficulty, the client decided 
to attempt this program. 

It was agreed she would work on the hierarchy 
each night when in bed under the following condi- 
tions. She was instructed to retire in her usual 
fashion, induce relaxation, and to begin with the 
highest item already completed. Work with the 
items was to be limited to approximately 1 hour. 
She was told to omit the procedure if she were 
sick, unusually tired, or disturbed by other mat- 
ters. She later reported engaging in self-desensi- 
tization 5 or 6 nights a week for 6 weeks. 

At the end of the 6 weeks she contacted the 
author and reported being able to imagine com- 
fortably all of the items on the list and having 
visited the hospital maternity ward. She was 
late for the delivery but did see the cord being 
cut and the delivery of the placenta. At that point 
she felt mildly faint and left. She requested that 
she be allowed to use smelling salts at her next 
birth observation since her dizziness was not ac- 
companied by nausea. Because she attributed this 
faintness more to excitement than anxiety, she 
was given permission to use smelling salts with 
the condition that she not force herself to ob- 
serve if she felt highly anxious. She observed her 
next delivery successfully. The last session oc- 
curred after her return to school and was pri- 
marily a review of events to complete the case 
history. She reported having observed a complete 
delivery, successfully taken blood samples, and 
having her own blood sample taken. One year 
later she was a student nurse on an obstetrics 
ward fully assisting in deliveries to the point of 
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dabbing blood between vaginal stitches, Surgical 
repair after delivery had been the highest item on 
her hierarchy. 


Discussion 


The client’s progress with self-desensitization 
was apparently routine with no report of difficulty 
other than the incident which resulted in the 
request for smelling salts. At the conclusion of 
the self-desensitization procedure in a tape record- 
ing describing her thoughts on the experience, she 
revealed a technique she had improvised for 
maintaining control of her fear when approaching 
an anxiety-arousing situation. As she neared the 
situation, she would subvocally repeat to herself 
phrases abstracted from the desensitization mono- 
logue. The phrase “it’s not me” was most often 
used, Her use of this phrase as an aide to dis- 
crimination appears similar to that of “it’s just a 
dream” applied in the treatment of a recurrent 
dream reported by Greer and Silverman (1967). 
Both studies suggest that perhaps the discrimina- 
tion process is crucial in controlling phobic reac- 
tions, 

The need to use such phrases might raise a 
question about the degree of desensitization. Since 
she was doing so much of the work on her own, 
it may well be that desensitization was continuing 


at the time she reported her impressions. She later | 


reported a decline in the use of the phrases. 
This case suggests that self-desensitization may 
be a useful variation from the usual technique for 
clinical application. In research it offers a method 
by which the contribution of the therapist to 
desensitization might be more precisely defined. 
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purpose of this research was to compare 
curacy in person perception of judges mak- 
their decisions individually and in group dis- 
settings. The judges (Js) employed in 
tudy were 48 male undergraduates from the 
ity of Rochester, all of whom had had 
courses in psychology. Each J participated 
in an individual session and once in a group 
with two other Js. 

instruments used to measure accuracy in 
perception were two shortened versions of 
nmed cases” that had been employed by 
thor in earlier research (Fancher, 1966). 
a programmed case true events from the life 
y of a real person are presented in multiple- 
format, with alternative events from other 
s’ lives. A J’s task is to select what he 
are the true events; after each of his 
s he is informed as to which of the events 
fact the true one, so as he proceeds 
a case he has increasing true information 
ich to base his later choices. 

the individual sessions, Js worked by them- 
and recorded their choices. Their accuracy 
were simply the numbers of true events 
t they chose. In the group sessions, each J 
lounced his choices to the group, and the 
discussed each item until it reached unani- 
agreement as to the most likely choice. The 
ounced choices of each individual J as well 


‘Reprints or an extended report of this study may 
Obtained without charge from Raymond E. 
er or for a fee from the American Society for 
ation Science. Order NAPS Document 00212 
ASIS National Auxiliary Publications Service, 
Information Sciences, Inc., 22 West 34th 
et, New York, New York 10001; remitting $1.00 
Microfiche or $3.00 for photocopies. 
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GROUP AND INDIVIDUAL ACCURACY IN PERSON PERCEPTION + 


RAYMOND E. FANCHER 
University of Rochester 


as the unanimous group choices were recorded 
and assessed for accuracy. 

Each programmed case entailed 12 choices, and 
each J worked on one case individually and the 
other in a group. The research design was counter- 
balanced so that each case was used half the time 
in each condition, and so that half of the Js 
worked first individually and the other half 
worked first in a group. 

The mean accuracy score for individuals work- 
ing by themselves was 5.44 (SD = 1.37), while 
the accuracy scores for unanimous group choices 
and the announced choices of individual Js in the 
groups were 5.19 (SD = 1.46) and 4.83 (SD= 
1.34), respectively. The only statistically signifi- 
cant difference was between the first score and 
the last (¢ = 2.35, p < .03). 

Thus the results indicated that group discus- 
sions did not lead to increased accuracy, and 
that individuals announcing their choices in a 
group were actually less accurate in their judg- 
ments than when they worked alone. The group 
sessions were also highly inefficient, requiring an 
average of 50 minutes time on the part of the Js. 
The individual sessions averaged only 20 minutes 
in duration. 

These results seem to suggest that further re- 
search attention might profitably be directed 
toward the investigation of certain clinical insti- 
tutions like the “diagnostic council,” which rest 
upon the assumption that interpersonal decisions 
can be made most accurately and efficiently in 


group settings. 
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AN EXTENSION AND REPLICATION + 


EDWARD B. KLEIN 
Yale University 


LAURENCE J. GOULD 
City College of the City University of New York 
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Crandall, Crandall, and Katkovsky (1965) 
have reported on the development of a children’s 
social desirability scale (CSD). They found that 
social desirability (SD) scores were higher for 
younger than older children, duller than brighter 
Ss, girls than boys, and Negro than white chil- 
dren. Ordinal position, size of family, and social 
class were not associated with CSD. The authors 
also raised questions about the generality of the 
SD response and the underlying rationale for 
SD scales. 

In light of the above questions, the present 
study sought to replicate the Crandall et al. 
findings and to further explore the meaning of 
SD by including sociometric data and ratings of 
children’s behavior by nonsophisticated judges. 
The strategy called for a large sample of boys 
and girls drawn from as many settings as possible 
to test for generalizability and to obtain indepen- 
dent judgments of the children’s behavior from 
naive observers. 


Method 


The Ss for this study were 1008 boys and girls 
ranging in age from 7 to 14 years old. They were 
drawn primarily from a number of camps in the 
greater New Haven area during the summer of 1964. 

The 48-item, true-false version of the CSD, a 
sociometric device (asking S to name the three chil- 
dren he likes and dislikes) and a background form 
were group-administered. At the same time counselors 
were asked to rate each S in terms of his friendliness, 
leadership, social awareness, and seeking approval 
from peers and adults. 


1 An extended report of this study may be obtained 
without charge from Edward B. Klein, Psychiatry 
Department, Yale University, 34 Park Street, New 
Haven, Connecticut 06519, or for a fee from the 
American Society for Information Science. Order 
Document 00211 from ASIS National Auxiliary Publi- 
cations Service, c/o CCM Information Sciences, Inc., 
22 West 34th Street, New York, New York 10001; 
remitting $1.00 for microfiche or $3.00 for photo- 
copies. Requests for reprint should be sent to Edward 
B. Klein at the above address, 
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Results 


Differences between mean CSD scores at each 
age level by sex were tested by ¢ tests. The re- 
sults showed that girls score significantly higher 
than boys at every age level, except in the 12- 
year-old group. Additional ¢ tests were carried out 
between age levels within each sex. In general, 
younger Ss of both sexes score higher than older 
Ss. This tendency is most marked when Ss are 
more than a year apart and younger than 11. 

In keeping with Crandall et al., CSD score 
did not significantly relate to social class, birth 
order, and number of siblings. Consistent with 
the construct of SD, CSD scores were found to 
be significantly correlated with Ss’ inability to 
voice a dislike of peers. Contrary to predictions 
CSD was not found to be related to independent 
ratings of counselors’ perceptions of Ss. 


Discussion 


This study has demonstrated a replication of 
the Crandall et al. findings with respect to age 
and sex in a larger and more heterogeneous group 
of youngsters. Our findings also suggest that age 
11 may be critical in that the CSD score appears 
to stabilize at this time. In addition, the signifi- 
cant negative correlation obtained between num- 
ber of children rated as disliked and CSD occurs 
in both sexes primarily from age 11 onward. This 
finding would appear to provide some construct 
validity for the scale and suggests that the kinds 
of correlates associated with adult approval moti- 
vation begin to emerge in samples as young as 1i 
years old. 
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Norms based on a stratified 
standardization sample con- 
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Mental and Motor Scales yield 
Separate scores: 


Mental Development 
Index (MDI) 


Psychomotor Development 
Index (PDI) 


_ The child's interpersonal and 
affective behavior and the over- 
all quality of his performance 
also are rated by the examiner 
on the accompanying Infant 
Behavior Record. 
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Recent works of note... a 


THE TOKEN ECONOMY: 
A Motivational System for Therapy and Rehabilitation 


Troporo AYLLON and Narman H. Azren. The authors’ pioneering work at Anna State 
Hospital (Illinois) results here in the detailed account of an effective new application of 
learning theory to the behavior problems of institutionalized patients. Representative of a 
departure from the current system of treatment in mental hospitals is the use of a wide 
variety of innovative reinforcers to strengthen desired behaviors, including original experi- 
ments that evaluate specific procedures. 288 pp., illus., paper, $3.95 


PSYCHOLOGICAL COUNSELING, Second Edition 


Epwarp S. Boron, The University of Michigan. Up-to-date new edition includes an en- 
tirely new chapter on process problems in vocational counseling and coverage of recent 
developments in behavior therapy and theory and in research in personality and voca- 
tional choice. 472 pp., $7.00 


THE GOALS OF PSYCHOTHERAPY 


Edited by Arvin R. Maurer, Miami University. Sixteen prominent psychiatrists, psycholo- 
gists, and psychoanalysts join the editor in this assessment of modern psychotherapy. 
In discussing the goals of their profession, the contributors draw on the basic theories of 
personality, psychopathology, and human nature. The volume establishes a new basis for 
structuring the field of psychotherapy, in terms of a three-family organization of psycho- 
therapies, illustrated with samples from the text, 301 pp., $6.00 


THE CLINICAL STUDY OF SOCIAL BEHAVIOR 


Donatp R. Peterson, University of Illinois, Seeks to arrive at a clearer definition and more 
effective resolution of the issues confronting clinicians in their study of behavior. Examining 
basic conceptual and methodological structures, the author suggests a number of solutions 
to practical problems as more helpful than those presently available. 254 pp., illus., $5.50 


THE HALFWAY HOUSE MOVEMENT: A Search for Sanity 


Harop L. Rausn and CHARLOTTE L. Rausx, both of The University of Michigan, The 
first survey of halfway houses for the mentally disturbed in the United States. The volume 
discusses, among other topics, the role of the halfway house in meeting the problem of 
transition from the hospital to the community, preventive aspects of mental health, and 
the value of theories of social and psychological intervention. 247 pp., $5.50 


Scheduled for 1969: 


A SOCIAL HISTORY OF HELPING SERVICES: 
* Clinic, Court, School and Community 
URRAY LEVINE and ADELINE Levine, both of the State Universit New York at 
Buffalo. The first detailed history relevant to the aae teh o aert, 


this collaborative work integrati chi i 
concepts. 275 pp., $6.50 aay Psychological, sociological, and historical data and 
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Meredith Corporation 


440 PARK AVENUE SOUTH, NEW YORK 10016 


EUROPE 


A flexible travel program has been develop- 
ed in connection with the XIXth Interna- 
tional Congress of Psychology in London, 
July 27/August 2, 1969. Several groups will 
leave New York to Europe, the Mediter- 
ranean and the Near East -- utilizing the low 
group fares of Pan Am, TWA, BOAC and 
other major airlines. Round trip jet fares 
start at $230 (New York/London/New 
York.) Write for brochure to: Division ‘P’, 
Travel Wholesalers International, 1707 L 
Street, N.W., Washington, D.C. 20036. 


NEW IN 1969 


DEVELOPMENTAL 
PSYCHOLOGY 


Editor: Boyd R. McCandless 


Developmental Psychology will contain articles which represent 
the broad range of growth and development and their major associ- 
ated variables. Chronological age as well as sex, socioeconomic 
status and efforts of physical growth variables (including time of mat- 
uration and body build) are all considered relevant development 
variables. 


Although the adolescent and the aging population appear to de- 
serve particular attention at this time, Developmental Psychology 
will indeed cover the entire span. Cross-species articles and articles 
Concerning developmental research with retardates will also be in- 
cluded. 


The articles in Developmental Psychology will not be limited to 
any one approach such as “experiments versus studies” or “cognition 
versus motivation”. 


Bimonthly. One volume per year. 
Subscription Price: $10.00 per year 


AMERICAN PSYCHOLOGICAL ASSOCIATION 
1200 Seventeenth Street, N. W. 
Washington, D. C. 20036 


AMERICAN PSYCHOLOGICAL ASSOCIATION 
Subscription Rates for Calendar Year 1969 


AMERICAN PSYCHOLOGIST. Official papers of the Association and articles on Psychology. Monthly. The 1969 
volume is Vol. 24. Subscription: $10.00 (foreign $10.50). Single copy $1.00. 


CONTEMPORARY PSYCHOLOGY. Critical reviews of books, films, and research material in the field of psychology. 
Monthly. The 1969 volume is Vol. 14. Subscription: $10.00 (foreign $10.50). Single copy $1.00. 


JOURNAL OF ABNORMAL PSYCHOLOGY. Theoretical and research conditions in behavior pathology. Bimonthly; 
first issue appears in February. The 1969 volume is Vol. 74. Subscription: $10.00 (foreign $10.50). Single copy $2.00. (Con- 
tinues volume sequence of the former Journal of Abnormal and Social Psychology.) 


Journals of the 


JOURNAL OF APPLIED PSYCHOLOGY. Applications of psychology in business and industry. Bimonthly; first 
issue appears in February. The 1969 volume is Vol. 53. Subscription: $10.00 (foreign $10.50). Single copy $2.00. 


JOURNAL OF COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY. Original contribution in the field of com- 
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heterosexual males who were in psychi 


structions of patients’ early lives. The 


or conclusion of a study comparing 
al and heterosexual men who were 
ychoanalytic therapy was that parental 
ẹ paramount in the etiology of homo- 
ity (Bieber, Dain, Dince, Drellich, 
Gundlach, Kremer, Rifkin, Wilbur, & 
1962). Those authors described the 
” pattern as one where the mother is 
inding and intimate with her son and 
ant and minimizing toward her hus- 
who is a detached (particularly a hos- 
hed) father to the son. They con- 
at any son exposed to that parental 


problems. The Bieber study was based 
nsive questionnaires completed by the 
for each patient; the patients them- 
re not aware of the study. Two series 
ions proved especially useful in dif- 
the homosexual and heterosexual 
Developmental Six Score (concern- 
ood fears and activities) and a 
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nal Institutes of Health Grant No. FR 
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ment of Psychiatry, Loma Linda Uni- 
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groups who had never sought psychotherapy. The data were based on retro- 
spective self-reports regarding childhood fears and activities, and interparental 
and parent-child relationships. In the Bieber study, each patient’s psychoanalyst 
completed the questionnaire, so that their data were based on analysts’ recon- 


tion will likely develop severe homo- ~ 


CHILDHOOD PARENTAL RELATIONSHIPS OF 
HOMOSEXUAL MEN * 


EVANS 2 


Loma Linda University School of Medicine 


A 27-item questionnaire adapted from Bieber et al. was administered to 43 
homosexual and 142 heterosexual men in order to determine whether certain 
items which Bieber et al. had reported as differentiating homosexual and 


otherapy would also differentiate similar 


present results were remarkably similar 


to those of Bieber et al. in revealing more “negative” features in the back- 
grounds of the homosexuals. Nevertheless, the results were considered neither to 
support nor refute the Bieber conclusions regarding the etiological importance 
of parental relationships in homosexuality. 


Twenty Questions Score (relating to inter- 
parental and parent-child relationships). 

ere is an obvious risk in generalizing find- 
ings from patients in psychotherapy to a non- 
patient population. The purpose of the present 
study was to determine whether question- 
naire items adapted from Bieber et al. would 
differentiate samples of heterosexual and 
homosexual men who had never sought psy- 
chotherapy. 


METHOD 
Subjects 


T 

The sample consisted of 185 American-born, Cau- 
casian men between the ages of 22 and 47, who had 
at least a high school education, had never sought 
psychotherapy, and were living in the Los Angeles 
metropolitan area. All Ss were volunteers in a study 
of cardiovascular disease, but only the 43 homo- 
sexuals knew that aspects of homosexuality were also 
being studied. The latter volunteered, as homosexuals, 
through the cooperation of a Los Angeles based 
organization; they did not constitute a representative 
group of homosexuals. The 142 “heterosexual” Ss 
volunteered for the cardiac study through a number 
of sources, and there was no opportunity to develop 
the kind of rapport needed to elicit information 
about their sexual preferences and behavior. For 
purposes of this study, it was assumed they were all 
heterosexual, though there may have been homo- 
sexuals among them, which would tend to attenuate 
group differences. 

The homosexual men ranged in age from 22 to 46, 
with a mean of 33.8 years (SD = 7.1); the hetero- 
sexuals ranged from 25 to 47, with an average of 
393 (SD=44), and the difference was significant 
(t= 4.84, p < .001). They were reasonably similar to 
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patients in the Bieber study, where the homosexuals 
averaged approximately 35 and the heterosexuals 
approximately 38 years of age. 

In education, the homosexuals ranged from 12 to 
19 years, with a mean of 14.4 (SD=2.2); and the 
heterosexuals ranged from 12 to 20 years, with a 
mean of 15.1 (SD=2.1), with the difference ap- 
proaching statistical significance (t= 1.87, p< .10). 
Again, they were relatively similar to the Bieber 
patients, who averaged approximately 15 years of 
education. 

There was an obvious and expected difference be- 
tween the two groups in marital status. Among the 
heterosexuals, 5% were single, 87% married, and 
8% separated or divorced; 86% of the homosexuals 
were single, 5% married, and 9% divorced. In 
the Bieber et al. study, 8% of the homosexuals and 
51% of the heterosexuals were married. 

As to sibling constellations, there were 12%, 35%, 
14%, and 40%, respectively, only, oldest, middle, 
and youngest children among the homosexuals; 
comparable figures for the heterosexuals were 8%, 
28%, 27%, and 37%. Those distributions were not 
significantly different. Bieber et al. reported 10% of 
their homosexuals and 22% of their heterosexuals 
were only children, which difference was significant 
at the .05 level. 

Proportionately more of the homosexuals were 
employed in clerical work and in the arts, and 
fewer of them in other professions, management, and 
sales work. The difference in heterosexual-homo- 
sexual occupational distributions was significant (x? 
= 42.88, df=5, p< 001). The occupational classifi- 
cations of the present Ss and the Bieber patients 
were fairly comparable. 

The homosexual volunteers rated their sexual ex- 
perience’ on a 7-point scale adapted from Kinsey, 
Pomeroy, and Gebhard (1948), which ranged from 
entirely heterosexual to entirely homosexual. Of the 
43 Ss, 58% described their experience as having been 
exclusively homosexual, 35% as predominantly homo- 
sexual with incidental heterosexual, and 7% as 
predominantly homosexual but more than incidental 
heterosexual experience. For their homosexual pa- 
tients, Bieber et al. reported 68% as exclusively 
homosexual, 28% as having some heterosexual ex- 
perience, and 4% as inactive, so that the proportion 
of exclusive homosexuals in the two studies was 
similar (x? = .89, p= .50). 

The homosexual Ss also completed an 11-item 
questionnaire designed to determine their sexual 
identification. In their overall feelings, 40 Ss (93%) 
considered themselves moderately or strongly mascu- 
line, and responses to the other 10 items also indi- 
cated essentially masculine identification. Bieber et 
al. reported that approximately 2% of their homo- 
sexual patients were “markedly effeminate,” so that 
seemingly Ss from the two studies were similar in 
this regard. 


Procedure 


Each S completed a 27-item questionnaire adapted 
from Bieber et al. so as to be as nearly comparable 


Evans 


as possible. The essential content of the questionnaire 
appears in Table 1. Included were the Develop. 
mental Six (Items 2-7) and Twenty Questions (Items 
8-27) scores, and one additional question regarding 
physical make-up in childhood (Item 1), which had 
also differentiated the Bieber groups. Four possible $ 
choices were provided for each item, whereas the | 
Bieber study used a yes-no dichotomy for all except 
three items. Following is an example from Bieber e, 
al. completed by the analysts: “Was patient exces 
sively fearful of physical injury in childhood? (yes 
no).” The corresponding item modified for the pres- 
ent study, for completion by S himself, was: “During 
childhood, were you fearful of physical injury? | 
(seldom/sometimes/often/always).” ‘ 

Questionnaires were used in the analysis only 
where all 27 items had been answered, which elimi- 
nated 11 potential Ss, one homosexual and 10 hetero- 
sexual, Differences between groups were calculated by 
means of chi-square, with twofold classifications cor- 
rected for continuity. 


RESULTS 


The content of each questionnaire item is 
given in Table 1, together with the proportion 
of Ss responding in each category, and the 
significance of differences between the homo: 
sexual and heterosexual groups. Comparable 
figures from the Bieber study are also it- 
cluded, with the significance values calculated 
from figures in Appendix A (Bieber et al, 
(1962) using only Ss for whom definite 
response was available. Differences betweel 
the Bieber groups reached at least the BY 
level of significance for 24 of the 27 items, 
and the other three approached significance. 
(Bieber et al. reported all items significant, 
and the discrepancy may be due to the meth 
used here of ignoring “No Answer” and “Not 
Applicable” categories in the calculation 0 
chi-square.) 

In the present study, homosexual-heter0- 
sexual differences were significant at the 0 
level or less for 24 items, one other ap 
proached significance, and for two items no 
difference was found. Thus, despite the ve) 
different method of collecting data, the 20% 
patient status of the Ss, and (perhaps minor | 
differences due to geographical location, 
results were remarkably similar to tho% 
reported by Bieber et al. als 

Specifically, in retrospect, the homosext! Í 
more often described themselves as frail 
clumsy as children and less often as athletic 
More of them were fearful of physical 
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TABLE 1 
QUESTIONNAIRE CONTENT AND ITEM RESPONSES 
Bieber study Present study 
Homo- | Hetero- Homo- | Hetero- 
Roper sexual | sexual ? Response | sexual | sexual ? 
Frail 50 17 Frail 37 11 
Clumsy 24 08 Clumsy 14 06 
Athletic 13 33 Athletic 05 45 
Well coordinated 13 42 .001 | Coordinated 44 38 .001 
Yes 75 46 Seldom 23 49 
No 25 54 .001 | Sometimes 51 46 
Often 19 04 
Always 07 01 .001 
Yes 90 56 Always 56 12 
No 10 44 .001 | Often 30 35 
Sometimes 14 46 
Never 00 07 .001 
Yes 34 10 Never 09 03 
No 66 90 .001 | Sometimes 49 83 
Often 40 14 
Always 02 00 001 
Yes 61 V A a Never 12 38 
No 39 73 .001 | Sometimes 35 51 
Often 42 11 
Always 12 01 .001 
Yes 17 64 Never 09 01 
No 83 36 .001 | Sometimes 65 15 
Often 23 52 
Very often 02 32 001 
Yes 16 64 Never 19 05 
No 84 36 .001 | Sometimes 70 29 
Often 09 35 
Very often 02 32 | .001 
Great deal 01 13 Great deal 16 28 
Average 42 50 Considerable | 53 39 
Little 36 24 Little 26 23 
Very little 21 13 .002 | Very little 05 09 .23 
Yes 20 38 Great many 21 30 
No 80 62 |.01 | Several 37 32 
| Few 35 33 
None 07 05 .70 
Yes 64 36 Never 30 18 
No 36 64 | .001 | Seldom 37 63 
Often 16 17 
7 Always 16 01 001 
Yes 57 34 Highly 07 00 
No 43 66 | .002 | Moderately 07 03 
Slightly 09 13 
No 77 85 .02 
Yes 39 17 Often 05 02 
No 61 83 .002 | Sometimes 21 07 
Seldom 30 14 
Never 44 77 001 
Yes 36 12 Never 53 87 
No 64 88 | .001 | Seldom 21 ti 
Sometimes 21 02 


Often 05 01 .001 
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TABLE 1— (Continued) 


Bieber study Present study 
Questionnaire item 
Res Homo- | Hetero- R Homo- | Hetero- 
ponse sexual | sexual b ponso sexual | sexual 

Mother considered Yes 67 51 Strongly 28 11 
puritanical No 33 49 |.05 | Moderately 33 35 
Mildly 23 23 
No 16 30 
Mother’s relationships | Frigid 72 56 Frigid 12 00 
with father/other Not frigid 28 44 .04 | Cold 26 23 
men Warm 63 77 
Mother allied with son | Yes 63 40 Often 33 06 
against father No 37 60 .002 | Sometimes 21 18 
Seldom 16 35 
Never 30 42 
Mother openly pre- Yes 59 38 Always 12 01 
ferred son to father | No 41 62 .005 | Often 14 06 
Seldom 21 31 
Never 53 62 
Mother interfered Yes 37 25 Often 12 00 
with heterosexual No 63 75 |.08 | Sometimes 16 08 
activities Seldom 19 20 
Never 53 71 
'Son was mother’s Yes 52 36 Never 30 27 
confidant No 48 64 03 | Seldom 19 32 
Sometimes 23 36 
Often 28 05 
Son was father’s Yes 08 29 Strongly 09 09 
favorite No 92 71 .001 | Moderately 16 40 
Mildly 40 37 
No 35 14 
Felt accepted by father | Yes 23 48 Strongly 23 42 
No 77 52 | .001 | Moderately 35 42 
Mildly 23 11 
REI cu No 19 06 
Son spent time with | Great deal 03 03 Great deal 02 08 
father Average 12 39 Considerable | 09 39 
ae i 37 31 Little 53 32 
ery little 48 27 |.001 i 21 
Father encouraged Yes 48 61 eon 26 at 
masculine attitudes/ | No 52 39 | .07 | Sometimes 26 32 
activities Seldom 23 21 
Aware of hating father | Yes 61 37 Ra g e5 
as a child No 39 63 | 002 | Seldom 19 a0 
Sometimes 37 is 
Afraid father might Yes 57 43 net fa 04 
physically harm him | No 43 57 |.06 | Sometimes 19 23 
Seldom 30 5 

Ne 
Accepted father Yes 21 51 Strongly ae 51 
No 79 49 -001 | Moderately 28 37 
Mildly 33 He 
Respected father Yes 30 49 ee n 56 
No 70 51 |.01 | Moderately | 21 a 
Mildly 21 08 
No 21 03 

L 


Note.—Significance levels based on chi-square, with twofold classifications corrected for continuity. Decimals omitted 
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jury, avoided physical fights, played with 
girls, and were loners who seldom played 
baseball and other competitive games. Their 
mothers more often were considered puritan- 
ical, cold toward men, insisted on being the 
center of the son’s attention, made him her 
confidant, were “seductive” toward him, allied 
with him against the father, openly preferred 
him to the father, interfered with his hetero- 
sexual activities during adolescence, discour- 
aged masculine attitudes, and encouraged 
feminine ones. The fathers of the homo- 
sexuals were retrospectively considered as less 
likely to encourage masculine attitudes and 
activities, and Ss spent little time with their 
fathers, were more often aware of hating him 
and afraid he might physically harm them, 
less often were the father’s favorite, felt less 
accepted by him, and in turn less frequently 
accepted or respected the father. Unlike 


‘Bieber’s patients, these homosexuals were no 


: different from the heterosexuals in amount of 
time they estimated their parents spent to- 


gether or in the interests shared by their 
Parents. 

In addition, a total score on the 27-item 
questionnaire was obtained for each S by 
weighting each item from 0 to 3 points, with 
the higher weighting at the “masculine” end, 


_ 80 there was a maximum possible score of 81. 


The scores of the homosexuals ranged from 
9 to 64, with a mean of 42.9 (SD = 11.6); 
those of the heterosexuals from 36 to 77, with 
a mean of 57.3 (SD = 9.2). Though there 
Was considerable overlap in scores for the two 
groups, the difference was highly significant 
(t= 7.50, p < .001). 


Discussion 


ane results could not be accounted for on 
“a ee sample characteristics other than 
al orientation; no relationship was found 
“ie age and questionnaire scores, and the 
ž i pte of marital status, occupational 
ie aaron; and sibling constellation. The 
ih at the homosexuals knew homosexual- 
ke nee might have affected the 
faa ut if there was any tendency to 
me ee the direction of “normal,” it was 

cient to obscure group differences. 
cae preponderance of homosexual ex- 
€, no relationship was .observed be- 
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tween Kinsey-type ratings (completed only 
by homosexual Ss) and questionnaire scores 
(x? = 0.0), perhaps because of the limited 
variation in proportion of homosexual experi- 
ence. However, a product-moment correla- 
tion of .47 (f= 3.41, p< .01) was found 
between the 27-item questionnaire and the 
1l-item sexual identity questionnaire; the 
homosexuals with more “desirable” family 
backgrounds tended to consider themselves as 
more masculine. 

It may be noteworthy that the present re- 
sults were so similar to those obtained by the 
Bieber group despite a major difference in 
the level of observation. In the present study, 
the data were based on retrospective self- 
reports of how they now view their childhood, 
by Ss who had never been in psychotherapy. 
The Bieber data, on the other hand, were 
based on psychoanalysts’ reconstructions of 
patients’ early life circumstances, derived 
from impressions during psychotherapy. Argu- 
ments could be advanced for the superiority 
of one method over the other, and certainly 
both have limitations. The agreement in re- 
sults could be interpreted as evidence of valid- 
ity in both methods, or perhaps as an indica- 
tion that the two methods are not essentially 
different. 

The results strongly suggested poor paren- 
tal relationships during childhood for the 
homosexual men, at least as seen in retro- 
spect; however, the etiological significance 
of such relationships, or even the etiology 
of the relationships themselves, is another 
matter. Bieber et al. considered the chances 
high that any son exposed to the parental 
combination of maternal close-binding inti- 
macy and paternal detachment-hostility will 
develop severe homosexual problems. Never- 
theless, only 28% of their homosexual pa- 
tients had such a parental combination, and 
the 11% of their control patients who had 
such parents did not become homosexual. 
Furthermore, Bieber et al. very much under- 
emphasized one-third of the “triad,” the son 
himself. They reported that “each parent 
had a specific type of relationship with the 
homosexual son which generally did not occur 
with other siblings,” and that son was the 
“focal point for the most profound parental 
psychopathology.” As to why a particular son 
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is singled out, Bieber et al. proposed that son 
is unconsciously identified by the mother with 
her own father or brothers, and the son 
thereby becomes the recipient of sexual feel- 
ings carried over from the mother’s own early 
life. Similarly, the father transfers to that 
son his unresolved hostility and rivalry with 
his own father/brothers. The above is an 
oversimplified summary of the Bieber formu- 
lation, but it does not exaggerate the ne- 
glect of the son’s contribution to the triadic 
relationship, beyond eliciting parental trans- 
ference feelings. 

The personalities and behavior of parents 
undoubtedly affect a child’s personality, but 
some consideration must be given to the no- 
tion that the child’s innate characteristics 
at least partially determine parental reactions 
and attitudes toward him. For instance, that 
the father of a homosexual son becomes 
detached and/or hostile because he does not 
understand or is disappointed in the son is 
just as tenable as that the son becomes 
homosexual because of the father’s rejection. 
Similarly, that a mother may be more inti- 
mate with and bind her homosexual son more 
closely because of the kind of person he is, is 
just as reasonable as the idea that he becomes 
homosexual because she is too binding and 
intimate. Bieber et al. did question whether 
paternal rejection and hostility were stimu- 
lated out of feelings of disappointment and 
failure because of the son’s homosexuality, 
but concluded that was not likely since only 
17% of the fathers were reported to have 
been aware of the son’s homosexuality. Surely 
most parental reactions crucially affecting the 
child’s personality occur when the child is far 
too young to be labeled homosexual or hetero- 
sexual. The Bieber group also concluded that 
the father’s attitudes were not traceable to 
the fact that the sons were inadequate and 
unattractive children, since the mothers did 
not find them so, That the mothers did not 
find these sons unattractive is no indication 
the fathers did not; the evidence suggests 
the fathers did find them unappealing. 

Judging from experience with adult homo- 
sexual males, O’Connor (1964) also refuted 
the idea that lack of a good father relation- 
ship was a consequence rather than a cause 
of homosexuality, on the grounds that would 
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make it difficult to account for the many 
homosexuals whose fathers were physically 
absent. Bene (1965) rejected the notion that 
the lack of a positive father relationship 
might be due to the son repulsing the father 
rather than the father repulsing the son, and 
she cited O’Connor’s reasoning. That homo. 
sexuality occurs in sons whose fathers are 
physically absent is irrelevant to the fact that 
when the father is physically present the 
relationship with the homosexual son is often 
a poor one. Furthermore, that homosexuality 
occurs in the absence of a father not only 
detracts from the etiological significance of 
a poor paternal relationship but in fact sup- 
ports the importance of other causal factors 
(possibly such as innate physical /personality 
characteristics of the son), 

Information was obtained relevant to an- 
other conclusion of Bieber and colleagues, who 
stated: “We have come to the conclusion that 
a constructive, supportive, warmly related 
father precludes the possibility of a homo- 
sexual son; he acts as a neutralizing, protec- 
tive agent should the mother make seductive 
or close-binding attempts [p. 311].” The 
questionnaire responses gave no full and com- 
plete answer as to whether these fathers were 
constructive, supportive, and warmly related, 
but there was evidence that the father rela- 
tionship of some homosexuals was as good 
as that of many heterosexuals. A score was 
calculated for each S based on Items 20-27 
in Table 1, all of which concern the father- 
son relationship. With 0-3 points possible 
for each item, the total scores for the 
homosexuals ranged from 0 to 22, with a 
mean of 11.7; and scores for heterosexuals 
ranged from 3 to 23, with a mean of 16.5. 
While the difference was significant (t = 5.95 
P< .001), 16% of the homosexuals scor 
above the heterosexual mean, and 16% of the 
heterosexuals scored below the homosext 
mean. Therefore, it would seem that a moder- 
ately good relationship, at least as reflecte 
in the above questionnaire items, does 10 
preclude the appearance of homosexuality, 
even though it is well established that a poo 
father relationship is common among homo- 
sexual sons. The responses for the Bieber 
homosexuals on the corresponding question- 
naire items suggested their father relationships 


ed 
al 
d 
t 


than those of the present homo- 
which could merely reflect methodo- 
ferences, but more likely is related 
that the Bieber Ss had all sought 
apy, whereas none of the present Ss 


a similar fashion, two other question- 
es were calculated, one regarding 
on relationships (Items 10-19) and 
pertaining to development (Items 
h a maximum score of 30 on the 
items, the homosexuals ranged 
fo 29, with a mean of 18.8; the hetero- 
tanged from 13 to 29, mean 22.9. 
the difference was significant (¢ = 
< .001), 30% of the homosexuals 
ed above the heterosexual mean, and 12% 
erosexuals scored below the homo- 
. As to the seven developmental 
with a possible score of 21, the homo- 
ranged from 1 to 15, with a mean of 
the heterosexuals ranged from 6 to 
| of 14.3. That difference was most 
t (t= 10.39, p<.001), and only 
e homosexuals exceeded the hetero- 
n, with 4% of the heterosexuals 
elow the homosexual mean. Of the 
tent areas, then, the developmental 
arly differentiated Ss best, with the 
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father and mother items similar in their dif- 
ferentiation. The childhood behavior reflected 
in some of the developmental items, of course, 
is not unaffected by parents, but the findings 
suggest the possibility of something more 
fundamental in homosexuality than a poor 
father relationship. 

The results of the present study agreed 
closely with those obtained by Bieber et al, 
but they neither supported nor refuted the 
Bieber conclusions as to causal relationships. 
The complicated problem of the etiology of 
homosexuality probably could be more pro- 
ductively investigated with a prospective 
study. 
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CHILDHOOD PARENTAL RELATIONSHIPS AND THE ESTABLISHMENT 
OF GENDER ROLES OF HOMOSEXUALS 


RALPH H. GUNDLACH + 
Postgraduate Center for Mental Health, New York, New York 


Evans’ questionnaire returns distinguishing “never-in-therapy” male homosex- 
uals from comparisons confirm Bieber et al. findings. Evans’ argument, that an 
association between questionnaire items regarding parent-to-child behaviors 
and later homosexuality is not causal, is rejected as not applicable. Studies by 
Money and Stoller with hermaphrodites and transsexuals establish the impor- 
tance of parental belief about their child’s sex in determining its sense of 
gender identity. Data regarding lesbians contrast with that for male homo- 
sexuals, and raise questions about multiple roads to adult sex-role character 
including: Does the partner’s sex always have the same significance to S’s self- 


concept ? 


It is with considerable satisfaction I learn that 
Evans’ (1969) 27-item questionnaire—taken from 
the 450 items used by the Bieber team (Bieber, 
Dain, Dince, Drellich, Grand, Gundlach, Kremer, 
Wilbur, & Bieber, 1962)—answered by a sample 
of West Coast male homosexuals and heterosexu- 
als never in psychoanalysis, confirm so solidly our 
findings on these items. We are grateful to Evans 
for testing our explict generalization from a 
patient population to nonpatients. 

In considering Evans’ research I am reminded 
that there are many different and appropriate 
goals for research and that misunderstandings 
may occur between researchers because they do 
not grasp the others’ context. I may very well 
fail to grasp Evans’ through inability to get 
away from my own set. It seems to me that 
Evans presents his data with admirable objectiv- 
ity, but I suspect he really did not expect these 
‘findings, for he veers off into a discussion which 
offers no accounting for his data, nor comes to 
grips with the Bieber et al. data. His arguments 
are not based on his research. 

I see his argument as threefold: (a) he points 
out that the Bieber et al. findings cannot be 
taken to prove that parental relationships of the 
quality indicated have any etiological significance 

for homosexuality. One could just as reasonably 
assert, he says, that “the father of a son becomes 
detached and/or hostile because he does not 
understand or is disappointed in the son.” Like- 
wise, “the mother may be more intimate with 
and bind her homosexual son more closely 
because of the kind of a person he is.” Although 
these statements might seem to hold logically for 


1 Requests for reprints should be sent to Ralph 
H. Gundlach, 160 East 84th Street, Apt. 2K, New 
York, New York 10028. 
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„Or a unitary trait; and certainly the qu! 


Evans does not take account of the many cro 
tabulations between related questions, and eg 


ences in time as presented in Chapter 
“Developmental aspects of the prehomos 
child” (Bieber et al., 1962). But Evans him 
adds—undermining his argument it seems to m 
“Surely most parental reactions critically affed 
ing the child’s personality occur when the child \ 
is far too young to be labeled homosexual oF 
heterosexual.” Dipl 
(b) Evans argues that even though fathers of 
homosexuals (H) and controls (C), for i 
may be distinguished at the .001 level or hi 
on a collection of items, the two groups 
overlap in part. He thus seems to require tha | 
the answers to questionnaire items about rela- 


and categorized in simplified general phra 
whether answered by a psychiatrist or the 
son, should be as firm as a causal connection; | 
as solid as the copper wire between light switch 
and light bulb. 5 
This position seems to indicate a misconcept jon | 
about the nature of the subject matter of i i 
research and a confusion about the function l 
meaning of questionnaire data in studies such 3 
as these on homosexuality. Evans follows here, | 
perhaps, the old medical model regarding the 
etiology of a disease, like mumps or yellow fever, 
and demands no less than the identification 
one precise “cause.” But the facts show 
homosexuality is not a disease, or a syndro 


naire items are not the causal factors. 
erects and attacks a straw man so far a © 
purports that Bieber et al. assert that “a 
father relationship” or a close-binding-intim 
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mother “causes” homosexuality in a son. The 
whole tenor of that study was to delineate the 
multivariate factors that may be involved. Par- 
ticularly impressive were our findings of a strong 
heterosexual interest in many of the H which 
were sharply inhibited by fears related to female 
genitalia; and evidence that many H seek pick- 
ups not for sheer sexual pleasure and gratifica- 
tion or love, but are driven by anxieties and 
are acting compulsively. Such neurotic fears are 
hardly instinctual, but do they also characterize 
H who never had psychotherapy? Unfortunately 
Evans did not replicate these most essential items. 

(c) Evans’ third argument is that the Bieber 
et al. study overlooked the son himself: one-third 
of the triad. He asserts “the child’s innate charac- 
teristics at least partially determine parental 
reactions and attitudes toward him.” 

It is hard to understand Evans’ objection that 
the individual personality was left out in the 
Bieber et al. study, since the patient was the 
focus of the entire study, triangulated from many 
vantage points. The study was an attempt to 
enrich, extend, and systematize clinical material 
about male homosexuals found in psychoanalytic 
treatment; the focus was always on the ground 
level of the particular person. The superposition 
of an extensive questionnaire was understood as 
4 gross way of collecting disparate persons of 
family groups. The questionnaire answers and 
the Many tables served to block out the main 
geography of patterns of male homosexuals and 
their family of origin patterns, just as a map 
Serves to fix the relative sizes and some of the 
characteristics of ground-level reality. The sta- 
tistical material was treated not as primary data 
but as material to be assessed in the light of 
clinical experience and to extend clinical insights, 
Thus, some eight classifications of mother-types 
and 10 groupings of father-types provided 51 
different parental patterns; and these were 
further collected into four major groupings. 
Throughout, with every section, the full mean- 
ng Was illuminated, the clinching material was 

layed through some 35-40 cases. We found 
à remarkable number of conditions and circum- 
stances that may result in homosexuality. 

When Evans suggests that there is something 
Within the nature of the homosexual which may 
tied determining factor (“possibly such as 

ate physical/personality characteristics of the 
ion”), one could think he was unhappy with his 
tt findings. But the evidence continually 
mas against such an “innate” hypothesis. The 
ce Of genes, chromosomes, and endocrine 

ies lons have all been largely discredited as 
_ “mining homosexuality, Perloff (1965) sum- 
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marizes that “homosexuality is purely a psycho- 
logical phenomenon, neither dependent on a 
hormonal pattern for its production nor amenable 
to change by endocrine substances [p. 68].” 

What happens after the child is born is com- 
plicated by many factors; there are not only 
inner biological and emotional factors, parental 
and familial surroundings, social and cultural 
circumstances; but the various pressures and 
expectations shift as the child grows and hardens 
as he establishes his ways into his eventual adul 
character structure. ? 

At birth the baby is identified as a boy or girl 
and the child is reared by the parents with that 
idea in mind, with whatever doubts, reservations, 
pleasures, or disappointments, and intentions they 
carry for the child. So by the age of 2 or 3 it 
knows that it is a male or a female; its core 
gender role is established. But at this time, ac- 
cording to Rabban (1950), the child has little 
of the sex-role- characteristics, which are largely 
established after age 3. Still later come the pres- 
sures and anxieties about being acceptable as a 
masculine or feminine person. 

One’s sex and gender role are not always con- 
sonant, and some most interesting studies have 
been made of persons caught in this predicament. 
Money (1965), for instance, describes two cases 
that were hermaphrodites and biologically the 
same. But one was raised as a girl and the 
other as a boy and each developed appropriately 
and without difficulty into these opposite gender 
roles. Other more confusing cases arise where 
the external genitalia give the appearance of 
one sex—this is announced by the obstetrician at 
the child’s birth—but these conceal internal 
genitalia carrying the primary characteristics of 
the opposite sex. The child is reared by parents 
who have a mistaken idea as to its sex; but 
when biological difficulties arise, usually at the 
approach of sexual maturity, the error in sex 
assignment is discovered. The earlier approach 
was to proclaim the biological truth, and expect 
the child to shift its gender role; but this turned 
out to be usually disastrous, courting psychotic 
breakdowns. Money (1965) and Stoller (1968) 
found it proved much better to let the estab- 
lished gender identification prevail, and refit the 
person’s physical and secondary sex character- 
istics to the person’s self-concept. To reverse the 
psychological structures of the self, incorporated 
around sexual identity and all the habits, atti- 
tudes, values, and ways of living this entails, is 
extremely hazardous. 

Stoller made extensive and precise studies of 
some patients with biological abnormalities, and 
some without biological abnormalities but with 
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gender-role anomalies. In many of these cases not 
only the child but one or both parents were inter- 
viewed, several were in analysis, and the inter- 
actions especially of mother and child were dis- 
covered in detail. He concluded that differences 
in gender from biological sex are almost always 
produced by differences in infant rearing. Most 
interesting were those male cases he called trans- 
sexual, These were male babies that developed 
within the first year and a half a sense of female 
identity; and he found and studied adult trans- 
sexuals whom he assumed to have been reared 
as were these males. The adult transsexuals only 
wanted their male bodies to be changed so as 
to conform to their feminine outlook. These pa- 
tients, he pointed out, had no touch of psychosis; 
they were quite sane. Further, this feminine 
identity was established not as an adaptation to 
severe pressures (see Rado, 1965), but under 
conditions of conflict-free pleasure, provided by 
an overindulgent, close-binding mother. Stoller 
points out how these transsexuals are sharply 
different from transvestites, fetishists who know 
they are male but periodically take a woman’s 
role for special reasons; different from effeminate 
homosexuals who like to dress in women’s cloth- 
ing, usually to satirize and make fun of women. 
In all the child transsexuals he saw, there were 
some striking similarities. There was a close 
mother-son symbiosis, with the mothers tending 
to hold their babies against their own bodies 
for much too long a time; the mothers all 
had a special identifiable bisexuality and empti- 
ness; they encouraged cross-dressing of their 
sons; and their husbands tended to be physically 
absent from the household, and were unconcerned 
with the cross-dressing and other behaviors of 
the son imitative of the mother. These trans- 
sexuals, he emphasized, never thought of them- 
selves as homosexuals, since they had the com- 
plete sense of being women. 

Probably none of the patients in the Bieber 
et al. study, and none of the women in the 
Gundlach and Riess (1968) study, had been faced 
with the biological and social problems just de- 
scribed. The males all accepted the fact they were 
males, although the H and many C had problems 
about potency and how masculine they were or 
were perceived to be; and how, by sexual aggres- 
sion or other behavior they could promote their 
social reputation or enhance or protect their 
self-esteem. Likewise the women studied all 
knew themselves as females but their means of 
establishing or expressing their gender role is 
quite different in this society than for men. 
Girls learn femininity by absorption, without 
effort by imitation of mother (see Lynn, 1962). 
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They do not have to assert themselves to be 
women. 

I would like to end by calling attention to two 
bits of evidence about female homosexuality, 
each of which raises interesting questions. 

We find that the incidence of homosexuality 
in relation to birth order depends upon a number 
of factors (Gundlach & Riess, 1967). For in- 
stance, even with our limited sample of about 
225 H and 230 C there were significantly (.05) 
more lesbians who were firstborn among families 
with only one or two children. However, of 
families with five or more children, 12 were first- 
born but only three of these were H. Of these 
large families, 22 were fifth or later in rank, 
17 H and five C; of the 48 who were in the first 
four ranks, 16 were H and 32 C (p < .001). 

What makes this important is that quite dif- 
ferent family dynamics must operate with the 
firstborn girls from small families than for later- 
born girls from large families. It might be that 
firstborns are pushed to produce, and feel they 
must prove themselves with accomplishments as 
boys must do, while the later-born girls in large 
families are perhaps largely neglected. (Often the 
last-born male is the one focused upon by mother, 
and becomes homosexual.) But furthermore, the 
question arises as to why it is that the firstborn 
girls in large families are not as equally prone 
to being lesbian as within smaller families? 
I cannot say, but it seems that something about 
the decision cannot occur until after it is estab- 
lished that there will be five or more siblings; 
and by that time the firstborn girl must be 
10-15 years old! 

The other exciting item comes from the study 
of the patterns of female homosexuality in 4 
prison for women convicted of a felony. Ward 
and Kassebaum (1965) describe how the majority 
turned to homosexuality as a way of life: to seek 
closeness, warmth, and affection from another 
inmate as the way to survive the loneliness, 
degradation, and other psychological deprivations 
and injuries of incarceration. It seemed bound 
up with emotional needs. Yet 68% of the women 
were mothers; and only 5-15% were estimated 
to have been homosexuals “outside.” 

In men’s prisons, on the contrary, males still 
feel they must demonstrate their masculinity by 
force: by fighting, sodomizing weaker ones, buy- 
ing favors, and discharging debts with a sè i 
“favor.” However, the persons who are force’ 
to comply, or traffic in sex, are viewed W! 
contempt by those rated as highly masculine. 
Vet these forceful leaders, who use the others 
sexually, do not consider themselves homosexuals. 


| 


| 


This brings up the point that some men can 
have sexual relations only with men, or only 
with women, or with neither, or both, or with 
a fetish; and some are willing and able to engage 
in sexual activity with a great variety of objects 
and/or persons. 

Most women, however, express the feelings 
that they cannot have sex, or do not wish to, 
without “love.” But many find that they can 
love a man or a woman. Few are like the major- 
ity of male H in treatment, who seem to fear 


‘or even have a phobic reaction to heterosexual 


intercourse. So—what is the (varying) impor- 
tance of the sex of the sexual partner, to the 
Sis gender role and self-concept? 

In the light of evidence of cultural determina- 
tion of gender role and sexual practices, the pos- 
sibility of an innate physical/personality charac- 
teristic determining homosexuality seems quite 
remote. 
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PATIENT AND NONPATIENT SAMPLES * 
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A comparison is made of studies emplo: 


ying patient and nonpatient samples. 


Evans has made an important contribution to etiological studies of male homo- 


sexuality by replicating the Bieber qu 


estionnaire items with a nonpatient 


sample. Nonpatient samples are important because of the possibility of ruling 
out the contaminating factor of psychopathology other than homosexuality 


per se. Serious questions are raised abo! 


ut the role of parent-child disturbed 


relations as necessary and sufficient conditions leading to adult male homosexu- 
ality. Other etiological variables are briefly stated. 


It can no longer be questioned that faulty, 
disturbed, or pathological parental relationships in 
early childhood are more commonly reported by 
male homosexual patients than by a comparable 
group of male heterosexuals. The studies of West 
(1959), Braaten and Darling (1965), Freund 
and Pinkava (1961), O’Connor (1964), and 
Bieber, Dain, Dince, Drellich, Grand, Gundlach, 
Kremer, Rifkin, Wilbur, and Bieber (1962) are 
in agreement on this finding although they differ 
with respect to the specific critical dimensions of 
the parental relationships. In some studies, the 
content of the disturbed relationship is very gen- 
eral and lacks specific behavioral criteria. Thus, 
for example, West finds that homosexuals more 
frequently have “overintense” relations with their 
mothers and “unsatisfactory” relations with their 
fathers. In O’Connor’s study, homosexuals are 
more attached to their mothers and do not “re- 
late normally” to their fathers, On the other 
hand, Bieber et al. very carefully delineate a 
substantial number of behavioral criteria for the 
close-binding-intimate (CBI) mother and the 
hostile, detached father as illustrated by the items 
pista for cross-validation in the Evans’ (1969) 
study, 

Evans’ careful replication of the Bieber study 
with nonpatients who answer questions directly 
concerning parental relationships is a very im- 
portant contribution to the further clarification 
of the role of parental relationships in homosexu- 
ality. The etiological role of parental relation- 
ships in producing homosexuality is an inference 
which cannot be justified from psychiatric sam- 


1 This investigation was supported by a Research 
Career award from the National Institute of Mental 
Health, United States Public Health Service. 

2 Requests for reprints should be sent to Evelyn 
Hooker, Department of Psychology, University of 
California, Los Angeles, Los Angeles, California 
90024. 
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ples alone, in part because of the contamination 
of homosexuality with psychopathology. Similar 
results with nonpatient samples tend to confirm 
that inference even though they are not con- 
clusive because of the necessarily large number 
of other uncontrolled variables in a retrospective 
study. 

Prior to Evans’, five studies of nonpatient sam- 
ples have appeared. As with the patient samples, 
no precise cumulative picture emerges because 
the investigators used different dimensions of 
parental relations. Thus, Westwood (1960) 
found indifferent or poor relations with the 
father and an overprotective or possessive mother 
in approximately 50% of his sample. Unfortu- 
nately, the lack of a control group makes it im- 
possible to assess the significance of this finding. 
In the Schofield (1965) study, homosexual men 
in a nonpatient sample reported a higher inci- 
dence of poor relations with the father and of 
overprotective or overpossessive mothers than 
did a similar sample of heterosexual men. Bene 
(1965) found no evidence that homosexuals were 
more often attached to their mothers or ovet- 
protected and overindulged by them. They more 
often reported more hostility and less affection 
from and toward both mothers and fathers than 
did heterosexual men. In Apperson and Mc- 
Adoo’s study (1968) mothers of homosexuals 
were found to be less restrictive than were the 
mothers of heterosexuals, and the fathers were 
described as more cold and impatient. Thus, the 
picture of the mother which emerges from these 
studies is contradictory but that of the father is 
fairly consistent from one study to another. 
However, in Greenblatt’s (1966) study no differ- 
ences were found between homosexual and hetero- 
sexual men in ratings on a semantic differential 0 
attributes of their mothers and fathers. Indeed, 
fathers of both groups were rated by them aS 
good, generous, pleasant, dominant, and under 
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» and mothers as good, pleasant, neither 
nor dominant, and neither overpro- 
underprotective. None of these studies 
exact comparison with the investiga- 
‘patient samples because of differences 
logy or in the definition of the rele- 
sions. 
udy is therefore of special importance 
confirmation of the Bieber assump- 
it causal relation between parental rela- 
pearly childhood and adult homosexuality. 
view, Evans is overly cautious in his 
that his findings neither confirm nor 
the etiological role of parent-child rela- 
‘one set of many variables influencing or 
homosexuality in adult life. Indeed, his 
sitates this generalization since it is a 
m Whereas it is unwarranted from the 
üdy alone. His caution and the lack of 
ite caution in the Bieber study are both 
ited extremes. 
sue of contaminating psychopathology 
josexuality, unfortunately, is not settled 
? study since he did not control for 
. In fact, the amount of psycho- 
‘other than homosexuality per se in the 
samples thus far investigated is un- 
t for Greenblatt’s study: Theoreti- 
tients should show less psychopathol- 
if it is assumed that greater psycho- 
is the impetus to treatment. However, 
demonstrated in the Manhattan and 
County investigations, psychopathology 
Widespread in an untreated population. 
thology is only one set of the complex 
of variables partially influencing whether 
it is sought. 
itt took this into account in the selec- 
his nonpatient groups by administering 
[PI as a measure of psychopathology. 
the homosexual and the heterosexual sam- 
‘ound to consist of two subgroups: one 
as any group in the MMPI litera- 
the other showing some psychopathology. 
» there was more psychopathology in the 
ual “nonnormal” subgroup than in the 
tual one. Nevertheless, the total group 
much less psychopathology than is true 
Ss generally. The highest score on any 
Scale for the homosexual group was on 
on Scale and this mean was only 
S, the fact that Greenblatt’s data do not 
Bieber findings may be due to the 
ive lack of psychopathology in the total 
Nonpatient group. On the other hand, 
due to differences in methodology. 
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Some supportive evidence for interpreting the 
results as due to the relation between parental 
relations and adult psychopathology rather than 
to homosexuality per se is given by Schofield’s 
finding that the incidence of disturbed parental 
relations was greater in the patient than in the 
nonpatient homosexual samples. Evans’ finding 
that relations with the father were somewhat 
better, and in fact were “moderately good” in 
some instances, as compared with Bieber’s sam- 
ple, lends some additional support to the argu- 
ment. Apperson and McAdoo’s findings that 
mothers of homosexuals were less restrictive than 
those of the control Ss could also be interpreted 
in this way. The evidence, however tentative, of 
the relation between disturbed parental relations 
and adult psychopathology makes it imperative 
that further studies include an assessment of 
psychopathology in the samples studied. 

The negative evidence of the Greenblatt study 
(no difference between parental relations in the 
homosexual and the heterosexual groups) and the 
findings in all studies that disturbed parental 
relations characterize only a portion of the homo- 
sexual sample as well as a smaller proportion of 
the heterosexual sample raise serious questions 
about the etiological role of parental relations in 
the genesis of homosexuality. Furthermore, stud- 
ies of schizophrenic patients have demonstrated 
that a very similar picture of parental relations 
characterize the early childhood (Lidz, Corneli- 
son, Fleck, & Terry, 1957). 

Thus it appears that disturbed parental rela- 
tions are neither necessary nor sufficient condi- 
tions for homosexuality to emerge. If it should 
be the case that they are more commonly found 
in the background of severely disturbed patients 
than of those individuals who function relatively 
well, the necessity of looking at other etiological 
factors should be underlined. 

Evans arrives at his position that innate fac- 
tors may be important by evaluating the signifi- 
cance of the father’s role. He argues that be- 
cause homosexuality develops in sons whose 
fathers are absent as well as in those with mod- 
erately good relations with fathers, the etiological 
role of the father must be questioned and the 
contribution of the son’s characteristics (perhaps 
innate) must be assessed. While I am in com- 
plete agreement with Evans about potential ge- 
netic or other etiological contributions to a homo- 
sexual object choice (even though there is no 
possibility of specifying what they are with the © 
evidence currently available), the force of his ar- 
gument about the father’s role is weak if one con- 
siders the function of the father in producing or 
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preventing homosexuality. That is, Bieber et al. 
assume that the presence of a warm, strong, and 
supporting father who intervenes on behalf of the 
son counteracts the effects of a CBI mother. The 
absence of the father, as well as a poor father, 
may leave the son in the double bind of maternal 
sexual seductiveness—maternal sexual restriction. 
Finally, there are four major theoretical issues 
concerning the etiology and determinants of per- 
sistent or predominant adult homosexuality. (a) 
Is the human organism psychosexually neutral at 
birth, so that learning processes determine homo- 
sexual object choice in adults, or are there in- 
herent sexual predispositions, which selectively 
influence the effects of learning? (b) What is 
the nature and content of the learning processes 
by which homosexual object choice develops? Is 
the appropriate developmental model a deviant 
role or a personality system with intrapersonal 
traits, motives, and gender identifications incom- 
patible with the social-sexual capabilities and 
self-other expectancies of adult relations with 
the opposite sex? Does positive conditioning 
of sexual responses to persons of the same sex, or 
negative conditioning to persons of the opposite 
sex, or a combination of both account for homo- 
sexuality? (c) Are particular periods in the de- 
velopmental process, such as early childhood or 
adolescence, critical for homosexual object choice? 
(d) Are parent-child relationships in the nuclear 
family crucial in determining whether an indi- 
vidual becomes homosexual, or are peer relation- 
ships in childhood and adolescence, and deviant 
„subcultures in adolescence and early adult life, of 
equal or possibly greater importance? These four 
issues are highly controversial and cannot be re- 


solved by the research evidence currently avail- 
able. 


Evetyn HOOKER 
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performance. 


A major responsibility of the psychologist 
is to insure that the techniques that he uses 
ally do what they purport to do. As 
leeh] once put it, “Regardless of one’s theory 
bout personality and regardless Of one’s 
ice of data, whether Rorschach, MMPI, 
intuition, table, equation, or rational 


‘Am I doing better than I could do by flipping 
Pennies?’ [Meehl, 1954, p. 136].” The as- 
ance of validity is particularly urgent in 
area of psychodiagnosis. Clinicians often 
tend that they do not use their tests in a 
hanical, additive way, but that their deci- 
ms are ultimately complex judgments. 
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VALIDITY OF CLINICAL JUDGMENTS BASED ON 
HUMAN FIGURE DRAWINGS ` 


ZEV WILLIAM WANDERER? 
Teachers College, Columbia University 


The literature establishes that clinicians have routinely been making diagnostic 
judgments based on techniques of equivocal validity. In a study considering 
clinical as well as experimental specifications for a suitable methodology, it 
was found that Draw-A-Person experts are capable of identifying mental 
defectives far beyond chance expectations. Four remaining matched groups 
(schizophrenics, neurotics, homosexuals, and normals), however, were found 
not to be identifiable, even after the experts were permitted a second chance 
to make a correct diagnosis. The expertness of the judges, as predicted from 
the ranks they were accorded by their peers, was unrelated to their actual 


Clinical judgments, however, are not exempt 
from scientific scrutiny. Hunt concluded that 
“we should consider the individual clinician 
as a clinical instrument, and study and evalu- 
ate his performance exactly as we study and 
evaluate a test [Hunt, 1946, p. 317].” 

At a 1958 Society for Projective Tech- 
niques symposium, Kenneth Little summarized 
the situation regarding the validity of projec- 
tive techniques: “It is somewhat embarrassing 
to have to say that the published evidence on 
projective techniques indicates that they have 
either zero, or at best, very low positive effec- 
tive validity indices, Even in those studies 
with the most positive of results, correlations 
are of an order of magnitude which make 
predictions for the individual largely a waste 
of time [Little, 1959, p. 287].” 

In 1949, Karen Machover standardized the 
administration and formalized the interpreta- 
tion of a figure-drawing test which later be- 
came known as the Draw-A-Person (DAP) 
test. Although she saw S’s verbal associations 
to his drawings as an auxiliary source of per- 
sonality information, she relied heavily on 
universal symbolism and other psychoanalytic 
constructs in the absence of, or in addition to, 
those associations. The decision to study the 
DAP as the subject of the current investiga- 
tion was based on three criteria; its wide- 
spread use (Sundberg, 1961), contradictory 
conclusions reported in reviews of the litera- 
ture (Jones & Thomas, 1961, pp. 256-258; 
Lublin & Lublin, 1967; Swensen, 1957, pp. 
460-461), and the allegation that “few of 
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Machover’s hypotheses have been explicitly 
tested by definitive studies [Swensen, 1957, 
p. 460].” 


Quest for a Method 


A validating use of statistics that makes no 
assumptions beyond those of the theory of 
probability is the method of correct matchings 
(Meehl, 1954, pp. 11-12). Dating back to 
Vernon’s (1936) early graphological experi- 
ments, the matching method keeps the cri- 
terion constant and turns to individual differ- 
ences in the dependent variable. In that way, 
it differs from classical experiments, modeled 
after those in the physical sciences which 
systematically vary the independent variable 
and record changes in the dependent variable. 
In a matching study which uses judges who 
base their judgments on test data, the judges 
are the Ss, and their judgments are the de- 
pendent variable. Judgments should be ex- 
pected to vary in response to the differential 
stimuli represented by the productions of 
discrete criterion groups. Hence, the method 
is called “differential” (Andrews, 1948). Ta- 
bles, prepared by Chapman ( 1934) and Du- 
dek (1952), provide expected frequencies for 
such forced-choice matches. 

The requirements set forth in the American 
Psychological Association’s (1954) Technical 
Recommendations concerning the report of 
validity information imply, by analogy, pro- 
cedures for the proper validation of a test. 
Consideration of these recommendations along 
with the findings of relevant studies (e.g., 
Bolotin, 1960; Chambers & Hamlin, 1957; 
. Dana, 1962; Dawes, 1962; Dennis, 1960; 
Mogar, 1962; Mosteller & Bush, 1954; Sec- 
ord, 1952; Wallon, 1959), criticism (e.g., 
Brown, 1952; Hamlin, 1954; Hammer, 1959; 
Schneider, 1950; Shneidman, 1959), unpub- 
lished manuscripts, and personal communica- 
tions by Karen Machover and Solomon Mach- 
over, provided enriching sources for concep- 
tualizing the method elected here. In sum, the 
empirical literature suggested that a concur- 
rent validity study, applying the method of 
correct matchings of global judgments (Cron- 
bach, 1950) to clinically homogeneous groups, 
would be appropriate. Researchers have cau- 
tioned that judges be experts, that samples be 
relevant, that demographic and sampling 
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variables be controlled, and that the task be 
clearly defined—not too simple, yet not un- 
manageably complex. 


Hypotheses 


1. Diagnostic judgments, made by experts 
using the DAP, agree, beyond chance expec- 
tations, with criterion statuses. 

2. Differences in accuracy of DAP judg- 
ments vary between clinical groups beyond 
chance expectations. 

3. The judges’ diagnostic accuracies are 
positively related to their reputed expertness, 
defined by ranks accorded them by their peers. 


PROCEDURE 


Consultations were held with the Machovers in 
order to determine which five clinical groups should 
be most appropriate to a diagnostic investigation of 
the DAP. Among those suggested were a psychotic, 
a neurotic, a mental defective, and a homosexual 
group, in addition to a group of normals, Other 
recommendations resulted in the decisions that all 
Ss be American Caucasian adult males residing in the 
northern United States, 


The Criterion Groups 


In order to avoid contamination of the criteria by 
the variable being investigated, Ss were selected only 
if they were never given the DAP for diagnostic pur- 
Poses (except for the mental defectives, who are 
almost routinely given the DAP in state institutions, 
but who are certified as mental defectives on the 
basis of their scores on standardized IQ tests). The 
Ss who passed all of the selection criteria were ad- 
ministered the DAP de novo in accord with the 
instructions and materials detailed in Machover’s 
(1949) monograph. 

The sample groups were carefully matched for 
age, ethnic status, socioeconomic class, marital status, 
and other relevant variables, except for the mental 
defectives, whose very condition precluded their 
being matched for intelligence and education.? 

The overall principle in the determination of the 
external criteria for the group labels was that the 
five clinical groups represent different life situations, 
recognized by commonly encountered diagnoses. Thus, 
the schizophrenics had been so diagnosed and com- 
mitted to a state hospital for most of their adult 
lives; the neurotics, on the other hand, lived an 
worked in the community, but attended an out- 
patient psychiatric clinic on a regular, weekly basis. 


8 Tables and appendices detailing this, and other 
information, are embodied in the longer dissertation 
upon which this report is based. Microfilm and 
Xerox copies of the dissertation may be ordered 
from University Microfilms, Ann Arbor, Michiga™ 
Pub. No, 65-7406. 


defectives achieved IQ scores well into 
range and had been legally certified 
ted to a state school for mental defec- 
homosexuals lived and worked in the 
did not receive any form of psycho- 
ent, denied that such treatment was 


selves as practicing homosexuals. The 
were finally selected lived and worked 
unity, attended an evening college, denied 
seen for psychological treatment, and 
“nonpsychiatric” scores on the Cornell 
4 


dex 


with Karen Machover, reputed experts in 
tation of the DAP were asked to name 
t least five other experts whom they con- 
larly competent in the diagnostic use 
Each of the nominees was then asked 
same, etc, until a pool of over 50 names 
e experts were contacted in the order of 
so that the 20 highest ranking cooperat- 
were designated the judges for this study.® 


of the Data 


j were selected from each of the criterion 
S drew a person and then a person 
opposite to that represented in the first 
S and his pair of drawings were 
code from a table of random numbers 
Smith, 1950). The design called for 100 
ments to be made by a total of 20 judges. 
Teceived one pair of drawings from each 
ical groups, Each pair was thus seen by 
rent judges. Twenty sets of five DAPs each 
for distribution in the following man- 
envelopes were labeled by clinical groups; 
of each S’s code number were placed in 
te envelope; one number was drawn 
h of the five envelopes to make up a set 
) sets were completed. 


are due the following who provided 
to clinical groups: Sherman O. Schachter 

at the New Hope Guidance Center, 
Norman Kilpatrick and the officers of the 
Society; Sidney Robbins and the staff at 
ip State Hospital, N. Y.; Issac Wolfson 

McCulloch and the staff at Letchworth 
ite School, N. Y.; Manny Sternlicht and 
it the Staten Island Community College. 
or is grateful to the 20 DAP experts who 
judges: Helen J. Anderson, Lauretta 
Brown, Hanna F, Faterson, Bernice M. 
manuel F. Hammer, Molly R. Harrower, 

lis, Selma L. Landisberg, Solomon Lie- 
Machover, Solomon Machover, Mur- 
s, Hanna S. Marlens, Frank S. Puzzo, 
an, Joanna Steinberg, Meta Steiner, 
» and Mildred Zadek. 


CLINICAL JUDGMENTS BASED ON DRAWINGS 


145 


The Task 


Each of the 20 judges received a package contain- 
ing five pairs of drawings representing orfe each of 
the five diagnostic groups, Included also was an en- 
closure describing the characteristics of the sample, 
and a letter defining the task: “Identify each coded 
pair of drawings as to the clinical group from which 
you feel it has been drawn. This is done by ranking 
the probability of each diagnosis for each pair.” At- 
tached to each pair of drawings was a “Judgment 
Form” which instructed the judge, 


Please look at the attached pair of drawings by 
the same person and decide which of the five 
groups they most likely represent. Place the num- 
ber “1” on the line next to that group in the list 
below, for the diagnostic group that seems most 
probable. Continue ranking the categories by 
placing a “2” on the line near the next likely 
possibility, and so on, so that the category which 
is least likely to be represented by these drawings 
will be marked with a “5.” Å 


The five categories were listed below the instruc- 
tions with an adjacent space for the rank numbers. 
The judges were also asked, 


Try to pinpoint in four statements the basis for 
your decision in each case, e.g, a scoring system, 
formal elements in the drawings, content, a detail, 
global impressions, a hunch, or your own unique 
system of evaluating drawings. 


Finally, each judge was asked to complete an “Ex- 
perience Questionnaire.” 
Auxiliary Experiment 

For expository purposes, the study was replicated 
but with the substitution of cards with chance- 
determined “judgments.” That is, the code numbers 
of the Ss were set up on a results tabulation chart 
similar to the one used for the actual DAP results. 
The E called out the code number of an S, and an 
assistant ë picked a card out of a box and read the 
“diagnosis” from the card. In order to simulate the 
forced-choice restriction of the first choice, the card 
was not replaced until the next “judge’s” turn 
came, This was done for all 20 “judges,” for 
two (rather than five) diagnostic rankings. These 
randomly drawn data were then compared with 
theoretical chance and with judgments by DAP ex- 
perts in appropriate tables. 


RESULTS 
Chance Expectations for the Experimental 
Procedure 
In the auxiliary chance-illustration replica- 
tion, the random drawings produced zero to 


6Thanks are due Louis Garzetta, who painstak- 
ingly kept this procedure random, yet simulated the 
judges’ actual task. 
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three “successes” per “judge” out of a possible 
five, with an average of one. The randomly 
drawn “successes” were tabulated. The chi- 
square computed to determine the significance 
of any difference between correct matchings 
produced by the random drawings and those 
expected by theoretical chance was .815, 
which, with df= 2, is nonsignificant, Suc- 
cesses for each criterion group, by this em- 
pirical method for demonstrating the effects 
of chance, averaged close to the expected 20%. 
Similarly, when accurate matchings on a sec- 
ond run of the random drawings were com- 
bined with those obtained on the first random 
drawings trial, the resulting cumulative “hits” 
resembled frequencies theoretically expected 
by chance. Thus, the “successes” averaged 
two out of a possible five (since there were 
two chances to be correct), and “successes” 
for each criterion group averaged 40%. The 
chi-square, computed to determine the signifi- 
cance of any difference between the successes 
obtained on the random drawings and those 
expected by chance, was .143, which, df = 2; 
is nonsignificant. The experimental procedure, 
stripped of the “experts using DAP” variable, 
generated results essentially identical with 
those expected by chance alone. 


Successes Achieved by DAP Experts 


The average number of successes by DAP 
experts was twice the number arrived at by 


TABLE 1 


SIGNIFICANCE OF THE DIFFERENCES BETWEEN JUDGES’ 
Successes AND Tose EXPECTED BY CHANCE 


Number of judges 


Note. —AIl decimals rounded to nearest integer. 


a Where individual * cells yielded expected frequencies 
ess than 5, adjacent cells were combined. $ 
$p <.01; xt = 13.82, df = 2. 
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the random drawings; the DAP successes for 
the criterion groups averaged close to 40%, 
The chi-square, computed to determine the 
significance of the difference between the suc- 
cesses of the DAP experts and those obtained 
by random drawings, was 40.42, which, dj = 
2, is significant beyond the .001 level. 

After the first and second correct match- 
ings were combined, a chi-square comparison 
constructed to show the frequency of ran- 
domly drawn correct matches over two trials, 
tevealed, again, that the DAP judges’ overall 
performance differed from that produced by 
random drawings at a highly significant level, 

Chi-squares were computed to determine 
the significance of the differences between the 
successes of the DAP experts and those ex- 
pected by chance. Since the computation of 
chi-square requires that no cell have an ex- 
pected frequency of less than five, those cells 
which had fewer theoretical frequencies were 
combined (Siegel, 1956, p. 109). The result- 
ing frequencies are listed in Table 1. For ex- 
ample, although only five judges would be ex- 
pected to achieve two or more successes on 
their first-ranked diagnoses by chance alone, 
as many as 15 DAP experts scored that many 
Successes. Similarly, although only seven 
judges would be expected to achieve three or 
more successes over their first and second 
choices combined (by chance alone), as many 
as 17 DAP experts scored that many successes. 
The highly significant chi-squares, when 
viewed superficially, seem to indicate that the 
demonstrated ability of the experts to dis- 
criminate between the five clinical groups, on 
the basis of the DAP, was not due to chance. 
Further analysis of the data reveals, however, 
that the mental defectives were detected al- 
most perfectly when the judges used up their 
first ranks, and with 100% accuracy when 
the judges’ first two diagnostic opportunities 
were combined, 


Variances between Judges and between 
Clinical Groups 


For diagnoses ranked first, one judge out of 
20 scored a perfect five “hits” out of five pos 
sibilities, and one judge scored no “hits. 
Analyses of the variances between judges i"- 
dicates that the differences are probably due 
to chance. 


de range of successes by DAP experts 

e ical groups was revealed. Anal- 
jances between groups indicated 
fference in the susceptibility of the 
ups to be identified by means of 
statistically highly significant. 


utions of the Mental Defective 


highly significant variance between 
raises a question: What is the contri- 
of the mental defective group to the 
that experts can identify clinical 
y using the DAP alone? This ques- 
answered by dropping the mental 
ye group from the findings and testing 
ning data for significance. The “To- 
of the accuracy tables were ad- 
show the distribution of the successes 
experts in identifying only four of 
al five clinical groups: the schizo- 
ts, neurotics, homosexuals, and normals. 
ignificant chi-squares on Table 2 
Suggest that the initially observed 
the experts to discriminate between 
t clinical groups, on the basis of the 
probably due to the obviousness of 
igs by the mental defectives—espe- 
en the experts had two opportunities 


tion of the Possibility of a 

g Group 

statistical significance, then, of the 
il finding that experts can identify clini- 
ups by using the DAP vanished once 
ental defective group was omitted. The 
the schizophrenic group was mis- 
More often than any other group 
a different question: If the schizo- 


ld the vanished significance reap- 
“Total” column of the accuracy 
Was adjusted to show the distribution of 
of the DAP experts in identifying 
the original five clinical groups. 
nonsignificant chi-square indi- 
the observed ability of the experts 
inate the three clinical groups on the 
DAP (such as the correct identi- 
1 out of 20 homosexuals) was still 
bounds of chance expectations. The 
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TABLE 2 


Jupces’ DAP Successes COMPARED WITH CHANCE 
SUCCESSES AFTER OMITTING THE MENTAL 
Derective Group 


Number of judges 
Number of Chi- 
“hits” | Theoretical] DAP | square 
chance experts 
First choice 1,328 
2 or more 
1 


ans: xtos = 5.99, df = 2, 


schizophrenic group, therefore, was not suf- 
ficiently misleading to determine the chance- 
like performance of the experts when the 
mental defective group was omitted. 


Distribution of Incorrect Judgments 

A table was constructed to determine 
whether the misdiagnosed Ss were misdiag- 
nosed consistently, and how they were identi- 
fied by the judges. Table 3 indicates that the 
outstanding group was the mental defectives, 
which had only one misdiagnosis. Incorrect 
judgments for the rest of the groups were 
fairly scattered with no significantly consistent 
pattern. 


Other Findings 


1. Overall interjudge agreement was less 
than half as great as it would have been had 


TABLE 3 
DISTRIBUTION OF INCORRECT JUDGMENTS 


Frequency of judgments 


; |Homo-| Nor- 
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all the judges agreed about all Ss. The great- 
est contribution to interjudge agreement was 
the mental defective group, about which there 
was 90% agreement, 

2, The ranks for expertness that the judges 
received from their peers (in the early nomi- 
nation procedure) were juxtaposed with the 
ranks the judges achieved by virtue of the 
accuracy of their clinical judgments. The re- 
sulting Spearman rank correlation coefficient, 
corrected for ties, was —.36, which is nonsig- 
nificant (7¢.95 = .377). 

3. Attempts to analyze the judgmental 
process (judges’ statements in support of their 
decisions), including the procedure used by 
Chambers (1954, pp. 17-18, 23-24, 33-34, 
56-58), revealed that stylistic differences be- 
tween DAP judges were the same as between 
Ss for any one judge. No differentiating pat- 
tern emerged, 

4, Analysis of the “culture” of DAP experts 
revealed that women outnumbered men by 
better than two to one. There were five times 
as many experts who were primarily engaged 
in clinical work than in teaching and research; 
about two-thirds of the experts worked in in- 
stitutional settings, and although 76% relied 
“heavily” on the DAP, only 32% regarded it 
as a “primary tool.” 


Discussion 
Adequacy of the Method 


The differential findings—that the judg- 
ments were consistently valid for one criterion 
group and consistently invalid for the remain- 
ing groups—imply that the procedure con- 
tained no elements which would tend to favor 
or disfavor the object of the investigation. 
Comparison of the findings with those of 
Chambers’ (1954) study of the Rorschach, 
which provided the foundation for our design, 
suggests that the DAP, not the method, was 
responsible for our results. As in the current 
study, Chambers found highly significant 
overall positive results: Her mental defectives 
were identified 90% of the time, far more fre- 
quently than were the four other groups. 
Omission of the mental defective group, how- 
ever, hardly affected her findings; the result- 
ing chi-square was 16.3, leaving the positive 
results unchanged and still significant beyond 
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the .001 level. Thus, application of the same 
basic design which achieved positive results 
with the Rorschach in the Chambers’ study, 
achieved negative results with the DAP in our 
study. The negative results must be attrib. 
uted to the DAP, and not the experimental 
design. 

Five methodological innovations were added 
here to the basic correct-matchings design; 

1. The Ss were administered the DAP de 
novo in order to avoid contamination of the 
criterion by the predictor. 

2. A method was used for determining the 
relative contributions of individual criterion 
groups to the total results. 

3. By requesting the judges to rank their 
diagnoses, the judges were freed from the re- 
striction of having to make an absolute deci- 
sion on a protocol about which they may have 
lacked confidence because of what they re- 
garded insufficient or conflicting evidence. 

4. A procedure for the identification of ex- 
pert judges proved successful. The nomination 
and ranking procedure provided an additional 
source of information for determining the rela- 
tionship between the judges’ reputations 
among their peers and their relative actual 
performances. 

5. The definition of criterion groups by life 
statuses, rather than by psychiatric diagnoses 
or psychological test results alone, provided 
an operational method of avoiding the haz 
ards of unreliability or nonfactuality of diag- 
noses per se. 


Why Then is the DAP Popular? 


The studies which report positive findings 
for the DAP are primarily case studies. There 
is little doubt that there are instances when 
individuals produce sensationally revealing 
drawings. As Swensen (1957, p. 461) pointed 
out, such cases are more likely to be remem- 
bered than are the many which are not re- 
vealing. Furthermore, such cases are more 
likely to be submitted for publication than arè 
the less eloquent ones, 

Another conjecture about the cause for 
the positive regard which so many clinicians 
have for the DAP was advanced by Shaffer. 
Prior to and during the administration of the 


TL, F. Shaffer, personal communication, 1958- 
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CLINICAL JUDGMENTS BASED ON DRAWINGS 


DAP, interview material and behavioral clues 
are typically observed by the clinician; upon 
"interpreting the drawings that the client has 
produced, the clinician may attribute the in- 
 terview and other extratest knowledge, which 
he has about S, to the drawings, “seeing” in 
, them what he knew already. “Blind” studies 
eliminate this hazard. 
Little has proposed an explanation for the 
failure of projective techniques to prove valid: 
the matter of situation generalization. The 
test tests the organism, he claimed, not the 
‘astronomical number of alternative stimulus 
‘situations in which the organism functions 
‘and about which the clinician attempts to 
“predict. By means of generalization, it is pos- 
“sible to reduce the number of stimulus cate- 
gories sharply. “We are able to say that one 
man’s candy is another man’s brandy and call 
both consummatory responses, or oral behav- 
‘ior, etc. This inevitably leads to a loss in 
precision in the prediction of specific behavior 
$0 we find ourselves anticipating alcoholism 
when obesity in fact occurs [Little, 1959, p. 
-288).” 
Hammer has suggested that it is fallacious 
to conclude that drawings do not reflect psy- 
chopathology simply because associations be- 
_ tween drawing signs and criterion groups have 
"not been demonstrated. 
i 
Al of these “schizophrenic signs” are relatively in- 
frequent in projective drawings of the human figure, 
~ but where they do occur, they occur in the drawings 
of schizophrenics, . . . To test these hypotheses ade- 
quately, only instances where the sign does occur 
should be included. For example, to wait to accumu- 
late twenty subjects’ drawings and then compare the 
idence of schizophrenia in the subjects who sub- 
Mitted these drawings, would be the only way to 
fn fairly the validity of the sign [Hammer, 1959, 


Essentially, Hammer is simply proposing to 
Ìgnore false negatives in DAP predictions. A 
i high incidence of true positives would mean 
at some drawings reflect some blatant char- 
acteristics of the Ss who produced them, but 
Me could still wonder about how much confi- 
C could be placed in an instrument which, 
schi alone, would send a large proportion of 
zophrenics merrily on their way, since 
ry would be identified as anything but 
schizophrenics, . 
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Perhaps, the most basic question to investi- 
gate is: What causes clinical psychologists to 
believe in and use instruments which repeat- 
edly fail the test of diagnostic validity? Could 
the occasional, eloquent drawing provide the 
clinician with partial reinforcement, producing 
greater resistance to extinction? An experi- 
ment investigating Shaffer’s * hypothesis that 
beliefs of psychologists direct their reinforc- 
ing acts could shed much light and may prove 
convincing even to the committed. 


8L. F. Shaffer, personal communication, 1959. 
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Editor’s Note 


The Editor has invited the following discussion by Emanuel Hammer. 


: 


tana of Consulting and Clinical Psychology 


1969, Vol. 33, No. 2, 151-156 


considerations. 


Has Wanderer’s study cracked open the hard 
nut of the question of the basic validity of the 
Draw-A-Person (DAP) technique? 
As a research question runs the gauntlet 
through evidence and counterevidence, one tries 
to make sense of the resulting confusion. Wan- 
derer’s study joins those on the side which have 
failed to demonstrate that psychologists can offer 
valid diagnoses on the basis of figure drawings 
(Sherman, 1958a, 1958b; Sipprelle & Swensen, 
1956; Swensen, 1957; Whitmyre, 1953). Along 
with this, several of Machover’s (1949) in- 
terpretive hypotheses have emerged bearing evi- 
dence suggesting their lack of validity (Blum, 
1954; Fisher & Fisher, 1950; Grams & Rinder, 
1958; Hammer, 1954; Reznikoff & Nicholas, 
1958; Ribler, 1957). 
Yielding a mixture of results, Hammer and 
Piotrowski (1953) found (a) a reassuring degree 
of both reliability and validity in the interpreta- 
tion of aggression reflected in the House-Tree- 
erson drawings, and also, (b) that what variation 
there was among the clinicians’ interpretations 
appears to be secondarily influenced by their own 
Projections and/or degree of sensitivity.” 
On the other hand, and in contradistinction to 
findings of Wanderer, a number of studies 
RE demonstrated clear-cut differences between 
N of “normal” and pathological groups 
stasi & Foley, 1941; Berrien, 1935; Eigen- 
7 1951; Goldworth, 1950; Gunzburg, 1954; 
Cee & Wexler, 1950; Hozier, 1959; Plaut & 
Sha 1955; Reznikoff & Tomblen, 1956; 
idl-Waehner, 1942; Singer, 1950; Springer, 
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DAP: BACK AGAINST THE WALL? 
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Methodological and conceptual criticism of Wanderer’s study, “Validity of 
Clinical Judgments Based on Human Figure Drawings,” revolves around (a) 
correlation procedures, (6) sampling errors, (c) arbitrary categorization of 
data, (d) overlap between diagnostic groups employed, (e) the insufficiency of 
fractionated material as projective samples, and, most importantly (f) clinical 


1941; Wexler & Holzberg, 1952). Steinmann 
(1952) devised a scoring system which was found 
by Graham (1955) to correlate .70 with the 
degree of pathology in psychosis. Hiler and Nes- 
vig (1965) also demonstrated that psychiatric 
patients could be differentiated from normals on 
the basis of projective drawings. They validated 
the characteristics of pathology as reflected in the 
following qualities: “bizarreness,” “distorted as- 
pects,” “incompleteness” and “transparency” of 
drawings. The most valid of the criteria was 
“bizarreness,” which subsumed a drawing looking 
either “grotesque,” “inhuman,” “sinister,” “sick,” 
“ghoulish,” “weird,” or “gnomelike.” In a cross- 
validation sample, it was found that 46% of the 
patients and only 2% of the control group pro- 
duced a drawing which, by independent judgment, 
was “bizarre.” 

Griffith and Peyman “(1965) found that eye 
and ear emphasis in the DAP was, to a statisti- 
cally significant degree, associated with ideas of 
reference. Albee and Hamlin (1950) devised a 
scale which, while unable to differentiate between 
schizophrenics and anxiety neurotics, did reliably 
differentiate between these two groups and a nor- 
mal one. 

Wanderer’s study is now added to the negative 
side of the mixed results which have proliferated 
over the years. As we evaluate his study, there 
are, however, several ways in which it appears 
the hypotheses were handicapped. 

No significant correlation is found between the 
“expertness” of the judges and the validity of 
their sortings. With all the experts possessing na- 
tional reputation, they quite evidently fall into a 
tight cluster with little of the spread necessary 
to allow a statistically meaningful correlation. 

Totally disqualifying one facet of the study, 
the judges did not even see the same set of five 
cases, There is thereforé neither common ground 
nor any defensible basis for appraising one expert 
against another in correlating validity and “de- 
gree of expertness.” Surprisingly, scientific method 
was abandoned to a situation where one judge 
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may have had a relatively classical set of draw- 
ings while another a quite confusing set. 

In Table 1, “Number of hits” is put into the 
categories of “0,” “1,” or “2 or more.” This kind 
of lumping of the data seems rather arbitrary. 
Combining all “hits” of 2, 3, 4, and 5 into this 
last category tends to dilute the data favoring 
the hypothesis. The judge who attained five out 
of five correct sortings does thereby not con- 
tribute any more weight to the hypothesis than 
the one who attained only two out of five correct 
matchings. 

Also, the fact that the judges were asked to 
sort one DAP set into each of five categories did 
not allow a judge to elect to call two of the cases 
psychotic when they looked that way to him. 
Thus, in this instance when a judge was wrong 
on his first choice he was automatically wrong 
for a 40% rather than a 20% assessment, that is, 
the second case felt to also fall in the category 
already used up now had to be placed in a dif- 
ferent one. Had both drawings been allowed to 
be classified schizophrenic, it might here have 
resulted in only a 20% error. (Then, of course, 
the same handicap occurred, now on the next 
level, when, in going through the drawings a sec- 
ond time, the judges again had to put only one 
case in a category.) 

Wanderer subtracts the influence of the con- 
spicuous contribution to the validity of the sort- 
ings made by the mental defective cases. It is 
hard to understand why he did not merely rotate 
each category and use analysis of variance to 
solve this problem more neatly. 

Wanderer quotes the present author, in ref- 
erence to that rare but striking schizophrenic 
sign which may appear in a drawing, as saying 
that the most appropriate way to test its validity 
statistically would be to compare in what groups 
it occurs when it does occur. Wanderer then criti- 
cizes this as a procedure ignoring false negatives. 
His argument here would be very much like 
contending that hallucinations or delusions in a 
psychiatric patient cannot be used as suggestive 
of a psychotic process since they do not occur in 
most psychotics. Wanderer’s misconception of 
“false negatives” would result in the most striking 
diagnostic clues being the very ones which “re- 
search” demonstrates should be ignored.* 

To turn to Wanderer’s sample, there is a ques- 
tion of overlap in the five groups (with a limited 
sample of 20 in each) which tends to contami- 
nate the criteria, Experience in outpatient mental 
hygiene clinics is that there is a high proportion 

8“Some circumstantial evidence” says Thoreau, 


“is very strong, as when you find a trout in the 
milk.” 
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of borderline and ambulatory schizophrenics in 
this population, Wanderer doesn’t indicate how, 
or if, he sifted them out of his “neurotic” (men. 
tal hygiene clinic) sample. Additionally, we know 
that intense homosexual conflicts are frequently 
part of the psychotic, as well as the neurotic, pic- 
ture; and this would clearly overlap with the 
DAPs of the homosexual group. In fact, the overt 
homosexual group, having come (at least com- 
paratively) to peace with needs in this direction, 
may flood their drawings with Jess of the homo- 
erotic than would the neurotic and psychotic 
groups, patients in whom the problem areas press 
insistently forward in their projective responses, 

When the comparison is between homosexual 
and normal groups, Machover, Puzzo, Machover, 
and Plumeau (1959) demonstrate statistical sig- 
nificance in the reliability and the validity of 
DAP differentiation “blindly” done: The average 
two-way total scale intercorrelation among three 
graduate-student judges who each independently 
scored the protocols was found to be .83. The 
efficacy of differentiating male homosexual from 
nonhomosexual controls was significant beyond the 
.001 level of confidence. 

As to how many of Wanderer’s “homosexual” 
group may have been neurotic or psychotic, we 
do not know. There was no procedure to assay 
or rule these out. i 

The complications of overlapping of groupings 
may be seen more clearly in the examples that 
(A) a neurotic depression and (B) a psychotic 
depression, may look more alike than different in 
a projective sampling, particularly of just two 
drawings obtained. The same may be true of 
(C) a neurotic paranoid integration and (D) @ 
paranoid type of schizophrenic. In fact, on this 
brief sample, A, looking more like B than like its 
official category mate C, and C looking more like 
D than like its experimentally defined companion 
A would mean that the depressive and paranol 
dimensions, respectively, would emerge but not 
be differentially related to whether their presence 
was of neurotic or psychotic proportions. When 
such is the case, we could scarcely move towar 
the conclusion that the drawings here provide 
nothing diagnostically valid. The same point may 
be made when the major drawing theme 1s the 
pivotal one of strong dependency needs—whether, 
in line with Wanderer’s groups, on the part ° 
(a) a “normal” subject sustained by a marriag? 
to a warm, maternal woman, (b) a “neurotic 
heavy drinker, (c) a “homosexual” whose Le 
ferred erotic activity is performing fellatio, ie 
(d) an institutionalized patient responding to t 
dependency support provided by the hospital. 
Hutt (1968) points out, “Projective tests seem 
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to be especially valuable in assessing covert fac- 
tors... . They are less successful in delineating 
or predicting overt behavior which is likely to be 
the end-product of many intervening factors.” 

Wanderer’s study rests on a basic assumption 
of mutually exclusive pigeon holes for people. 
His data, however, if viewed from the broad 
perspective, stand as a banner to the refusal of 
complex human beings to fit into neat categories 
(at least on brief segments of behavior). 

As far back as 1947, Tomkins suggested that 
the psychiatric entities “do not, in our opinion, 
represent homogenous entities even at the level 
of symptomatology.” Schafer (1954), too, speaks 
of 


researchers who naively expect that dumping all 
patients described as “paranoid” (or “anxious,” 
“schizophrenic,” “well-adjusted,” etc.) into one group 
will consistently yield highly instructive (results) 
,.. and whose conception of test theory and research 
stops right there [p. 285]. 


The fact that the schizophrenic group, in Wan- 
derer’s study, was misdiagnosed more often than 
all of the other groups is consistent with the view 
that there is not a schizophrenic group per se. A 
proliferating of more varied types is reflected in 
Beck’s (1954) expansion of four to six subtypes 
of schizophrenia as reflected in the Rorschach. 
Evelyn Hooker makes the point more broadly in 
asserting that there is no such entity as the homo- 
Sexual. In describing some of the “worlds” of 
homosexuals seen in one large city, she reports 
no common pattern of personality structure, much 
less dynamics, in homosexuals. Willis (1967) 
similarly found no similarity among homosexuals 
other than in their choice of sex object. 

The mentally defective group was undoubtedly 
diagnosed validly because of its clear-cut lowered 
intellectual and conceptual qualities. But it may 
also have been easier to diagnose because mental 
defectives are a more distinct entity whose mem- 

ts more intimately share a common denomi- 
nator with less spread and variability. And where 
tach of the other four groups had crossover 
features with each other, this one group had few 
or none (at least in one direction, that is, prob- 
ably, not a single “normal,” “neurotic,” “psy- 
shotic,” or “homosexual” was also mentally de- 
fective), 

Shaffer and Schafer are, I think, both right; 
awrence Shaffer (quoted by Wanderer in the 
a under discussion) partly so, and Roy 

: fer (1954) more so. We do, to be sure, tend- 

a read into projective material from what we 
Ow of the case. But with this providing the 
tame of reference, we then can read from the 
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drawings significantly further. If—as in actual 
practice (operating neither “blindly” nor employ- 
ing a mere splinter of the battery)—we know the 
patient to be in a psychotic depression and we 
find no affect, depressive or otherwise, in the 
projective drawings (or elsewhere in the projec- 
tives), we may extend from the known into the 
previously not known. We can then understand 
that a state of emotional dehydration may be 
present, and may be here employed as a last- 
ditch defense against the intolerable despondency 
and despair. The blandness may involve de- 
fensive denial of the inner experience. To follow 
the thread into the tissue of the understructure, 
we can then link up surface and subsurface so 
that it makes sense of the apparent inconsistency. 
If an individual offers a drawing production 
seeped in sadism and gore (and his Rorschach 
and TAT is similarly so) but behaviorally he is 
mild and meek, do we discard the projective level 
data as invalid, or do we deduce that this man’s 
surface behavior is maintained at the price of 
considerable repression, inner strain, and tension? 

In these cases, we can see illustrated the prin- 
ciple that a small projective sample may be in- 
sufficient to convey the central diagnostic quality 
of the patient, but may nevertheless still con- 
tribute deeper understanding to the personality 
picture. 

Roy Schafer (1954) vigorously underscores the 
dangers of item interpretation and emphasizes a 
reliance instead on themes which extend across 
many projective products. Similarly, in the pres- 
ent writer’s The Clinical Application of Projec- 
tive Drawings (Hammer, 1958) may be found, 
prominently asserted: 


In actual clinical practice, the dangers of basing 
interpretative deductions on isolated bits of data are 
obvious. In practice, confirmation of interpretative 
speculations on the basis of one drawing must be 
checked against not only the other drawings, but the 
entire projective battery, the case history, the clinical 
impression gleaned during the interview with the 
subject and all other available information [p. 49]. 


An experimental investigation which takes a mere 
chip off this procedure bears little resemblance to 
clinical practice and, as a study, has built into it 
only the slimmest chance of surviving its handi- 
cap. 

Another basic consideration has always been 
S’s perceptions of the purpose of the test and his 
particular reason for the examination, a highly 
determining variable coloring S’s style and re- 
sponses (Cronbach, 1946; Hutt, Gibby, Milton, & 
Potthurst, 1950; Murstein, 1965; Tolor, 1968)— 
crucial material unavailable in “blind” assessment 
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studies. In accord with this, Tolor (1968) speaks 
of the “situational factors (which) can exert 
great influence on all types of performance, and 
must be taken into account.” Tolor then goes on 
to join Buck (1966) and Machover (1949) in 
underscoring the importance of the patient’s 
verbal associations to his own drawings, data 
whose absence in Wanderer’s study fractionates 
the already fractionated material further. As 
Tolor (1968) advocates, 


Place greater weight on associations produced in 
response to the drawings and do not depend solely 
on (the graphic projection). . . . Enlist the patient’s 
assistance in establishing interpretive hypotheses. The 
mode used by a patient to reconcile disparate per- 
formance, to rationalize behavior, and to gain insight 
represents important data for interpretation [p. 226]. 


A projective device, by its nature, has a buck- 
shot quality. Because of its uncertain aim, a 
projective technique cannot be appraised on the 
basis of whether or not it invariably focuses upon 
the same dimension in each S. Sometimes in the 
DAP a diagnostic hint emerges, but sometimes a 
personality need, sometimes a psychodynamic 
clue, and sometimes an expressive style of per- 
sonality, Two drawings of a person do not consti- 
tute a system in itself, but only a small facet of 
an unfolding system of responses maintaining it- 
self in contact with other systems of the pattern- 
ing.* 

Only a minority of the experts employed as 
judges in Wanderer’s study regard the DAP as a 
“primary tool.” No one uses the DAP alone. It 

_ was never intended by Machover, or anyone else, 
as anything more than a supplement, a graphic 
adjunct to the verbal techniques. Therefore, 
Wanderer’s statement, 


but one could still wonder about how much confi- 
dence could be placed in an instrument which, used 
alone, would send a large proportion of schizophren- 
ics merrily on their way, since they would be identi- 
fied as anything but schizophrenics [p. 149, italics 
supplied], 


reflects an astonishing failure to understand the 
role and utility of projective drawings in the 
battery. 

Overall, Wanderer’s study demonstrates what 
we know Clinically: that to order and integrate 
findings from the various sources of data in the 
testing situation, the clinician cannot rely solely 


4Sometimes, for example, with one schizophrenic 
the pathology comes through on one technique and 
with another on a different technique, and frequently 
only in the relationship among the techniques. This 
is, of course, the reason we employ multiple measure- 
ments from diverse modalities. 
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on any small, compartmentalized sampling. A 5- 
or 10-minute projective yield cannot, alone and 
when “blindly” interpreted, serve as the basis for 
a diagnosis.5 Interpreting a personality must be 
in terms of its inner unity and consistency across 
some degree of spread; only thereby may we, at 
times, reveal the man. 

Anything less belongs with parlor tricks. It is 
in line with this that we may view the especially 
pertinent study of Caligor (1962). He found that 
paranoid trends could be detected in only 25% 
of a group of paranoid schizophrenics when one 
drawing of a human figure was employed. He 
returned to the problem, employing a set of eight 
such drawings, and found that 85% of the cases 
could be validly detected on the basis of the more 
extensive drawing projections. 

Yet in spite of its defects—statistically, in 
terms of sampling, experimental design, and 
clinical sophistication—the impact of Wanderer’s 
study still retains some voice which speaks to 
the intellect. As scientist-practitioners, we cannot 
ignore our everyday clinical evidence, nor can 
we, on the other hand, ignore the things that go 
on inside the limitations of the tight research de- 
sign. In a field which is after truth rather than in 
much possession of it as yet, we must draw our 
working guidelines simultaneously from the micro- 
scopic analysis of bits and pieces of human be- 
havior and also from our broader experience of 
its totality. 

Clinicians may respond in one of three ways. 
They may ignore studies such as Wanderer’s. But 
that makes little sense. They may, on absorbing 
the encounter with Wanderer’s experiment, drop 
the DAP from the battery. But that would make 
little sense, too, unless they were using the DAP 
blindly, without the rest of the battery, and 
merely to establish a diagnostic category. They 
may, harmonizing the implications of Wanderer $ 
with that of Caligor’s study, resonate to the limi- 
tations of a two-drawing technique. They may 
then strengthen the projective drawing assessment 
with an expansion in width and depth. And that, 
to this clinician, makes more sense. 


5At least if the sample is of drawing products. 
Can an equally brief sample of Rorschach behavior 
or of TAT stories (say, the first two cards of the 
former, or the first story or two of the latter), SH 
of MMPI answers, or of interview do so? We don't 
know, but few, if any of us, would expect it to do 
so. 
©The writer, for example, does not employ, ° 
teach, the DAP or the H-T-P but rather an inte- 
grated drawing battery including the combined use 
of these two techniques, the H-T-P-P, both achro- 
matic and chromatic (in pencil and then again 7 
crayon), its searching Post-Drawing Interrogation 
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in order to extend previous findings, 302 


cent study (Eisenman, 1968a) found 
Barron’s (1953, 1963) view that 
y-simplicity is an important per- 
dimension. However, Eisenman 
found only modest support for the 
at complexity-simplicity can be con- 
of as a creativity variable. Since Bar- 


preference for complexity to creativity, 
lure of the Eisenman study to show 
ig support for the complexity-creativ- 
raises some problems. First, was 
really measuring creativity with 
ygon measure of complexity-simplicity 
nce? Second, is there any relationship 


plicity and Barron’s line drawings, 
j known as the Barron-Welsh Art 
WAS)? Third, do creativity measures 
hy degree of interrelationship with one 
This third point may be the crux of 
ter, since failure to obtain significant 
tions suggests that one’s various 
‘cannot be subsumed under the gen- 
“creativity.” 
Ous researchers (Barron, 1955; Eis- 
k Robinson, 1967; Getzels & Csiks- 
, 1966; Piers, 1968) have reported 
or small correlations among their 
vity measures. This lack of sub- 
elation among creativity tests has 
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CREATIVITY, AWARENESS, AND LIKING 


RUSSELL EISENMAN 1 
Temple University 


As a follow-up to a previous investigation of Barron’s creativity theory, and 


college students took an unusual uses 


test, the Barron-Welsh Art Scale (BWAS), and a polygon measure of com- 
plexity-simplicity. In addition, they rated how much they liked being in the 
study on a 1-7 scale, and stated what they thought was the purpose of the 
research. The fluency and originality scores, derived from the unusual uses 
test, showed very high intercorrelations (p’s<.001), while the BWAS and 
polygon measures were intercorrelated with each other (r=.55, p < .001) and 
with originality and fluency. Although the results seem partially due to 
method variance, the correlations suggest, in contrast to several previous 
findings, that there are creativity measures which show a reasonable degree of 
interrelationship, thereby justifying subsuming them under the construct “cre- 
ativity.” The importance of awareness and liking are also considered. 


been seized upon by Wallach and Kogan 
(1965) who have criticized the well-known 
Getzels and Jackson (1962) creativity-IQ in- 
vestigation because Getzels and Jackson’s cre- 
ativity measures correlated as highly with IQ 
tests as they did with one another. Other stud- 
ies, in addition to the ones mentioned above, 
suffer from the lack of interrelationship 
among the creativity measures utilized. One 
focus of the present study was to see if tasks 
widely employed for measuring creativity 
showed any substantial degree of intercorre- 
lation. To this end, a large number of Ss were 
tested with the BWAS, Eisenman’s polygons 
for complexity-simplicity preferences, and ex- 
pressed uses for three common objects. The 
uses test has typically been scored for origi- 
nality (infrequency of response in the sample 
tested) and for fluency (total number of valid 
responses). Evidence (Christensen, Guilford, 
& Wilson, 1957; Wallach & Kogan, 1965) sug- 
gests that common responses tend to occur 
early in the response sequence, and the more 
original responses occur later. Thus, there is a 
sort of built-in correlation between fluency 
and originality; since the person who gives 
many responses is likely to exhaust his com- 
mon responses and move into the more origi- 
nal ones. To correct for this, a fifth creativity 
score was employed which simply consisted of 
dividing number of original responses by total 
number of valid responses. The originality/ 
fluency measure gives us an “originality 
average” in much the same way that a batting 
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average is obtained in baseball via the formula 
hits/times at bat. 

Psychological researchers are becoming in- 
creasingly aware of Ss’ feelings about the 
research in which he participates, both be- 
cause Ss’ expectations may influence the re- 
sults and because of ethical concerns. Two 
such areas of “subject’s reaction” were consid- 
ered in the present study: awareness and lik- 
ing. Awareness has been frequently investi- 
gated in verbal conditioning studies, but 
greatly ignored in creativity research. It 
seemed reasonable to suppose that S’s aware- 
ness may be correlated with some performance, 
either because the aware S can better give Æ 
the results that Æ “wants,” or because aware- 
ness might reflect general brightness or in- 
terest. In summary, the present study was 
designed as a follow-up to Eisenman’s 
(1968a, 1968b) findings, and to consider as- 
pects of creativity research which have been 
neglected (awareness and liking) or which 
have often failed to receive substantiation 
(interrelationships among creativity mea- 
sures). 


METHOD 
Subjects 


The Ss were 302 students from undergraduate 
classes at Temple University. 


Procedure 


The Ss were asked to participate in a psychological 
research study. No explanation was given for the 
instruments used, other than telling Ss that E was 
interested in preferences. The testing was done in 
large groups, with care being taken to see that no 
“cheating” occurred. All questions were answered as 
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briefly as possible, with a minimum of information, 
Most queries were handled by telling Ss, “it’s up to 
you.” Typically, the unusual uses test was adminis- 
tered first, with Ss told to list all the uses they 
could think of for a brick, a paper clip, and a pencil. 
Approximately half the Ss took the BWAS second 
and the polygons third, while the other Ss received 
the polygon preference measure second and the BWAS 
third. No order effects were found. The polygons were 
on photographs, with three each of 4-, 12-, and 24- 
point random shapes, and one each of 4-, 8-, and 10- 
point symmetrical shapes. A complexity score was 
obtained by subtracting total number of points on 
three least preferred shapes from total number of 
points on three most preferred shapes. Fluency scores 
were derived from the unusual uses test by obtain- 
ing the total number of valid responses given by S, 
while originality scores were obtained by crediting $ 
with 1 point for originality every time his response 
occurred with a frequency of 1% or less among all 
responses in the present study. Scores across the 
three unusual uses items were combined for pur- 
poses of computing Pearson ’s. An assistant and Æ 
scored the unusual uses items for originality and 
fluency, and agreed on over 90% of the classifica- 
tions. Since BWAS and the polygon measure are ob- 
jectively scored, there was no need for two scorers. 

At the conclusion of the unusual uses, BWAS, and 
polygon measures, Ss rated how much they liked be- 
ing in the study, using a scale of 1-7 with 1 indi- 
cating the greatest dislike, and 7 indicating greatest 
liking. The Ss also wrote what they considered to be 
the purposes of the study, and Ss who said such 
things as “creativity,” “originality,” or “test of 
thinking” were considered aware. For computing 
correlations, aware Ss were assigned a score of 1, and 
unaware Ss a score of O. Two raters achieved 96% 
agreement. 


RESULTS 
Creativity 


Table 1 shows the intercorrelation of all 
seven variables. Fluency and originality ate ' 


TABLE 1 
INTERCORRELATION OF SEVEN VARIABLES 


Variables a) e 6 fut 
Fluency (1) 
Originality (2) ases 
Originality/Fluency (3) oo» gjess 
Complexity (4) 19** 36t ST 
Barron-Welsh Art Scale (5) 2 38*** 3oee* Sipe 
Liked the study (6) 45** 45*** 40*** —07 02 
Aware of purpose of the study (7) —03 13* 26*** 05 03 ase" 


Note.—For liking the study, Ss rated the study from 1 to 7, indicati likin ences, Si We 
classi as 0 (unaware) or 1 (aware), Decimals farehenonitel Wesel aioe a 


highly related to each other (r= .85, p< 
001). However, the correlations of the fluency 
and originality measures with BWAS and the 
complexity-simplicity polygon measure sug- 
gest that originality is more strongly associ- 
ted with these alleged creativity measures 
than is fluency. Preference for complexity is 
associated with BWAS score (r= .55, p< 
001). The major value of the “originality 
average” obtained by the formula originality/ 
fluency seems to be the reduction in the rela- 
tionship with fluency from .85 for originality 
ad fluency to .60 for originality/fluency and 
fluency. In other words, Ss obtain a high cor- 
relation of originality and fluency in part be- 
Cause the more fluent S has a greater chance 
fo be original. Even with the correction of 
dividing fluency scores by originality scores, 
here is still a healthy correlation of .60 be- 
fluency and the “originality average,” 
liggesting that original Ss are also fluent. 
Awareness and Liking 

“If the person did well on the unusual uses 
st, he tended to like the study. However, 
lis performance on BWAS and the polygons 
Was not related to his rated liking of the 
study. Awareness showed less of a relationship 
W other variables than liking, The Ss who 
Mere original showed a slight tendency to be 
ate of the nature of the study in contrast to 
4 with lower originality. Neither fluency nor 
me two preference measures (BWAS and 
Polygons) showed any relationship with 
ah eness. Awareness was associated with 
king the study (r = .45, p < .001). 


Discussion 


Ih Contrast to many previous studies, the 
Investigation yielded reasonable cor- 
tions among creativity measures, support- 
iB the belief that these measures are dealing 
k a common concept of “creativity.” How- 
a a a Strongest correlations were found be- 
uency and originality measures from 

m vusual uses test, which can be explained 
. as method variance (Campbell & 
£, 1959) since a single response leads to 
than one score. The r of .55 between 
he Complexity-simplicity preferences 
163) S that previous studies (Eisenman, 
Which used polygon preferences to 
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test extensions of Barron’s (1963) theory 
were accurate insofar as complexity, has 
some overlap with complexity s. Complexity 1 
refers to preference for complex line drawings 
in what developed into the BWAS, while com- 
plexity » is preference for polygon complexity. 
Of course, these two measures are far from 
being identical. The r of .55 accounts for less 
than one-third of the variance, and previous 
research (Eisenman & Rappaport, 1967) has 
suggested at least one way in which the BWAS 
and the polygon preference measures are dif- 
ferent. Nevertheless, there is some overlap 
which is a welcomed change in creativity re- 
search, since many previous studies have 
found less of an interrelationship. 

The patterning of the correlations with 
the BWAS and with complexity-simplicity 
preferences is highly similar. Correlations with 
BWAS are never greatly different than the 
corresponding correlations with complexity- 
simplicity, which further strengthens the be- 
lief that these two alleged creativity measures 
have much in common. 

It is probably a testimony to the subtlety 
of the preference measures that they showed 
no significant correlations with either aware- 
ness or liking. On the other hand, some Ss 
seem capable of correctly hypothesizing that 
the unusual uses test is aimed at measuring 
creativity or some kind of cognitive function. 
Perhaps the ability to figure out what the re- 
searcher is investigating reflects general 
brightness or interest. 

The answer would seem to be “yes” for all 
three questions posed in the introduction. The 
present study suggests that (a) polygon com- 
plexity-simplicity has claim as a creativity 
measure; (5) complexity-simplicity and 
BWAS are related; (c) there are creativity 
measures which show some reasonable degree 
of intercorrelation. 


REFERENCES 


Barron, F. Complexity-simplicity as a personality 
dimension. Journal of Abnormal and Social Psy- 
chology, 1953, 21, 287-297. 

Barron, F. The disposition toward originality. Jour- 
nal of Abnormal and Social Psychology, 1955, 51, 
478-485. 

Barron, F. Creativity and psychological health. 
Princeton, N. J.: Van Nostrand, 1963. 

CAMPBELL, D, T., & Fiske, D. W. Convergent and 
discriminant validation by the multitrait-multi- 


160 


method matrix. Psychological Bulletin, 1959, 56, 
81-105. 

Curistensen, P. R., Gurerorp, J. P, & Wirson, R. 
C. Relations of creative responses to working time 
and instruction. Journal of Experimentul Psychol- 
ogy, 1957, 53, 82-88. 

EIsenMAN, R. Personality and demography in com- 
plexity-simplicity. Journal of Consulting and 
Clinical Psychology, 1968, 32, 140-143. (a) 

ErsenMan, R. The psychology uf modern art. In G. 
P, Powers & W. Baskin (Eds.), New outlooks in 
psychology. New York: Philosophical Library, 
1968. (b) 

Ersenman, R., & Rappaport, J. Complexity pref- 
erence and semantic differential ratings of com- 
plexity-simplicity and symmetry-asymmetry. Psy- 
chonomic Science, 1967, 7, 147-148. 


RUSSELL EISENMAN 


E1senMan, R., & Rosson, N. Complexity-simplic- 
ity, creativity, intelligence, and other correlates, 
Journal of Psychology, 1967, 67, 331-334. 

Gerzers, J. W., & CsIKszeNTMIHALYT, M. The study 
of creativity in future artists: The criterion prob- 
lem. In O. J. Harvey (Ed.), Experience, structure 
and adaptability. New York: Springer, 1966, 

Gerzets, J. W., & Jackson, P. W. Creativity and 
intelligence. New York: Wiley, 1962. 

Prers, E. V. Adolescent creativity. In J. F. Adams 
(Ed.), Understanding adolescence. Boston: Allyn & 
Bacon, 1968. 

Wattacn, M. A., & Kocan, N. Modes of thinking in 
young children. New York: Holt, Rinehart & 
Winston, 1965. 


(Received March 11, 1968) 


and Clinical Psychology 
161-165 
REPRESSION-SENSITIZATION: 


AMONG COLLEGE STUDENTS + 
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This study investigated the variables of adjustment and voluntarily approaching 
‘psychotherapy as each relates to repression-sensitization (R-S). A group of 
college males and females who sought psychotherapy (T) were compared on the 
= R-S scale with a similar group which had not sought psychotherapy (NT), The 
© results indicate that the T group scored significantly higher on the R-S scale 
than the NT group. In addition, the T group was found to be more sensitizing 
than the NT group even when the two groups were equated on a paper-and- 


ELATION TO ADJUSTMENT AND SEEKING PSYCHOTHERAPY 


4 ing in therapy. 
Bri 


cant point of interest in research on 
tization (R-S) concerns the 
of this personality construct to 
ent. The R-S dimension is presumably 
Of an individual’s tendency to ap- 
OF avoid stress (Byrne, 1964). Ini- 
expected that scoring at either 
the R-S dimension would be in- 
maladjustment (Byrne, 1961). 
a flurry of studies relating the 
(Byrne, 1961) and the revised R-S 
, Barry, & Nelson, 1963) to 
d-pencil measures of adjustment 
61; Byrne et al., 1963; Byrne, 
Shefñeld, 1965; Joy, 1963; Tucky 
1964). In general, however, these 
Suggest that repressors are bet- 
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HA pencil measure of adjustment. Among males in the T group, those terminating 
= therapy after fewer than four sessions were less sensitizing than those continu- 


sure of adjustment. Second, repressors are 
likely to deny that they have problems and 
thus appear well adjusted on a paper-and- 
pencil measure. 

In order to correct for the above limita- 
tions in assessing adjustment, Feder (1967) 
studied hospitalized psychiatric patients, and 
Tempone and Lamb (1967) studied patients 
at a mental health center. Both studies found 
that sensitizers tended to be in the psychiatric 
group and repressors in the control-adjusted 
group. The present study attempts to ascer- 
tain if a similar pattern exists in a population 
of college students seeking psychotherapy at a 
university outpatient mental hygiene clinic. 

It is likely, however, that psychiatric out- 
patients are not only more maladjusted than 
controls but also respond to their problems 
differently. Therefore, one person may seek 
psychotherapy, while another, equally mal- 
adjusted person, does not. People seeking 
therapy are, in effect, approaching their prob- 
lems and should score toward the sensitiza- 
tion end of the R-S dimension. The present 
study investigates the R-S dimension in such 
a population while controlling for adjustment. 

Related to the issue of R-S and adjustment 
is the question of what the R-S scale really 
measures. A correlation of .87 between the R-S 
scale (a high score indicates sensitization) and 
the Taylor Manifest Anxiety Scale led Golin, 
Herron, Takota, and Reineck (1967) to sug- 
gest that the two tests may be interchange- 
able. Lower but clearly significant correla- 
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tions are reported between the R-S scale, and 
Social Desirability and Acquiescence Response 
Set (Feder, 1967). Also, Lefcourt (1966) re- 
ported that Ss scoring low on the R-S scale 
interpreted the scale as a measure of mental 
illness, whereas sensitizers perceived the scale 
as an indicator of honesty with one’s self. 
Thus, Lefcourt writes, 


the question may be raised as to whether the R-S 
scale research is at all involved in the same area 
of interest as studies dealing with behavior refer- 
rants of repression, isolation, or sensitization [1966, 
p. 448]. 


It appears that the behavioral referrants of 
the R-S scale are under question despite the 
historical roots of the scale (Bruner & Post- 
man, 1947). This study examines the behavior 
of approaching personal problems by seeking 
therapy in relation to the R-S scale. Since the 
R-S scale appears to be related to adjust- 
ment, this study attempts to equate Therapy 
(T) and Nontherapy (NT) Ss for adjust- 
ment and then determine if T Ss score higher 
than NT Ss on the R-S scale. The present 
study also explores the relationship between 
R-S and early termination of psychotherapy. 

With a college population, including stu- 
dents who voluntarily sought psychotherapy, 
the following predictions are examined: 

1, Ss in the T group are expected to score 
higher on the R-S scale (sensitization) than 
Ss in the NT group: 

2. With adjustment controlled by matching 
Ss, the T group is still expected to score 
higher on the R-S scale than the NT group. 

3. T Ss who terminate before the fourth 
session will score lower on the R-S scale than 
T Ss who continue in therapy for a larger 
number of meetings. 

Because of their significance in this study 
and in previous research (Byrne et al., 1965; 
Joy, 1963), the correlations between the R-S 
scale and the CPI scales will also be reported. 


METHOD 


All the college undergraduate students who volun- 
tarily sought psychotherapy at a university mental 
hygiene clinic during approximately a 3-month 
period were asked to participate in the study, pro- 
viding they had not received psychotherapy during 
the previous 2 years. Of those asked to participate, 
approximately 75% completed the testing giving a 
T group of 63 Ss (27 males and 36 females). 
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The Ss for the NT group (V=112; 61 males, $1 
females) were obtained from introductory psycho). 
ogy and personal adjustment classes, excluding thoy 
who had received psychotherapy during the 2 previ- 
ous years. Eighty-two percent of the NT group and 
73% of the T group were either freshmen or sopho- 
mores. The remaining students in each group wer 
either juniors or seniors. 

The CPI and the Minnesota Multiphasic Person. 
ality Inventory (MMPI) were completed by Ss un- 
der the usual conditions of written instructions, All 
Ss in the T group completed the testing before their 
second therapy session. The CPI was employed as a 
measure of adjustment and the R-S scale (Byrne e 
al., 1963) was obtained from the MMPI. Although 
there is some item overlap between the R-S scale and 
nearly all of the CPI scales, this overlap would tend 
to make the T and NT groups similar on the R-S 
scale, not different as predicted. 

For the analysis having to do with the number of 
therapy sessions attended, the figures excluded the 
intake session which is a brief diagnostic interview 
and often results in assignment to another therapist, 
The criterion for early termination was attendance 
at no more than three therapy sessions. It was 
reasoned that a satisfactory resolution of the clients’ 
problems would not likely occur in fewer than four 
therapy sessions, 


RESULTS 


Prior to presenting the results pertaining to 
the hypotheses, the relationship between the 
R-S and the CPI scales is reported in Table 
1. As may be seen, the R-S scale is signifi 
cantly and negatively correlated with 15 of 
the 18 CPI scales for both the T and NT 
groups. However, certain problems, discuss 
in the next section, render these findings diff 
cult to interpret. E 

The first hypothesis called for a comparison 
of the T and NT groups on the R-S scale. The} 
T group obtained a mean R-S score of 62.03; 
which, when compared with the mean A 
score of 42.43 for the NT group, gives a $ 0 
5.49 (p < .001, one-tailed). Thus the T grou? 
is significantly more sensitizing than the 
group. No sex differences in R-S scores were 
observed in the T group (f = .91) or in the 
NT group (¢ = .32). 

The second prediction was that the T grou? 
would score higher on the R-S scale with a 
justment controlled, To equate for adjust 
ment, as many Ss as possible in the T group 
were matched with an S in the NT group 0% 
the basis of an identical CPI scale score. T 
was done separately for each of the 18 cP 


TABLE 1 


CORRELATIONS BETWEEN THE R-S anp CPI 
SCALES FOR THE T AND NT Groups 


CPI scales T group | NT group 
— 48** ‘= 45 
—.14 +.05 
—.33** | —.27** 
— 48% | —.39** 
—.54** | —.51** 
—.26* —.34* 
ant 67** = .61** 
—.45%* | —.39%* 
— 15 —.12 
—.28* —.39* 
pee 76** =. 78** 
—76** —,.64** 
= .65** =. .68** 
—.70** | —.67** 
on 66** BS .66"* 
—.12 —.04 
—.55** | —.40** 
—.38%* | —.41°* 


ote.—p levels two-tailed. „ N = 63; 
NT group, ees aro: ailed. For the T group, N = 63; 


9 <o. 


males.’ The results of the matched-paired 
mparisons, reported in Table 2, show sig- 
ant differences between the T and NT 
Mps on R-S for all 18 CPI scales and sup- 
the second hypothesis. Thus the T group, 
comparison with the NT group, tends to- 
td sensitization even when the groups are 
uated on a measure of adjustment. These 
pong differences occurred despite many in- 
ances of item overlap between the R-S scale 
ind the CPI, 

The third hypothesis concerns the relation- 
between R-S and early termination of 
hotherapy. Because males and females 
Y view early termination differently, com- 
Sons were made for each sex. Table 3 
Ows that among males early terminators 
a less toward sensitization than the re- 
ünder of the T group. No significant dif- 
hy ce was found among the female Ss, al- 
4 a similar trend is apparent. 

sponte the large S pool in the NT group, it was 
feat: to find a match for every TS. Over the 
tefore ear a number of Ss with no match and 
19 with aed in the analysis, ranges from 3 
ean of 9.6. In over half of these 


Ber appropriate match was found but the S 
Np, ly been matched with another S in the T 
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The significant negative correlation between 
the R-S and most of the CPI scales is similar 
to findings reported by Byrne et al. (1965) 
and Joy (1963). The Ss scoring toward sensi- 
tization tend to score lower on many of the 
CPI scales. However, the particular interpre- 
tation of this apparently predictable observa- 
tions is not so clear. As Byrne et al. (1965) 
pointed out, repressors might well appear bet- 
ter adjusted on a paper-and-pencil measure 
because they tend to deny that they have 
problems rather than because they are better 
adjusted. A second difficulty in interpreting 
these correlations, and one not mentioned by 
Byrne, is the item overlap between the R-S 
and most of the CPI scales. The extent of 
item overlap ranges as high as 9 of 31 items 
on the CPI Tolerance scale. Those scales 
having the greatest item overlap generally 
show high correlations with R-S (e.g., Sense 


TABLE 2 


Comparisons OF THE T Group WITH THE NT Group 
ON THE R-S SCALE WHEN Ss ARE 
MATCHED ON THE CPI Scares 


CPI Scale 


Sociability 
Femininity 
Dominance 
Achievement via 
Independence 
Capacity for Status 
Communality 
Self-Control 
Psychological- 
Mindedness 
Self-Acceptance 
Responsibility 
Sense of Well-Being 
Intellectual 
Efficiency 
Achievement via 
Conformity 
Tolerance 
Good Impression 
Flexibility 
Social Presence 
Socialization 


Mean R-S 

Num-| scores 

ber of 

pairs} T | NT 

group | group 

44 | 58.9 | 46.6 
55 | 62.8 | 40.9 
54 | 58.8 | 44.2 
55 | 60.3 | 42.7 
60 | 61.9 | 46.6 
60 | 62.5 | 43.6 
48 | 58.2 | 40.1 
59 | 61.2 | 42.4 
59 | 61.0 | 42.9 
56 | 59.1 | 45.6 
50 | 55.8 | 45.7 
49 | 56.8 | 45.2 
58 | 59.7 | 43.9 
52 | 61.3 | 49.8 
59 | 62.6 | 44.1 
54 | 64.1 | 40.3 
48 | 57.7 | 46.1 
47 | 60.4 | 49.1 
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s one-tailed. 
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TABLE 3 


Comparison or THERAPY Ss WHO ATTENDED THREE 
or Less SESSIONS WITH THOSE ATTENDING 
Four or More SESSIONS ON THE 
R-S Scare 


Number of sessions 


group 0-3 4 or more t 
N M N M 
Males 8 55.12 19 69.42 1.72* 
Females | 11 54.73 25 61.84 1.17 


*p < .05, one-tailed. 


of Well-Being, 12 of 44; Social Presence, 13 
of 56). Furthermore, most of the overlap 
items have opposite scorings on the R-S scale 
and the CPI; for example, those items scored 
for a true response on the R-S scale are 
nearly all scored for a false response on the 
CPI. This pattern of scoring would lead one to 
expect the negative correlations which were 
observed (see Table 1). 

Consistent with the findings of Feder 
(1967) and Tempone and Lamb (1967), this 
study indicates that R-S is related to a broad 
non-paper-and-pencil measure of adjustment. 
Thus people seeking psychotherapy at a uni- 
versity outpatient mental health clinic tend 
toward sensitization in comparison with a 
comparable control group not seeking psycho- 
therapy. In addition, however, the findings of 
the present study strongly suggest that the 
R-S scale is more than a measure of adjust- 
ment. Even when T and NT Ss were equated 
for adjustment on the CPI, Ss in the T group 
tended to be sensitizers and Ss in the NT 
group tended to be repressors. As evidenced 
by the behavior of seeking therapy, this study 
suggests that the R-S scale does have the be- 
havioral referrants, such as avoidance and 
sensitization, which have been questioned by 
Lefcourt (1966). Perhaps the R-S scale mea- 
sure “adjustment” as well as the tendency to 
approach or avoid stress, Such a relationship 
does not make the terms interchangeable and 
certainly does not reduce the value of the 


*Since considerable item overlap exists between 
the 18 CPI scales, they may not be seen as inde- 
pendent measures, 
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R-S concept. When indicated, future research 
with the R-S scale might profitably contro 
for adjustment to insure that observed dit. 
ferences are not a function of adjustment 
variables tapped by the R-S scale. 

The present study observed no sex differ 
ences in R-S in either the T or NT groups 
Most previous studies report similar finding 
or simply do not raise the question. This is 
reflected in a review article by Byrne (1964) 
in which no section is given to sex differences 
on the R-S dimension. However, the present 
study did obtain different results for males 
than females as regards the relationship bef 
tween R-S and early termination of psycho, 
therapy. Merbaum and Badia (1967) ant 
Mendelsohn and Griswold (1967), in recent 
studies with the R-S dimension, also obtained 
different results for males than females. Thes 
studies suggest that the assumption of no st 
differences on the R-S scale may be rathet 
tenuous, perhaps depending upon the othe 
variables in the investigation. 

The finding that males who terminati 
counseling after three or fewer sessions tend 
to score lower on the R-S scale is subject 
at least two interpretations. First, the rela 
tionship between adjustment and R-S allo 
for the possibility that many Ss staying Wi 
therapy are simply more maladjusted than th 
early terminators. It was possible to get som 
information about this by comparing th 
males in the T group who attended three 0 
less therapy sessions with those attending fol! 
or more sessions on those CPI scales whid 
have no more than one item overlapping Wl 
the R-S scale. The ¢ tests (two-tailed) indi 
cate that the two groups do not differ 3 
preciably on the Responsibility, Communa 
ity, and Achievement via Independence sca'® 
However, the early terminators show bet 
adjustment on the Flexibility scale (t = 2: 
p < 02) and on the Psychological-Mindeda 
scale (t = 2.04, p < .10). The second pos 
bility is that, since sensitizers tend to a 
proach their problems, they are more likely t4 
pursue their problems in therapy. Future 
search should investigate the relationship ? 
tween R-S and premature termination W^ 
controlling for adjustment. Premature tea l 
nation should be measured in a way W? 
reliably assesses a client’s retreat from psy" 


if repressors tend to avoid con- 


onal problems, future research 
itably explore the relationship be- 
d progress in therapy. 
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SEXUAL ASPECTS OF TAT ADMINISTRATION 
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This study investigated the frequency of administration of four sexual-romantic 
TAT cards by 20 male and six female graduate student examiners to clinic 
clients. Male examiners gave more of these cards to female clients than to male 
clients (p= .01) but female examiners did not make such discriminations. The 
results confirm Schafer’s observation that the examiner’s voyeuristic needs may 
be gratified through testing and parallel the findings of Rosenthal that male Es 
behave differently to female Ss than to male Ss, 


While it might be comforting to believe 
that the psychological examiner (E) is a 
perfectly standardized instrument, never per- 
mitting any personal aspect of himself to in- 
trude in the testing situation, evidence col- 
lected in recent years shows beyond any doubt 
that the E-S relationship is witness to most 
of the vicissitudes of any two-person inter- 
action. The personality, needs, and expecta- 
tions of the psychologist have been shown to 
influence the administration (Masling, 1959), 
scoring and interpretation (Haase, 1955; 
Masling, 1957), and eliciting of responses 
(Masling, 1965; Turner & Coleman, 1962) of 
psychological tests. 

The importance of the sex of the E-S pair 
has been well described in the psychological 
experiment (Rosenthal, 1966, 1967), but has 
not been given adequate attention in the 
testing situation. Only one aspect of the sexual 
situation has been investigated—the influence 
of the sex of the E on Ss’ projective’ test 
responses—with mixed results: three studies 
reported no differences (Alden & Benton, 
1951; Garfield, Blek, & Melker, 1952; Holtz- 
man, 1952), one study reported significant 
differences (Clark, 1952), and one study re- 
ported that sexual differences sometimes oc- 
curred and sometimes did not (Rabin, Nelson, 
& Clark, 1954). Perhaps the most penetrating 
analysis of the sexual aspects of testing has 
been provided by Schafer (1954) in his dis- 
cussion of the voyeuristic needs of E: 


The tester is in the position of a psychological 
voyeur. He peeps into the interior of many individu- 
als and never once commits himself, as would be 


1 Requests for reprints should be sent to Joseph 
M. Masling, Psychological Clinic, State University 
of New York at Buffalo, Buffalo, New York 14214. 
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required under normal conditions, to a relationship, 
... Going further, our knowledge of primitive, 
affect-laden levels of thinking suggests that psycho- 
logical voyeurism may be unconsciously elaborated 
as an act of hostile, sexual intrusion. That is to say, 
even if the tester did not, to begin with, choose 
testing in part as a more or less sublimated outlet 
for his infantile voyeuristic inclinations, these inclina- 
tions may well seize on the looking-in-secret aspects 
of testing and thus may invade the professional role 
[p. 21], 


One possible peeping situation arises when 
Ss are confronted with sexual associations’ 
suggested by projective test stimuli. Although 
strangers do not ordinarily discuss sexual 
matters with each other, particularly with 
someone of the opposite sex (Benney, Ries- 
man, & Star, 1956), E is in a position of 
forcing an S to look at and talk about sexual 
stimuli. 

The selection of a battery of tests is ordi- 
narily determined by the individual needs | 
of the client, the background and experience | 
of the clinician, and the time available for the ; 
diagnostic session. Within these broad limits | 
the clinician has a good deal of freedom to | 
choose the testing instruments he will use, 
and if he uses the TAT he has the option of 
selecting among 31 cards in the TAT set. The 
selection of particular TAT cards, therefore, 
allows E to project his own needs into the 
testing situation. If there is a voyeuristic need 
in some Zs, or a need to be sexually forward 
or provocative, one would expect that there | 
would be a nonrandom use of particular TAT 
cards, those portraying obvious sexual 0f 
romantic themes, as a function of sex and 
age of E and S. F 

The following hypotheses were tested in 
this study: 


SR a 


= 


1. When an Æ tests a male S and a female 
She will use more TAT cards with obvious 
sexual and romantic themes for the cross-sex S 


than the same-sex S. 
2. The Ss in a cross-sex testing situation 
vill be given more TAT cards with obvious 


sexual and romantic themes than Ss in a 


same-sex testing situation. 


METHOD 

Examiners 
The Es were 20 male and six female graduate 
dudents in clinical psychology, all at the internship 
or postinternship level, working for the Psychological 
Clinic of the State University of New York at 
Buffalo. The clinic files for the period between Febru- 
iry 1958 and November 1963 were inspected and all 
idministrations of the TAT were recorded. Nine of 
male Es and five of the female Es tested both 
imale and female S, thus allowing a separate analy- 
i$ of the data in which each Æ could serve as his 

own control. 


Subjects 


“The test Ss were clients requesting psychological 
eivices from the clinic. They represented the usual 
lange of symptomatology seen at a University clinic, 
ftom transitory emotional problems to severe neurotic 
iis lurbances. Only the test records of Ss who were 
approximately the same age as the Es were used. 


TAT Cards 


“There are only five cards in the TAT set which 
jrtray the interaction of a male and female of 
bout the same age, and of these one card, 6 GF, 
labeled primarily for use with female Ss and was 

fore not considered in this study. The cards 
ised were 2, 4, 10, and 13 MF: 2 portrays a young 
itl holding school books and a young farm worker, 
a to the waist; 4 shows a young woman trying 

letain a young man from leaving her; 10 presents 
ie of a couple, apparently in an embrace; 

F shows a woman in bed, nude to the waist, 


ie young man in the foreground covering 


Treatment of the Data 


l 
Wa was kept of the number of times cards 
A A 13 MF were administered; the sex of 

A and the total number of TAT cards used. 

minister, 1 was tested using the,14 Es who- had 

ithe the TAT to both a male and female S. 

2 was tested using one case from all 26 
ina sample; where Es had more than one 
sis, € case the selection was made on a random 


RESULTS 


mi 28% each Æ being used as his own 
ol, are presented in Table 1. In no cir- 
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TABLE 1 
Use or TAT Carns 2, 4, 10, ann 13 MF BY 
Mate AND FEMALE EXAMINERS WHEN 


Tuey Test ONE MALE AND ONE 
FEMALE SUBJECT 


Erami More cards | More cards | Same num- 
ET | to male Ss |to female Ss | ber to each 
Male 0 5 4 
Female 1 1 3 
Note.—N = 14. 


cumstance did a male Æ give more of the 
sexual-romantic cards to a male § than to a 
female S, while four male Zs gave the same 
number of cards to the male and female Ss. 
If the tied instances are disregarded, a bi- 
nomial expansion (Siegel, 1956) for the likeli- 
hood that all five nontied events would be 
found in the male Z-female S$ cell produced 
p= .03. There was no significant difference 
in the distribution of the five female Es re- 
garding card choice. The data for the male 
Es suggest confirmation of Hypothesis 1 
when the tied cases are not considered, but 
no support was found for the female Es. 

Hypothesis 2 similarly is supported for the 
male Es but not for the female Es. Table 2 
shows the distribution of the four sexual- 
romantic cards to male and female Ss by male 
and female Es. Male Es clearly gave more 
of these cards to female Ss than to male Ss 
(p= .01, Fisher exact test, Siegel, 1956), 
whereas the female Es treated male and 
female Ss alike. 

To test for the possibility that this distribu- 
tion of TAT cards was primarily a function 
of the larger number of cards used by the 
male Es, correlations were computed between 


TABLE 2 
Use or TAT Carns 2, 4, 10, anD 13 MF 


By male examiners* | By female examiners 


Cards x r 
Given to | Given to | Given to | Given to 
male Ss | female Ss | male Ss | female Ss 

3or4 f 9 1 1 

lor2 7 3 2 2 


Note—For male examiners, N = 20; for female examiners, 


N =6. 
* = .01 (Fisher exact test). 
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the total number of cards administered and 
number of sexual-romantic cards used. For the 
male Es, the correlations were —.20 for 
the male Ss and .16 for the female Ss; for the 
female Es, the correlations were .00 for 
the male Ss and .11 for the female Ss. None 
of these correlations was significantly greater 
than. zero. As a further test, Hypothesis 2 
was evaluated by converting the number of 
sexual-romantic cards each Æ administered 
into a proportion of all TAT cards he used; 
this measure produced results identical to that 
reported in Table 1: Fisher exact test = .01 
for the male Es and no significant differences 
for the female Es. 
Inspection of the use of the four sexual- 
romantic cards showed that the significant 
difference in Table 2 was caused by male Es 
giving Cards 4 and 10 more frequently to the 
female Ss than male Ss. The Fisher exact test 
computed on these data showed p = .06 for 
Card 4 and p = .05 for Card 10. While the 
male Es varied in the use of these cards as 
a function of the sex of S, the female Es 
showed a different pattern: with one excep- 
tion, they either gave the card to all Ss or 
they gave them to none. Evidently, the female 
Ss were concerned almost exclusively with the 
stimulus quality of the TAT cards, while the 
male Es seemed to have one eye on the card 
and the other on the sex of the S. Card 13 
MF, the most obviously sexual of all the 
TAT set, did not discriminate because it was 
so widely used by all Es. 
Ifthe male E-female S relationship contains 
„some element of sexual play, there would be 
“a temptation for the male Æ to prolong the 
session by administering more TAT cards to 
female Ss than to male Ss, Table 3 shows a 


TABLE 3 
TOTAL NUMBER Or TAT CARDS ADMINISTERED 


By male examiners* | By female examiners 


Cards 
Given to | Given to | Given to | Given to 
male Ss | female Ss} male Ss | female Ss 
11 or more 2 7 0 0 
0 to 10 6 5 3 3 


Note.—For male examiners, N = 20; for female examiners, 


=6. 
*p = .16 (Fisher exact test). 
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distribution of total number of TAT cards 
used by male and female Es. As was found 
in the previous analyses, the female Es treated 
male Ss and female Ss equally, but there was 
a nonsignificant trend for the male Es to give 
more cards to female Ss than to males 
(p = .16, Fisher exact test, Siegel, 1956), 


Discussion 


In general, this study showed that female 
clinicians were extremely task-oriented, while 
the males were distracted somewhat by more 
personal considerations. The observation of 
Schafer (1954) that Zs may use the testing 
situation to gratify voyeuristic needs was 
confirmed for males. The report of Kinsey, 
Pomeroy, Martin, and Gebhard (1953) that 
voyeurism is almost exclusively a male activ- 
ity was also supported here. The results also 
parallel the conclusion reached by Rosenthal 
(1967) based on his investigation of the 
psychological experiment that male Es pay | 
more attention to the sex of S than do 
female Es, 

Unfortunately, the use of internship level 
graduate student Zs limits the confidence with 
which these findings can be generalized to 
older clinicians, although the sample size 1s 
much larger than those commonly employed 
in experiments in this area. Mature, experi- 
enced clinicians may never behave like this, 
or scarcely ever. But apart from the use of 
graduate students, this study sampled a genu- 
ine clinical situation—real clients were seen 
in a real clinic for important problems. The 
results suggest that the observation of some 
clients (and others) that psychologists (male) | 
think about sex a great deal may not be | 
totally in error. 
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ANALITY AND TIME 


TUPPER F, PETTIT? 
New York University 


A Time Scale (TS) of 40 items was constructed to ascertain the ways in which 
time may have special meaning and/or may serve to organize an individual’s 
experience. The TS and other scales were administered to 91 Ss. The TS had 
significant correlations of .643 with the Composite Anality Scale, of .507 with 
the Grygier Anality Scale, and of —.316 with the Grygier Spontaneity Scale. 
All correlations were in the predicted direction. The results were interpreted as 
in accord with the psychoanalytic formulation of the anal character, which 
makes it possible to relate the intensity of certain character traits to an indi- 
vidual’s estimation of the importance of time. 


Most investigators (e.g., Bell, 1966; du 
Preez, 1967) have concentrated on time as a 
physical dimension, studying such variables 
as estimation of time intervals, speed of tap- 
ping, and the like. In these experiments it 
seems to be generally recognized that there 
are individual differences which give rise to 
differences in time estimation, but these di- 
mensions have yet to be specified in any 
systematic way. 

A fruitful approach to understanding the 
importance of time as a psychological dimen- 
sion may be to relate an individual’s sense 
of time to his relatively enduring character 
traits, In a recent book, Cohen (1967) men- 
tioned briefly the value which can accrue to 
time, some people being parsimonious with 
it and others being profligate with it. In these 
or other circumstances time may have a sym- 
bolic meaning. Such an explanation has been 
advanced in psychoanalysis with reference to 
the anal character, although Cohen dismisses 
this as “interesting speculation” which is 
difficult to verify. 


The Anal Character 


In “Character and Anal Erotism” Freud 
(1962) first revealed his observation that a 
certain cluster of character traits are fre- 
quently found in one individual. These traits 
are orderliness, parsimony, and obstinacy. He 
observed that the appearance of this triad 


1The author wishes to thank Irving Sarnoff and 
M. E. Tresselt for their assistance in the preparation 
of this paper. Requests for reprints should be sent 
to Tupper F. Pettit, 100 Bleecker Street, No. 14C, 
New York, New York 10012. 


of character traits in an individual coincided 
with the disappearance of his anal erotism. 

Freud conceived of the dynamics that give 
tise to such a triad in the following way. If 
a conflict occurs between the mother and 
child over toilet training, competition arises 
between the two. The only weapon the child 
has at his disposal against the all-powerful 
mother is his feces which both he and his 
mother regard as having value. Given these 
circumstances, the accumulation of feces pro- 
vides the infant with the dual satisfaction 
of a pleasurable tension and defiance of the 
mother. The control of his sphincter, both 
holding on and letting go, is the simultaneous 
control over his pleasurable tension states 
and an exertion of autonomy. The situation 
is exacerbated as the mother comes to place 
more demands on her child for both produc- 
tion and control. 

In this case, an inordinate amount of 
libidinal energy becomes fixated at the anal 
stage of development. But since there is little 
satisfaction to be gained for anal erotism mMm 
our culture, the child develops a reaction 
formation against his anal erotic desires and 
the anal products during the latency period. 
The reaction formation affects both feelings 
about anal activities and character traits, an! 
the two are inextricably linked. The original 
Pleasure and satisfaction associated W! 
anal activities are replaced by feelings of 
shame and disgust. Consequently, the tri 
of character traits—orderliness, parsimony, 
and obstinacy—become prominent in the 
personality. 

Since the anal character was first postu- 
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ated, many psychoanalytic writers have sub- 

' stantiated Freud’s notions with idiographic 
data. An aspect which Freud originally com- 
mented on, and which has received a good 
deal of attention, is the anal character’s con- 
cept of money. To him, money is a sym- 
bolic representation of the valued feces and, 
as such, is dealt with in the same irrational 
way: to be accumulated and hoarded and 
occasionally be spent profligately. 

Another aspect which has received some- 
what less attention is the meaning that time 
Tas for the anal character. In many ways time 

takes on the same irrational, symbolic mean- 


(1946) wrote that in compulsion 
oses the patient’s relation to time, like 
‘his relation to money, is “an arena wherein 
i instinctual conflicts are fought out [p. 
282]. Abraham (1953) noted that “many 

rotics are continually worrying over waste 
‘Oi time... . The patients often save time 
Ona small scale and waste it on a great one 


-[p. 384).” 


ainiscent of Freud’s characterization of the 
al pleasure a child attains from the in- 
Gteased bowel tension and control over reduc- 
Hon of that tension: a person who lacks 
Punctuality may “postpone an activity in 
er to create a tension which will be a 
ce of satisfaction [p. 290].” 

x n what way would the traits which are 
Ander consideration make themselves manifest 
| behavior, particularly in regard to time? 
ng Freud’s triad, it is expected that a 
person with anal character traits would try 
) order his experiences so that they are 
7 ed, well delineated, and sharply fo- 
as opposed to being diffuse and 
Herentiated. Time can become very im- 
ìt in ordering experiences by providing 


important, the anal character should dis- 
Spontaneous experiences and surprises, 
ei are, after all, a matter of timing. 

J addition, the anal character should be 
simonious in the way he uses time and 
€ about giving any of it up, that is, 
g it. Jones (1961) observed that 
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the concept of time is, because of the sense of value 
attaching to it, an unconscious equivalent of excre- 
tory product. . . . People of this type are particu- 
larly sensitive about their time being taken up 
against their will, and they insist in every way on 
being master of their own time [p. 420]. 


Schlesinger (1963) did a factor analysis on 
154 items which had been used in various 
anality questionnaires. She extracted 12 fac- 
tors. These factors and the resulting intercor- 
relations of factor loadings are in agreement 
with and tend to confirm theoretical expecta- 
tions and clinical experience. The 12 factors 
are as follows: 


I: Responsibility in dealing with others 
II: 


Regularity and meticulousness 
III: Retentiveness as a style of life 
IV: Obstinacy 


V: Rigidity or intellectual curiosity 

VI: Frugality 

VII: Concern about dirt and contamina- 
tion 

VIII: Orderliness 

IX: Self-righteous hostility and competi- 

tiveness 

X: Anxiety over possible loss of control 

XI: Sensitivity to smells 

XII: Retentiveness in relation to posses- 
sions 


Rosenwald, Mendelsohn, Fontana, and 
Portz (1966) took poor performance on a 
hand-eye coordination task in an odorous, 
dirty medium reminiscent of fecal matter as 
an indication of ineffective defenses against 
anal impulses. They also found significant cor- 
relations between ineffective defenses against 
anal stimuli and other traits thought charac- 
teristic of anal characters (namely, in- 
decisiveness and selective sensitivity for anal 
concepts). 

The purpose of this investigation was two- 
fold: to develop a questionnaire which gets 
at the meaning that time has for individuals 
and the importance of time as a factor in con- 
trolling experiences; and to test the psycho- 
analytically derived hypothesis that the inten- 
sity of an individual’s anal character traits 
is positively related to his estimation of the 
importance of time. 
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METHOD 
Time Scale (TS) 


A time questionnaire was constructed which con- 
sisted initially of 76 items. Items were chosen which 
seemed in keeping with the theoretical formulation 
previously developed. Several questions were taken 
directly or in modified form from an 85-item ques- 
tionnaire developed by Loehlin (1959). The final TS 
consisted of such items as: I feel that I am more 
conscientious about being on time than most people 
I know; I often don’t know what the date is; 
It doesn’t bother me if the teacher is late to class; 
and Time is money. 


Composite Anality Scale (CAS) 


Two anality questionnaires were used, neither of 
which contained items having explicit reference to 
time. The first was developed for this investiga- 
tion from the study by Schlesinger. The items were 
selected simply by picking the items which loaded 
each of her 12 factors the most heavily. This resulted 
in the CAS, which consisted of 66 items, such as: 
You accumulate a great many things because you 
rarely throw anything away; When I see a messy 
room I have a strong impulse to straighten it out; 
I use more toilet paper than is necessary; and I feel 
one of the most embarrassing things people can do 
is break wind. 


Grygier Anality Scale (GAS) 


The second anality questionnaire consisted of four 
subscales (Ad, Ah, Ac, and Aa) containing 41 items 
taken from the Dynamic Personality Inventory de- 
veloped by Grygier (1956). The entire inventory 
consists of 325 items and was constructed within the 
theoretical framework of psychoanalysis. There are 
many scales for measuring patterns of psychosexual 
development, masculine and feminine identification, 
etc. The four subscales used to form the GAS were 
Ad, which, according to Grygier, measures “attention 
to details; orderliness, conscientiousness and perfec- 
tionism”; Ah, which measures “hoarding behavior, 
anxious possessiveness, and stubborn, clinging per- 
sistence”; Ac, which measures “conservatism, rigidity, 
tendency to stick to routine”; and Aa, which measures 
“submissiveness to authority and order.” 


TABLE 1 
INTERCORRELATIONS FOR ALL Supyects 


Composite Grygier 
Analy | Analis | Sponta- 
Scale” | Seale” gay. om 


643%" 


Time $ S07 —.316* «052 
Composite Anality 570 —.201 -113 
Grygier Anality — —.405  —.052 
Grygier Spontaneity —.186 
Note.—N = 91. 
*p <.01. 
** p < 001 
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The Grygier scales are administered by asking § 
to indicate whether he “likes” or “dislikes” a certain 
thing. Some of the questions included in this study 
were whether S likes or dislikes: Perfect order and 
symmetry (Ad) ; Hanging onto old clothes you never 
wear (Ah); Keeping out of trouble as much as you 
can (Ac); and Patriotism as the highest ideal (Aa), 
Several recent studies (Bishop, 1967; Gordon, 1966, 
1967; Rapaport, 1963) have yielded evidence in 
support of the validity of Grygier’s anality scales, 


Grygier Spontaneity Scale (GSS) 


The final scale used in this study was the Grygier Oi 
Scale, which measures “impulsiveness, changeability, 
spontaneity, speed of reaction, emotional expressive- 
ness.” It is this type of experience and emotional 
expression that the anal character guards against by 
relying heavily on the time dimension to structure 
his life, since it is often a loss of the sense of time 
that allows one to be surprised or to react spontane- 
ously. Thus, high scores on the time and anality 
scales should go hand in hand with low scores on 
the GSS. 

The GSS consisted of 12 items, three of them 
appearing simultaneously in the GAS. The scale 
included questions regarding whether S liked or dis- 
liked: Smoking; Acting on the spur of the moment; 
and Spending money lavishly. 

These four questionnaires were administered to 
91 voluntary, undergraduate psychology students in 
the following order: TS, GAS and GSS combined, 
and CAS. Thirty-seven Ss were males, and 54 were 
females. 


RESULTS 


The TS was refined by dropping those items 
with the lowest item-test correlation. This was 
done in four steps, with the item-test correla- 
tions for the remaining items being recom- 
puted between each step. All statistics Te- 
Ported in this paper are for the final, 40-item 
version of the TS. The reliability for it, as 
calculated by Kuder-Richardson Formula 20, 
was .833. 

The reliabilities of the anality scales were 
calculated using the split-half method. The 
reliability of the CAS was .811 and of the 
GAS .682. 

Table 1 shows the intercorrelations of the 
scales and sex for all Ss. The TS correlate 
.643 and .507, respectively, with the CAS? an 
GAS; both correlations are highly significant. 
The TS had a significant correlation of —. 

* Subsequently, slightly revised 40-item versions 
of the TS and CAS have been administered to 9 
undergraduate psychology students as part of another 


study. The correlation obtained between the tw? 
measures was .611. 
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TABLE 2 
INTERCORRELATIONS FOR MALE SUBJECTS AND FEMALE SUBJECTS 


Composite Anality Scale Grygier Anality Scale Grygier Spontaneity Scale 
Scale 
Male Female Male Female Male Female 
Time .667** .621** .506* .520** —.270 —.344* 
Composite Anality .650** .542** —.164 —.202 
Grygier Anality —.169 —.577** 
Note.—For males, N = 37; for females, N = 54. 
#9 <01 


* p < 001. 


with the GSS. Sex was not a significant factor 
on any of the scales for the pooled Ss. 

To determine if males and females scored 
differently on the scales, correlations were 
calculated for the males as a group and for 
the females as a group on the different scales. 
The data are presented in Table 2. The differ- 
ences between corresponding correlations are 
in each case nonsignificant except for the 
correlations between the GAS and GSS. This 
correlation is —.169 for the males and —.570 
for the females, and the difference between 
them is significant at the .05 level. 


DISCUSSION 


This study gives considerable support to the 
theoretical relation between time and anality. 
The correlation between the TS and the CAS 
18 unexpectedly high. Based on this sample 
of 91 Ss, the .99 confidence limits for the 
Population 7 are .35 and .72. Thus it seems 
Safe to conclude that there is a sizable rela- 
tionship between these two dimensions and 
that time does have special meaning for 
People with anal character traits. The next 
step is to establish more precisely, through 
mal means, the nature of that rela- 
lonship and the centrality of its importance 
to such individuals. 
ae ‘sda which may be raised is that 
ie c esirability rather than anal character 
lia, 3 the mediating variable between the 
a Si anality scales, While the issue cannot 
Ei with empirically here, several factors 
tation tg to militate against such an expla- 
Beha, e is that many items refer to acts 

ke ‘avior in which the socially desirable 

lon of endorsement would be difficult to 


> 


determine; therefore, what an individual really 
does in a given situation is liable to be the 
overriding influence in the response. The 
second factor is that several items which were 
clearly socially undesirable correlated very 
highly, while other items which could be 
easily interpreted in a socially desirable way 
had very low item-test correlations. 

Of the two anality scales, the CAS seems 
to be the superior one. Its reliability is much 
higher and its content seems more directly 
related to the dimensions of anality. The reli- 
ability of the GAS is so low as to make fine 
discriminations between Ss questionable. The 
things which seem to be working against it are 
its comparatively short length, the unfamiliar- 
ity of the “Like-Dislike” format, and the 
outdated quality of many items—at least for 
many contemporary college students. Exam- 
ples of such questions are whether S likes: 
Absolute respect for one’s parents, without 
reservation; Conscientious objectors; Wearing 
a uniform; and Keeping a large stock of 
canned goods in your cupboard. 

The GSS yielded correlations with the other 
scales which seem equivocal. The significant 
correlation between the GSS and TS was as 
expected, but the correlation of —.201 be- 
tween the GSS and CAS is too low to enable 
firm conclusions to be drawn. The relatively 
high correlation of —.405 between the GSS 
and GAS is undoubtedly spuriously high be- 
cause of three items which appear in both 
scales, and this constitutes 25% of the GSS. 
The results only suggest that the theoretical 
relationship between time, anality, and spon- 
taneity is valid. 

The significant difference found between the 
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correlations of the GAS and the GSS for males 
and females is likely an artifact of the Grygier 
scales, since it is difficult to explain theoreti- 
cally, and it is not confirmed in the correla- 
tions between the CAS and GSS. 

It was the psychoanalytic formulations of 
the variables under consideration which led 
to this study, and in this way the heuristic 
value of psychoanalytic theory is demon- 
strated. The two specific and important 
formulations which derive from the theory 
are (a) the way in which enduring character 
traits may be the result of the handling of 
early sexual impulses, and (b) the way in 
which time may take on symbolic meaning. 

An alternate explanation which could be 
advanced to explain the relationships found is 
a cultural one. That is, traits which are con- 
sidered “anal” may not be the result of a 
defense against anal impulses, but rather 
internalized cultural values. In fact both 
factors are probably present and it would take 
a combination of longitudinal and experi- 
mental studies to determine their complex 
relation. A serious complication in such studies 
is that many of the relevant cultural values 
are passed on to the child at the time of 
toilet training. However, the one distinguish- 
ing feature between the two explanations— 
one that could be investigated experimentally 
—is that in the former time would take on 
symbolic value, but it would not in the latter. 

The present study would seem to show the 
value of relating the dimension of time to 
enduring personality traits, whether or not 
one wishes to accept the specific theoretical 
formulations which are central to this paper. 
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VALIDITY OF INFORMANTS’ 


RATINGS OF THE BEHAVIOR 


AND SYMPTOMS OF PSYCHIATRIC PATIENTS 
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University 


Psychiatric patients were rated with 
their nearest relative and also by a 
pital. Data on 138 co: 
moment correlations 
behavior factors and by canonical 
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of Minnesota 


respect to symptoms and behavior by 


nurse after admission to a psychiatric hos- 
secutive admissions were analyzed by computing product- 
between 12 family rating scale clusters and four ward 
correlations between the two sets of data. 
The canonical analysis indicated there were 


two underlying factors accounting 


for the intercorrelations among the two sets of ratings, The results of both 


analyses were seen as providing support 


psychiatric symptoms and behavior. 


One of the critical problems in mental 
health research is the need for reliable and 
valid procedures of assessing psychiatric con- 
dition and change in psychiatric condition. 
This is particularly true in the case of patients 
living in the community where opportunities 
for observation by the psychiatric staff are 
severely limited. Adequate procedures for 
evaluating the symptoms and behavior of such 
patients are needed. 

Recent changes in the philosophies of treat- 
ment as well as in the methods of treatment 
have contributed to shorter hospital stays. 
These shorter stays have often been accompa- 
nied by an increase in the readmission rate. 
Moreover, there has been an increased tend- 
ency for hospitals to use disposition pro- 
cedures such as provisional discharge or trial 
visit as opposed to a regular or direct dis- 
charge. A practical and efficient procedure of 
following these released patients on a long- 
term basis could prove useful. Any research 
concerned with the long-term effectiveness of 
treatment has been forced to either obtain 
generous financial support or rely on such 
practical criteria such as readmission rate or 
amount of time the patient remains out of the 
hospital. Although these criteria are easily 
obtained, they are also poorly understood 
because other factors, in addition to the 
nature and severity of the patients’ symptoms, 
Would seem to influence them. Consequently, 

*The authors acknowledge the assistance of Dean 
hese in the computation of the canonical correla- 
D. A for reprints should be sent to Norris 
ares epartment of Psychology, Indiana State 

y, Terre Haute, Indiana 47809. 


for the validity of relatives’ ratings of 


it would be highly desirable to have an 
easily obtained, yet meaningful, procedure for 
assessing the behavior and symptoms of the 
patient in the community. 

One obvious procedure of assessing the psy- 
chiatric status of patients in the community 
would be to have a relative of the patient 
provide the data, Katz and Lyerly (1963) 
developed a standard set of items covering 
various symptom and behavior areas. The 
fact that relatives’ ratings of symptoms and , 
behaviors formed identifiable clusters of psy- 
chopathology, when cluster-analyzed, proba- 
bly gives some evidence of validity of their 
ratings. Katz and Lyerly (1963), however, 
have provided some additional data on the 
discriminative validity of their scale. Also, 
Ellsworth, Arthur, Kroeker, and Childers 
(1967) offered some data to indicate that the 
ratings of relatives were reliable and valid. 
Small, Small, and Gonzalez (1965) concluded 
that relatives, as informants on the prehos- 
pitalization symptoms of patients, provided a 
more complete picture than did the patients 
themselves. 

The purpose of the present study was to 
explore further the validity of relatives’ 
ratings of patient behavior by examining the 
correspondence between the ratings made by 

the relatives and those made by the staff. In 

addition, this study was intended to provide 
data bearing specifically on the validity of 
the Katz Adjustment Scales (KAS; Katz & 

Lyerly, 1963) as a device for assessing the 

prehospitalization symptoms and behavior of 

psychiatric patients. 
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METHOD 
Sample and Procedure 


Consecutively admitted patients with functional 
psychiatric disorders were selected for study. Within 
3 days of admission to the psychiatric ward of a 
university hospital, each patient’s nearest relative 
was mailed a KAS. Of all the KAS mailed, 83% 
were returned and usable. The Psychotic Reaction 
Profile (PRP; Lorr, 1961) was completed by the 
nurse for each patient within one week of admission 
to the hospital. 

Complete data, that is, a KAS plus a PRP, were 
obtained on 159 patients. For 138 of these, 87%, 
the rater was either the spouse or parent(s). KAS 
data for the remaining 21 cases were obtained from 
such a wide variety of raters, including friends, 
siblings, children, and more distant relatives that it 
was decided to exclude these from the major analyses 
of the data. 

The 138 patients included in the analyses ranged 
in age from 16 to 65 with a median age of 26 
years, Females made up 69.1% of the sample, The 
diagnostic breakdown of the sample was as follows: 
schizophrenics, 49%; other psychotics, 16%; neu- 
rotics, 16%; and others (including the personality 
pattern and trait disturbances, the sociopathic per- 
sonality disturbances and the transient situational 
personality disorders), 19%. 


Katz Adjustment Scale 
The KAS (Form R1) is designed for use by a 
relative of a patient in rating the patient’s symptoms 


TABLE 1 
KAS-PRP InTERCORRELATIONS 


PRP scales 
KAS clusters 
WwW TD PB AD 
Belligerence -07 |—07 05 |—02 
Verbal Expansive- 
ness —07 09 09 | —01 
Negativism —02 |—05 03 | —11 
Helplessness 14 01 —01 28** 
Suspiciousness —04 06 00 12 
Anxiety 05 07 01 2r* 
Withdrawal- 
Retardation 32**| 07 —10 15 
General Psycho- 
pathology 08 00 08 07 
Nervousness 04 |—07 08 19% 
Confusion 10 09 11 13 
Bizarreness 13 D 02 238% 
Hyperactivity 01 | —02 05 14 


Note—For the PRP, W = Withdrawal, TD = Thinking 
Disorganization, PB = Paranoid Belligerence, AD = 
Denr Decimala omitted. df = 136, eas 
<.05. 


* p < 01. 
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and social behavior. The 127 items, written to cover 
typical aspects of psychopathology, are to be judged 
on a 4-point scale of frequency of occurrence. The 
KAS can be scored according to a set of 12 item- 
clusters obtained on the basis of a cluster analysis 
of the 127-item set (Katz & Lyerly, 1963). 


Psychotic Reaction Profile 


The PRP (Lorr, 1961) consists of 85 statements 
describing behavior commonly observed in hospital- 
ized psychotics and is to be completed by a nurse 
or aid on the basis of observations of the patient. 
The PRP yields scores in four relatively indepen- 
dent areas of psychopathology, namely, Withdrawal, 
Thinking Disorganization, Paranoid Belligerence, and 
Agitated Depression. The validity of the PRP as a 
measure of severity of psychopathology has been 
demonstrated by specific validation studies (Lorr, 
1961; Vestre, 1966) and by its sensitivity to change 
in psychiatric condition in response to therapy 
(Casey, Hollister, Klett, Lasky, & Caffey, 1961). 

Since the data were collected over a period of 18 
months on three different wards, it was necessary to 
involve 30 different nurses to obtain ratings. Over 
two-thirds of the ratings, however, were completed 
by nine of these nurses. No assessment of interrater 
reliability was obtained within the present study, 
but previous data by Lorr (1961) and by Vestre 
(1966) indicated reliabilities ranging from .65 to .90 
and .58 to .92, respectively, for the separate scales. 
In both studies the Agitated Depression scale yielded 
the lowest coefficient. Also, the fact that this group 
of nurses had considerable experience in rating pa- 
tients for research purposes, using the PRP and 
similar scales, provides some confidence in the 
reliability of their ratings. 


RESULTS 


Product-moment correlations between the 
PRP and the KAS are presented in Table 1. 
Five of the 12 KAS variables correlate signifi- 
cantly with one or more of the PRP scales. 
It can also be seen from Table 1 that three 
of the four PRP variables correlate signifi- 
cantly with at least one of the KAS clusters. 

A qualitative matching of the factors com: 
prising the two sets, on the basis of descrip- 
tions furnished by Lorr (1961) and by Katz 
and Lyerly (1963), suggested that certain fac- 
tors should be somewhat parallel. In Table 2 
scales with similar item content are paired. 
For three of four PRP scales, at least one of 
these pairings is statistically significant at the 
01 level and in the expected direction. For 
the remaining scale, Paranoid Belligerencé, 
none of the pairings approaches statistic 
significance. With the exception of the pait- 
ings with Paranoid Belligerence, the only 
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TABLE 2 


CORRELATIONS BETWEEN Pars OF PRP AND KAS SCALES MATCHED FOR SIMILAR 
Trem CONTENT, ACCORDING TO Type oF FAMILY RATER 


PRP scales KAS clusters Total group Rated by spouse | Rated by parent 
Withdrawal Withdrawal-Retardation ou 36** 29* 
‘Thinking Disorganization Confusion 09 17 01 
Bizarreness 28** 42** 12 
Paranoid Belligerence Belligerence 05 00 09 
Verbal Expansiveness 09 03 18 
Negativism 03 07 02 
Suspiciousness 00 00 02 
General Psychopathology 08 11 07 
Agitated Depression Helplessness 28** 23 24* 
Anxiety A bad 16 32%. 
Nervousness 19* 14 19 
Sere omitted. For total group, df = 136; for remaining two groups, df = 67. 


p< .01. 


matched pair of scales not significantly cor- 
‘related is that of Thinking Disorganization 
and Confusion. The KAS cluster, Hyperactiv- 
ity, could not be reasonably matched with any 
of the four PRP scales. 

Of the 48 intercorrelations in Table 1, five 
teach significance at the .01 level, with one 
additional correlation significant at the .05 

level. Five of these significant correlations 
“occur between variables matched for similar 
content. 

Even though the qualitative comparison of 
factors between the two rating devices con- 
firmed most of the expected relationships, 
multivariate procedures were used to deter- 
l mine the degree of confidence which could be 
placed in the overall hypothesis of an under- 
lying linear relationship between the two sets 
of variables. Canonical correlations (Cooley 
& Lohnes, 1962) were computed between the 
PRP scales and the KAS clusters for the 
Purpose of determining the number of in- 


dependent factors influencing both sets of 
variables. 

The canonical correlations and the results 
of significance tests are shown in Table 3. 
As Cooley and Lohnes (1962, p. 37) indicate, 
the chi-square approximation for the distribu- 
tion of lambda can be used to test the hy- 
pothesis that the two sets of variables are 
unrelated. The two significant canonical cor- 
relations obtained indicate that there are two 
factors which underlie the intercorrelations 
among the two sets of ratings. The character- 
istics of these two factors can be inferred from 
an inspection of the weights (contributions) 
of the individual variables. Table 4 shows the 
weights associated with the individual vari- 
ables for each of the two significant canonical 
factors. 

Latent Root 1 is defined primarily by high 
positive loadings on KAS Bizarreness and 
PRP Thinking Disorganization and by high 
negative loadings on KAS General Psycho- 


TABLE 3 
CANONICAL CORRELATIONS OF PRP AND KAS VARIABLES 


Latent root Canonical co Wilks lambda Chi-square df p< 
Lee relations 

l 48 515 83.4 48 002 

2 AL 667 50.8 33 1028 

3 39 ‘804 27.4 20 “129 

4 23 945 71 9 1626 


ved, chi-square is distributed with df = (p-r) (a7). 


" reme\ot®-—df for chi-square for testing Latent Root 1 is p PRP scales Xa 


KAS clusters, where $ is 4 and gis 12. Also, with r roots 
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TABLE 4 
WEIGHTS ASSOCIATED WITH Latent Roots 1 AND 2 


A Root 1 Root 2 
Variables weights weights 
KAS 
Belligerence 04 —.19 
Verbal Expansiveness .87 .92 
Negativism —.48 —.96 
Helplessness 48 —.54 
Suspiciousness AL 1.00 
Anxiety —.25 38 
Withdrawal-Retardation 24 —1.74 
General Psychopathology | —1.40 61 
Nervousness —.14 —.16 
Confusion —.29 26 
Bizarreness 1.54 —.15 
Hyperactivity —.14 08 
PRP 

Withdrawal —.36 —.91 
Thinking Disorganization AS 7 
Paranoid Belligerence — 38 36 
Agitated Depression 40 14 


pathology. Thus, the loadings associated with 
Latent Root 1 seem to define a dimension 
reflecting behavior and symptoms of schizo- 
phrenic disorganization, 

Latent Root 2 is characterized by high 
negative loadings on PRP Withdrawal and 
KAS Withdrawal-Retardation. These weights 
would suggest that this dimension clearly re- 
flects degree of interest in, or responsiveness 
to, the environment and activity level. 


Discussion 


In general, the results suggest that there 
are two common factors underlying the nurses’ 
and the relatives’ ratings of symptoms. The 
first factor clearly reflects schizophrenic dis- 
organization or bizarreness; the other, a di- 
mension of withdrawal. It might be noted 
that there was some suggestion of a third 
factor emerging in the canonical analysis (see 
Table 3). Examination of the associated 
weights suggested a dimension reflecting de- 
pression and agitation in that PRP Agitated 
Depression and KAS Anxiety and Helpless- 
ness obtained the highest loadings. Although 
the results were somewhat inconsistent, and, 
as can be seen in Table 3, did not reach 
significance, the zero-order correlations of the 
PRP Agitated Depression scale (Table 2) are 
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consistent with such a factor. Thus, the re. 
sults would seem to indicate that there is, 
indeed, overlap between symptoms and behay. 
ior as viewed by relatives of patients and as 
viewed by the professional staff. 

There are certain facts which should be 
borne in mind in interpreting the magnitude 
of the KAS-PRP intercorrelations in Table 1, 
Most obvious, of course, are the differences in 
the nature of the raters (nurses vs. relatives 
of patients) which presumably could result in 
differences in objectivity or bias in observing 
and reporting behavior and symptoms and 
also differences in knowledge of psychiatric 
symptoms. The fact that the KAS items 
are written in nontechnical language and 
refer to rather specific aspects of observable 
behavior should reduce both of these potential 
problems. 

A more important fact to be borne in mind 
in interpreting the magnitude of the Table 1 
intercorrelations, however, relates to differ- 
ences in time at which and setting in which 
the two sets of ratings were obtained. Rela- 
tives based their ratings on the behavior of 
the patient during the days just prior to 
admission; nurses based their ratings on the 
behavior of the patient during the first week 
of hospitalization. Perhaps even more impor- 
tant than the 1- to 2-week time differential, 
would be the differences in the setting oF 
situation. Patients show wide differences in 
how they respond to being hospitalized. Being 
admitted to a psychiatric ward tends to give 
some patients a feeling of comfort and secu- 
rity, for example, a patient who is disturbed 
and frightened by suicidal impulses. For other 
patients, admission to the hospital is a dis- 
turbing or upsetting experience which leads 
to further disorganization of behavior. Conse- 
quently, the time differential between the two 
sets of ratings and the different settings ™ 
which the observations were made can be seen 
as factors which would reduce the relation- 
ships between the two sets of observers. In 
fact, one explanation for the absence of rela- 
tionships involving PRP Paranoid Belliget- 
ence might be along these lines. Of the four 
dimensions (scales) of psychopathology me 
sured by the PRP, it would seem that Para- 
noid Belligerence, that is, hostility, irritabil- 
ity, resistiveness, and suspiciousness, WO 


most dependent upon the situation 
vironment. 

Table 2 presents the correlations by type 
of family rater for those scales matched for 
smilar item content. The two sets are reason- 
my sma and the differences could easily 

seit from chance variation. The largest dif- 
ference is in the low correlations between the 
o measures for Thinking Disorganization in 
e group rated by parents, as opposed to 
oup rated by the spouse. The KAS-PRP 
Jations for the 21 patients rated by 
sons other than parents or spouse were 
edly different, but the small size of the 
p makes it difficult to be certain that 
is a real difference. This difference in the 
elation matrix and the disparate nature 
the raters in this small group are the 
s for excluding these cases from the 
ical correlations. 
he finding that there is a relationship 
n the ratings of symptoms and behav- 
nade by relatives and those by the psychi- 
taff suggests that relatives might be a 
source of such information for research 
tposes, While canonical correlations tend to 
ease on replication, due to the fact that 
they capitalize on chance combinations, the 
al coherence of the variables, along with 
ignificant and meaningful zero order 
élations, suggest that a valid relationship 
between these two sets of variables. 
ite the possibility that the relatives may 
objectivity in observing and reporting 
ior and symptoms, the results of this 
Suggest their ratings are valid descrip- 
l of the nature and severity of such 
avior and symptoms. 

While the magnitude of the correlations are 
large enough to allow clinical interpreta- 
| or individual prediction, they are large 
gh to indicate the KAS is a useful re- 
rch tool. Moreover, the present correlations 
esent the minimal validity of a relatively 
and undeveloped instrument. Efforts 
[be directed toward improving the valid- 
f the instrument. First, the clusters them- 
may need refinement and a factor 
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analysis of the items using a larger sample 
and more objective methods of rotation may 
yield more homogeneous factors. In addition, 
validity could be increased without increasing 
the length of the total instrument by dropping 
items not scored on any cluster and adding 
items to strengthen those clusters containing 
only four or five items. 

An entirely different approach to improving 
validity might involve a study of variables 
related to valid ratings, that is, a study of 
moderator variables. The most obvious and 
accessible of these are associated with the 
relationship of the rater to the patient, not 
only its objective aspects, but its subjective 
components as well. Such a line of research 
should yield refinements which will make the 
KAS an even more useful instrument in 
an area where too little is available at the 
present time. 
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MATERNAL PERSONALITY CORRELATES OF SONS’ 
CREATIVITY 


GEORGE DOMINO? 


Fordham University 


California Psychological Inventory scores of 33 mothers of creative high school 
males, as identified by both teachers’ nominations and test scores, were com- 
pared with scores of a control group. Mothers of creative Ss exhibit greater 
self-assurance, initiative, and interpersonal competence; they prefer change and 
unstructured demands; they are more insightful about others, more tolerant, 
and value autonomy and independent endeavor. They are, however, less so- 
ciable, less conscientious, less dependable, less inhibited, less concerned about 
creating a favorable impression, and less nurturant and obliging towards others. 


Numerous studies have focused on creativ- 
ity but relatively little is known about what 
specific parental aspects may be related to 
filial creativity. The research literature sug- 
gests that the home climate of highly creative 
persons is significantly divergent from that of 
less creative in several aspects, and it seems 
reasonable to assume that parental personal- 
ity characteristics influence a child’s creative 
development. 

Weisberg and Springer (1961), in a study 
comparing psychiatric interview ratings of 
parents with children’s scores on Torrance’s 
tests of creativity, reported that the parents 
of highly creative children exhibited greater 
expressiveness with lesser dominance, and 
were more indifferent towards their chil- 
dren’s regressive tendencies. These investi- 
gators characterized the emerging family 
pattern as involving somewhat distant inter- 
actions, little clinging to each other for sup- 
port, little stress on conformity to parental 
values, mediocre sexual adjustment, paternal 
exercise of authority, and maternal ambi- 
valence. 

MacKinnon (1962, 1965) was able to iden- 
tify a number of differentiating parental 
characteristics from the retrospective reports 
of his creative architects, For example, the 
parents were seen as exhibiting an extraordi- 


1 The author is grateful to the Fordham University 
Computer Center and John Walsh for technical 
assistance. Anne Anastasi, Marcel Goldschmid, and 
Harrison Gough gave helpful comments on an earlier 
draft of this paper. Requests for reprints should 
be sent to George Domino, Department of Psychol- 
ogy, Fordham University, Bronx, New York 10458. 
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nary respect for the child and confidence in 
his ability to do what was appropriate, while 
often there was a lack of intense closeness 
between parent and child. 

Dreyer and Wells (1966) studied the 
instrumental-expressive orientation, role ten- 
sion, and degree of autonomy-granting of 
parents of more and less creative children, 
Parents of more creative children showed 
greater intrafamilial variance in their rankings 
of 10 domestic values, and greater role ten- 
sion reflecting more individual divergence and | 
expression of feeling. | 

The present report represents one facet of 
a research project designed to explore parental 
factors related to filial creativity. This paper 
will report on maternal personality correlates 
of sons’ creativity. The focus on mothers re 
flects the assumption that, at least in the 
American culture, mothers are likely to have 
more opportunities than fathers to influence 
their children’s psychological growth. 


METHOD 


Teachers in nine New York State high beet 
were asked to nominate students who had prose 
some observable evidence of creativity. These sche 
are located in primarily middle- to upper-class SUD" 
urban areas. The specific directions, modified ao 
Schaefer (1967), and reflecting MacKinnon’s (1962) 
definition of creativity, were: 


We are interested in identifying creative students 
Please nominate any students you know that yo 
consider as creative. For our purposes, creativi! y 
is a process characterized by: 1) originality, no 
elty, or freshness of approach; 2) adaptiveness 

reality in that it must solve a problem, achiev? 
a goal, in general be a reality oriented respons?! 
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and 3) the original insight or approach must be 
developed or elaborated. In order to be nominated 
then, a student must have actually produced a 
specific product that can be regarded as both novel 
(fresh, unusual, imaginative) and of genuine merit 
or value. The original product might be a drawing, 
poster, poem, display, composition, invention, ex- 
perimental design, special project, etc. In identify- 
ing creative students, please remember that the 
novelty and merit of individual products may bear 
little or no relation to the intelligence of a student, 
or to whether he is an intellectual, hard-worker, 
beatnik, or troublemaker. Try not to let any 
personal feelings influence your evaluation of a 
student’s creativity. 


A total of 114 male students were nominated, 37 
for artistic activities, 62 for scientific activities, and 
15 for both types of activities. 

The Remote Associates Test (Mednick & Mednick, 
1967) and the Alternate Uses test (Wilson, Christen- 
sen, Merrifield, & Guilford, 1960) were administered 
to those 114 male nominees. To be retained in the 
creative group, it was arbitrarily decided that a 
nominee must obtain a raw score of 15 or above on 
the Remote Associates Test (14 represents the lowest 
50th percentile for college students reported by 
Mednick & Mednick, 1967), and a score of 26 or 
above on the Alternate Uses test (equivalent to the 
54th percentile of college freshmen norms). 

Both the teachers’ nominations and the stringent 
test norms were deliberately used in order to identify 
Ss who could be justifiably called creative. A total 
of 38 students met the test criteria. Of these 38 Ss, 
12 had been nominated for artistic activities, 17 for 
scientific, and 9 for both. A chi-square analysis of 
type of activity versus test criteria yielded a non- 
Significant value, indicating that use of the Remote 
Associates Test and the Alternate Uses test did not 
discriminate against either the artistic or scientific 
nominees. 

A control group of 38 Ss matched on sex, educa- 
tional level, grade-point average, and geographical 
residence, was formed by using school records to 
identify possible matches, eliminating those nomi- 
nated as creative or scoring above the cutoff points 
on the creativity tests. Table 1 gives means and 
standard deviations of the creativity tests for both 
control and creative Ss. 

_ Mothers of Ss were then invited to participate 
in a study of “the influence of the family on chil- 
dren’s abilities” by completing the California Psycho- 
logical Inventory (Gough, 1964a); 33 mothers of 
Creative Ss and 31 mothers of noncreative Ss were 
tested. These women can be characterized as upper 
middle Class, median age 39, all high school graduates 
with slightly over one-fourth holding college degrees. 
ah CPI was used since it has been well validated 
in large samples of normal Ss and has been used 

Various studies of creativity (e.g., Helson, 1967; 

‘acKinnon, 1962a, 1962b, 1965). Analyses of mean 
Ries for each of the 18 CPI scales were carried 

y means of ¢ tests. 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS OF 
Creativity TESTS 


Remote Associates 
ERS Alternate Uses 


Test 
Ss 
Xx SD xX SD 
Creative 19.2 3.6 29.2 1.9 
Control 12.4 2.7 21.5 2.0 


Note.—For Creative, N = 38; for Control, N = 38. 


RESULTS 


Twelve mean differences reached statistical 
significance, 11 at the .01 level and one at 
the .05 level. The data are presented in 
Table 2. 

Mothers of creative Ss scored significantly 
higher on Dominance (Do), Capacity for 
Status (Cs), Tolerance (To), Achievement 
via Independence (Ai), Psychological-Mind- 
edness (Py), and Flexibility (Fx); they 
scored significantly lower on Sociability (Sy), 
Responsibility (Re), Socialization (So), Self- 
Control (Sc), Good Impression (Gi), and 
Femininity (Fe). 


DISCUSSION 


Although the CPI profiles for both groups 
are above average as compared to the general 
population, the results clearly indicate that 
mothers of creative Ss differ significantly 
from mothers of noncreative Ss on several 
dimensions of personality. Mothers of creative 
Ss exhibit greater self-assurance and initiative 
(Do) and interpersonal competence (Cs); 
they prefer change and unstructured de- 
mands (Fx); they are more insightful about 
others (Py), but at the same time more 
tolerant (To), and they value autonomy and 
independent endeavor (Ai). They are, how- 
ever, less sociable (Sy), exhibit less respon- 
sibility (Re) and social probity (So), are less 
inhibited (Sc), less concerned about creating 
a favorable impression (Gi), and less nur- 
turant and obliging toward others (Fe). 

The general picture generated by this CPI 
mean profile is that of a highly independent 
and capable woman perceptive in her inter- 
personal relationships but little disposed to 
worry about the impression she creates. She 
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TABLE 2 
CPI Scores or MOTHERS OF CREATIVE AND NONCREATIVE ADOLESCENTS 
Creative Noncreative 
CPI Scale t 
x SD x SD 

Dominance (Do) 30.2 5.2 26.1 4.9 3.25** 
Capacity for Status (Cs) 23.8 3.1 19.2 3.3 5.74** 
Sociability (Sy) 22.7 47 26.9 4.6 3.61** 
Social Presence (Sp) 35.4 5.3 35.2 5.2 0.07 
Self-Acceptance (Sa) 22.9 3.7 22.6 2.9 0.36 
Well-Being (Wb) 38.8 3.6 39.5 44 0.69 
Responsibility (Re) 29.1 4.5 34.6 3.9 5.48" 
Socialization (So) 35.6 5.1 42.1 4.6 5.36 
Self-Control (Sc) 27.5 7.2 35.5 6.1 4.81* 
Tolerance (To) 27.3 4.3 24.1 3.9 3.12% 
Good Impression (Gi) 16.6 5.5 20.3 6.1 2.54* 
Communality (Cm) 24.5 1.8 25.1 1.3 1,53 
Achievement via Conformance (Ac) 29.3 4.2 29.4 3.7 0.10 
Achievement via Independence (Ai) 24.3 3.8 21.1 2.9 3.80** 
Intellectual Efficiency (Ie) 44.8 5.2 42.9 4.1 1.62 
Psychological-Mindedness (Py) 15.8 2.9 12.6 2.4 4.82" 
Flexibility (Fx) 14.9 3.6 11.3 3.3 4.17" 
Femininity (Fe) 22.4 3.5 24.8 3.7 2.66" 


Note.—For Creative; N = 33; for Noncreative, N = 31. 
*p <.05, 
“p <.01. 


can adapt to change and variety, and in fact 
is restive when faced with rules and con- 
straints. She is free in her expression of 
impulse, but perhaps not always dependable. 

These themes emerging from the CPI 
profile have in many cases been previ- 
ously reported as characteristic of creative 
persons (cf. Barron, 1965; Gough, 1964b; 
MacKinnon, 1963). 

One theme relates to interpersonal en- 
deavor, These women have the assurance, 
the versatility and ascendancy necessary to 
achieve a position of high community visibil- 
ity. Yet they do not. Of the 33 mothers of 
creative Ss, only four were actively involved 
in some community activity, as opposed to 
17 of the 31 mothers of noncreative Ss 
(x? = 15.17, p < .01). They are also moder- 
ately indifferent to social demands: of the six 
CPI scales addressed to various aspects of 
socialization and control, they score signifi- 
cantly lower on four (Re, So, Sc, Gi). These 
results are consonant with those of Dreyer 
and Wells (1966) who found that the mothers 
of creative children were much less concerned 
with a place in the community and more 
concerned with everyday interests. MacKin- 


non (1961) reported that his creative archi- 
tects revealed less desire to be included in 
group activities than any other group studied 
by him and his coworkers. Yet, when these 
architects chose to participate they did so 
with consummate skill. 

A second theme concerns interpersonal 
closeness. These mothers are perceptive indi- 
viduals, alert to the needs of others, informal, 
tolerant, and observant (To, Py, Fx). Yet, 
these socially desirable characteristics are 
counterweighted by a degree of indifference 
and detachment (Gi, Sy). These results are 
consonant with the recollections of MacKin- 
non’s achitects who reported a lack of intense 
closeness with their parents. They also agree 
with the findings of Getzels and Jackson 
(1962) that mothers of more creative children 
were less observant of failings in their children 
than were mothers of less creative. 

A third theme reflects impulse expression, aS 
shown primarily by the lower mean on the Sc 
scale, but also supported by lower means on 
So and Re. Barron (1957) has seen this fac- 
tor as indicating the potentiality of the plea- 
sure principle occasionally to become ascen- 
dant over the reality principle. One might 


ms 


- 


te that this reduction of self-control ` 


allows the person to regress in the service of 
the ego, and that mothers of creative children 
are themselves more creative than their peers. 

A fourth theme concerns initiative. These 
women exhibit certain personality character- 
istics which in our culture tend to be labeled 
as more masculine. This can be seen in the 
significantly higher means on the Do and Ai 
scales, as well as by the lower mean on the Fe 
scale, Helson (1967) has reported that a 
masculine component is an important factor in 
the syndrome underlying creativity in women. 

A final theme has to do with independence. 
Independence in thought and action as a 
characteristic of creative persons has been 
Teported by several investigators. Dreyer and 
Wells (1966) reported that the mothers of 


high creative children stressed emotional se- 


curity, leading to greater confidence and inde- 
pendence in their offspring. MacKinnon 
(1961) found that creativity in architects cor- 
related highly with self-assertive independence. 
The present CPI findings are consonant in 
Teyealing these mothers to be more inde- 


_ pendent, autonomous, and self-reliant. 


‘It is interesting to note a possible (but 
superficial) discordance in these themes: high 
Social poise with little affiliation; interper- 
sonal insight with indifference; achievement 
drive and masculine initiative with biological 
femininity, Thus there is a balance between 
contrasting dynamic forces, in which positive 
and negative characteristics are both allowed 
expression. Jung’s concept of the transcendent 
fiction which accomplishes the successful 
integration of opposites is relevant here. 

The findings presented here indicate quite 
dearly that mothers of creative adolescents 

OSSeSS personality characteristics not only 
significantly different from those of mothers 
Of noncreative adolescents, but quite similar 
to those observed among creative adults. Al- 

ugh there is no direct evidence, it is rea- 
Sonable to postulate that the mothers of cre- 
ative Ss are themselyes more creative than 
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the general population, and that both their 
personality characteristics and their creativity 
are evocative of greater creativity in their 
children. 
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IDENTIFYING NEUROLOGICALLY IMPAIRED CHILDREN 
THROUGH A TEST OF AUDITORY PERCEPTION 


DAVID A. SABATINO? 
The Catholic University of America School of Education 


The purpose of this study was to determine if an experimental Test of Audi- 
tory Perception (TAP) could discriminate between neurologically impaired and 
normal children. The relationships were established between each subtest of 
the Test of Auditory Perception and the Bender Visual Motor Gestalt Test 
(BVMGT). When the TAP was administered under normal testing procedures, 
four of the six subtests discriminated tht experimental from the control Ss at the 
01 level. All of the TAP subtests discriminated the neurologically impaired 
from normal children at the .01 level under conditions of background noise. The 
correlations between the TAP subtests were positive but low in value. Negative 


correlations were found between each of 


The behavioral sciences have developed 
many tests of visual perception as means of 
identifying children who are unable to in- 
terpret environmental stimuli as meaningful 
information. Herbert (1964), in reviewing 
the psychological literature, found the Bender 
Visual Motor Gestalt Test (BVMGT) to be 
the most popular test of visual motor percep- 
tion, Sundburg (1961), found the BVMGT to 
be the fifth most commonly used clinical test. 

Bender (1938), assumes that visual percep- 
tion based on Gestalt principles can: (a) 
differentiate development levels, and (b) dif- 
ferentiate neurologically impaired from non- 
neurologically impaired Ss. It seems in re- 
viewing the literature (Barnes, 1950; Benton, 
1962; Clements & Peters, 1962) that the be- 
havioral sciences have established clinical evi- 
dence to support a relationship between visual- 
motor perception and minimal neurological 
impairment. 

In contrast, research efforts attempting to 
study the auditory perceptual function of 
children suspected of neurological impairments 
are limited. To quote Gallagher (1960), “in 
contrast to the visual-motor areas, other areas 
of sensory perception have been almost ne- 
glected [p. 19].” 

Strong (1964) factored 92 test and subtest 
variables utilizing computer-based factor anal- 
ysis. He found that the auditory decoding 
and auditory-vocal automatic subtests of the 

1 Requests for reprints should be sent to David A. 
“Sabatino, School of Education, The Catholic Uni- 
versity of America, Washington, D. C. 20017. 
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the TAP subtests and the BVMGT. 


Illinois Test of Psycholinguistic Abilities 
(ITPA; McCarthy & Kirk, 1961) correlated 
.58 and .51, respectively, with the “brain dam- 
age syndrome.” He found these two tests to 
correlate even lower with academic achieve- 
ment. More importantly, when factored for 
auditory memory, these two subtests corre- 
lated the lowest of eight other tests of this 
skill. The reliability coefficients were in the 
low forties. None of the other seven subtests 
of the ITPA correlated well with any criteria. 

Hasterok (1964) found that visual and 
auditory difficulties, as identified in ITPA 
profiles, were unsuited to answering the im- 
portant question of the relationship between 
learning problems and perceptual or sensory 
problems. He concluded that children cannot 
be matched for comparison purposes on the 
basis of sensory or perceptual problems. The 
problem is that the ITPA was designed as 4 
Measure of psycholinguistic skills and not 
perceptual functions. 


PURPOSE 


The problem within the initial design of 
this study became that of developing an ex- 
perimental test of auditory perception, since 
comprehensive measures of this function 
were not available. J 

The purpose of this study was to determine 
the ability of the BVMGT and Experimental 
Test of Auditory Perception (TAP) to dif- 
ferentiate children with neurological impair- 
ment from children with no known neurologi- 
cal impairment. In other words, the primary 
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TABLE 1 
MATCHING CRITERIA FOR THE EXPERIMENTAL AND CONTROL SUBJECTS 


Mean chronological age 


dimension of this research was to obtain data 
concerning the means by which a commonly 
used test of visual-motor perception and an 
experimentally constructed TAP would iden- 
tify children previously known to be neuro- 
logically impaired (experimental Ss) from 
those with no such impairment (control Ss). 


METHOD 


Thirty Ss with known brain damage, minimal in 
© extent, and 30 Ss with no known cerebral pathology 
Were used as experimental and control Ss. The Ss 
ranged in age from 6 years 1 month to 12 years 7 
months. The brain-damaged children were selected 
on the basis of pediatric-neurological examination, in- 
cluding electroencephalograms, from a large popula- 
tion of children referred to Children’s Hospital Learn- 
ing Disability Center, Columbus, Ohio. 

q _ The children selected as Ss were free from physi- 

tal, health, gross motor, or social-personal problems. 
All the children were given sensory hearing screening 
tests of O decibels (1SO) on pure-tone air conduction 
tests at 500, 1,000 and 2,000 Hz. 

The primary purpose was to determine if the 
experimentally constructed TAP and the BVMGT 
would discriminate between these two groups. The 
experimental and control groups were matched on 
age, Sex, and verbal IQ. Table 1 shows the matching 
criteria and accompanying level of significance. 


a 


Construction of the Experimental Test of 
Auditory Perception 


The TAP was recorded on electromagnetic tape by 
Placing the tape recorder (Wollensak-T-1500) into a 
Sound-treated chamber (Industrial Acoustics Com- 
Pany, Model 402). The stimulus output was estab- 
lished at 60 decibels in order that the inter-S test 
Consistencies could be maintained. The original test 
ibe recording was made with a known output, free 
rom any interfering noise in the background. A 
Second TAP containing a background of broad 
Spectra sawtooth noise was constructed by recording 


in years and months | Mean years Sex Verbal IQ 
onr in school 
M Range Boys Girls M Range 
Experimental 8.3* 6 to 12-5 3.4 19 11 105.5** 87-130 
Control 8.9* 6 to 12-4 3.7 18 12 106.9** 88-133 
* 
=p 01 


the original test tape and controlled noise simul- 
taneously. The signal-to-noise ratio was 60/54 deci- 
bels. 
The content of the TAP was selected to be the 
simplest and purest measures of auditory perception 
for each hypotheoretical stage. The difficulties in 
assembling such a test were: 

1. The perceptual stimuli must be received solely 
by the auditory mechanism and not by other sensory 
and perceptual systems or the function measured 
would be an integrative one. 

2. The response mechanism must not require a 
complex response or the perceptual measurement 
would become one of S’s motor or vocal response. 

The following are the subtests of the TAP: 

1. Auditory Recognition is the meaningful recogni- 
tion of sounds and words, The subtests used to de- 
termine auditory recognition were: (a) A Speech 
Sound Discrimination Test of paired nonsense sylla- 
bles which were identified as alike or different. (b) 
A Word Discrimination Test which required S to 
distinguish a meaningful word presented aurally 
from five nonsense words which sounded similarly. 

2. Auditory Retention is the recall of connected 
units of language. Speech sounds are first recognized 
as meaningful, then stored in sequential order and 
retained if further association of their meaning is to 
take place. (a) The Immediate Recall of Rote Digi- 
tal Units, administered at 1-second intervals requir- 
ing S to point to graphic number representations at 
the conclusion of the sequence. (b) The Memory for 
Connected Speech, was based on a sentence memory 
test which required an immediate meaningful recall 
of connected words. 

3. Auditory Integration is the ability of the listener 
to bring together disassociated auditory stimuli. The 
integrative function of language has been shown to 
be approximated by having children reproduce rhyth- 
mic patterns. (a) Rhythmic Structures. The child 
was requested to duplicate tapping responses to pre- 
recorded patterns. 

4, Auditory Comprehension is the summative abil- 
ity of the child to receive, retain, and integrate units 
of aural language. It is the final, complex auditory 
perceptual act based on all the functions previously 
established. (a) The Test of Auditory Comprehen- 
sion required the child to listen to a story and re- 
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TABLE 2 


DISCRIMINATION OF SUBJECTS BY THE BVMGT AND 
tae TAP AT THE STATISTICAL CRITERION 
UNDER NORMAL TESTING CONDITIONS 
AND CONDITIONS OF BACK- 
GROUND NoIsE 


z al testi 
Background noise cere E 


conditioning 
Test 
Experi- Experi- 
mental | Control | enta] | Control 
BVMGT 8 2 8 2 
TAP 29 4 23 5 


spond to questions which generally called for factual 
one-word responses of yes or no. 


Experimental Assessment and Scoring 
Procedures 


The two tests of auditory perception were ad- 
ministered on different days because it was neces- 
sary to randomize the administration of the tests 
among Ss. Both tests (the original test recording and 
the one with noise in the background) had the same 
content. 

Three of the subtests merely required yes, no, or 
pointing responses, One subtest required a tapping 
response, and two required oral responses. These were 
the subtests of word recognition and comprehension, 
Both of these subtests confined the length of re- 
sponse to one word. The dichotomous response 
choices with limited oral interaction reduced the 
possibility of any scoring problem due to an articu- 
lation problem in a child’s speech, 

The BVMGT was administered by placing a test 
card in the upper-left-hand corner of an 84 X 11-inch 
sheet of plain white paper. The card and paper were 
both held in position in such a manner that the 
examiner had full control of the test material and 
could maintain a consistency between administration 
procedures for each S. The BVMGT was scored using 
the Koppitz (1964) developmental scoring system. 
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RESULTS 


A statistical criterion was established at 
the first minus standard deviation (—1 SD) 
to differentiate the experimental and control 
S responses to the BVMGT and the TAP, 
The adoption of this statistical criterion per- 
mits Ss to be categorized in regard to their 
number of errors on the BVMGT and TAP, 
Table 2 shows the number of experimental 
and control Ss that were functioning more 
poorly (more errors) than the statistical cri- 
terion would permit for both TAP and the 
BVMGT. The data indicate that the 
BYMGT identified eight of the experimental 
Ss, while the TAP with background noise 
identified 29 Ss, and 23 under the original 
testing conditions. 

The relationship between each TAP subtest 
and the BVMGT was determined by multiple 
correlational analyses. All of the relationships 
were negative. Table 3 shows the correla- 
tional values for each S group on each sub- 
test. The degree of negative correlation runs 
from —.19 to —.64. This means that none of 
the TAP subtests assess the same behavior as 
the BVMGT. 

It is apparent from the data involved in Ta- 
bles 2 and 3 that the human functions as- 
sessed by the BVMGT are not the same as 
assessed by the TAP in either normal or 
neurologically impaired children. In fact, the 
data suggests that the routine assessment of 
visual perception may contribute to our over- 
looking a very large group of children with 
serious auditory perceptual impairment. The 
data establishes that each of the TAP sub- 
tests assessed a different function than does 
the BVMGT. Therefore, it became important 


TABLE 3 
CORRELATIONAL VALUES BETWEEN THE BVMGT anp Eac TAP Sunrest 


Covariance Correlation 
Test correlations 

Experimental Control Experimental Control 
BVMGT X Auditory Recognition (sounds) —7.332 —1.903 ; —.368 
BVMGT X Auditory Recognition (words) —1.163 —1.078 z —.397 
BVMGT X Auditory Retention (digits) —3.713 —2.731 i —37 
BVMGT X Auditory Retention (sentences) —1.011 —1.213 i —.368 
BVMGT X Auditory Integration (integration) —1.904 —4.508 —.492 
BVMGT X Auditory Comprehension —1.871 —5.772 —.244 
Re S E AAS N hoe! oe 


TABLE 4 


” OF SIGNIFICANCE BETWEEN THE MEANS OF 
‘gum EXPERIMENTAL AND CONTROL GROUPS FOR 

‘EAch TAP SUBTEST ADMINISTERED UNDER 

Conpitions or BAckcrounp NOISE 
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TABLE 5 


LEVELS OF SIGNIFICANCE BETWEEN THE MEANS OF 
THE EXPERIMENTAL AND CONTROL GROUPS FOR 
Eacu TAP SUBTEST ADMINISTERED UNDER 
NORMAL TESTING CONDITIONS 


= 


Subtests of Auditory Perception t 
itory Recognition (sounds) 8.017* 
tory Recognition (words) 9.52565* 
tory Retention (digits) 6.396* 
itory Retention (sentences) 7.235* 
itory Integration (tapping) 3.790* 
itory Comprehension (stories) 4,901* 


Subtests of Auditory Perception t 
Auditory Recognition (sounds) 4.068* 
Auditory Recognition (words) 3,5898* 
Auditory Retention (digits) 1.9447 
Auditory Retention (sentences) 3.3374* 
Auditory Integration (tapping) 1.3951 
Auditory Comprehension (stories) 4,5938* 


could most aptly discriminate the experimental 
‘and control Ss under test administrations of 


ground-noise) conditions. 
e f-test values indicated that under 
ditions of background noise, each of the 
AP subtests discriminated between the ex- 
imental and control Ss at the .01 level as 
ficant. The results are shown in Table 4. 
nder the original test conditions (no- 
ground-noise) the subtests of auditory 
tention (digits) and auditory integration 
ping responses) were unable to discrimi- 
ate between the experimental and con- 
‘tol Ss at a significant level. All the other 
ests revealed significant differences at the 
level. This means that nonsense words or 
s, which have no direct meaning for lan- 
age, in sequence or interpretation, are a 
oor means of assessing auditory perception. 
probably is in keeping with the standard 
tion of perception, which posits the in- 
tetation of incoming sensory stimuli in 
that it may be correctly relayed to 
er cortical centers, Repeating digits in 
te fashion and listening to tapping patterns 
not tasks that require interpretation of 
n auditory stimulus. 


CONCLUSION 


Clinically, the BVMGT is very commonly 
d to determine if possible neurological 
age is present in children. The purpose of 
Study was to determine if it would dis- 
te children with known neurological 
ment from children with no known neu- 


*p <01. 


rological impairment better than an experi- 
mental TAP. 

The results indicate that the experimental 
TAP discriminated the neurologically im- 
paired from normal Ss at a significantly higher 
level than did the BVMGT. Furthermore, the 
TAP correlates negatively with the BVMGT. 
This indicates that the TAP and BVMGT do 
not predict the same perceptual behaviors. 

The TAP had much more discrimination 
power when administered in background noise. 
This suggests that perceptually disturbed chil- 
dren have difficulty separating ambient audi- 
tory background (noise) from a meaningful 
stimulus signal. It is interesting to note that 
only the TAP subtests that contained mean- 
ingful language were affected by ambient 
background noise. 
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‘With the extended use of scales for rating 
interpersonal processes, concern with psycho- 
metric rigor has led investigators of rating 
‘methodology to focus almost exclusively upon 
reliability of ratings while neglecting the 
l questions of accuracy or validity (Auld 
urray, 1955; Shoben, 1956; Taft, 1959, 
16; Vargas, 1954). 
example, the work of Shapiro (1968) 
cates that “untrained raters are able 
differentiate high and low levels of psycho- 
fapeutic behavior in a manner which is 
to that of trained raters [p. 88].” 
er, this research does not indicate 
ther or not the ratings of either group of 
are related to any validity criteria. Fur- 
while suggesting, in effect, that everyone 
discriminate gross levels of functioning, 
ros (1968) study leaves the reader 
jut cues as to which raters can make the 
accurate or valid predictions. 
view of the literature dealing with 
and their personal characteristics re- 
it two variables have been of primary 
tn to investigators of general, person- 
ption rating methodology: (a) the clin- 
perience or training of the rater, and 
the tater’s adjustment or level of effec- 
functioning. While the results of investi- 
of the effects of experience are, at 
, equivocal (e.g, Amble & Moore, 1966; 


y at the Division of Counseling, Pennsylvania 
ae Requests for reprints should be sent 
ohn R. Cannon, 504 Beaumont Drive, State 
Pennsylvania. 

‘authors wish to thank Tom Collingwood and 
for their assistance in organizing and 
the data and Ted Friel and Daniel Kra- 
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EFFECTS OF RATER LEVEL OF FUNCTIONING AND 
EXPERIENCE UPON THE DISCRIMINATION OF 
FACILITATIVE CONDITIONS 


JOHN R. CANNON? anp ROBERT R. CARKHUFF? 
State University of New York at Buffalo 


Y The ratings of 80 Ss, at four different levels of experience, at each of two levels 
of facilitative interpersonal functioning (moderately high and low), were 
analyzed. Both level of functioning and experience were found to have signifi- 
cant effects on accuracy of ratings, with level of functioning a more critical 
index of ability to discriminate levels of facilitative conditions. 


Arnhoff, 1954; Greenwood & McNamara, 
1967; Jones, 1957; Oskamp, 1962; Watley, 
1967), there is substantial evidence that rater 
adjustment or level of functioning has signifi- 
cant effects on the accuracy of person- 
perception ratings (Reid & Snyder, 1947; 
Taft, 1955; Vingoe & Antonoff, 1968; Watley, 
1967). 

The present study investigates the effects of 
both rater level of facilitative interpersonal 
functioning and rater level of experience upon 
the accuracy of discriminations of the facili- 
tative conditions of empathy, respect, genu- 
ineness, concreteness, self-disclosure, confron- 
tation, and immediacy (Berenson & Mitchell, 
1968; Carkhuff & Berenson, 1967; Rogers, 
1967; Truax & Carkhuff, 1967). 


METHOD 
Subjects 


Eighty Ss were selected from a pool of 251 Ss 
whose written responses to standard client stimulus 
expressions had been assessed by gross rating scales 
of facilitative interpersonal communication (see Fig- 
ure 1). The final 80 Ss included 20 Ss at each of 
four experience levels: (a) experienced counselors and 
psychotherapists; (b) graduate trainees in counseling 
and psychotherapy; (c) undergraduates with experi- 
ence in a helping role; and (d) undergraduates with 
no experience in a helping role. The 20 Ss at each 
experience level included the 10 highest rated and 
the 10 lowest rated respondents at that level of 


experience. 


Procedure 


Communication. A tape comprised of 16 client 
statements of no more than 2 minutes duration each 
(total playing time of approximately one-half hour) 
was prepared to present as stimulus material to 
elicit helper responses from Ss (Carkhuff, in press). 
The client stimulus expressions tape, representing 
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The facilitator is a person who is living effectively 
himself and who discleses himself in a genuine and constructive 
He communicates an accurate 
empathic understanding and a respect for all of the feelings 
of ether persons and guides discussions with those persons into 
He coumunicates confidence 


fashion in response to others. 


specific feelings and experiences. 


in what he is doing and is spontaneous and intense. 
while he is epen and flexible in his relationships with others, 
in his commitment to the welfare of the ether person he is 

quite capable ef active, assertive and even cenfrenting behavier 


when it is apprepriate. 


You will hear a number of excerpts taken from therapy 
sessiens. Rate each therapist response te each excerpt 1.0, 
165, 200, 265, 3.0, 3.5, 4.0, 4.5, or 5.0 using the continuum 


shown below. 


1.0 1.5 3.0 


2 
/ 4 


None of these Some of the All conditions All ef the All are 
conditions are conditions are communi- conditions communi- 
Communicated to are communi- cated at a min- are commun- cated 
any noticeable cated and imally facil- icated, and fully, 


degree in the 
person. 


Fic. 1. Gross ratings of facilitative interpersonal functioning. 


client statements from live counseling, crossed three 
dominant affect areas with five dominant content 
areas. The affect areas included the following: 
(a) depression-distress; (b) anger-hostility; (c) 
elation-excitement. The content areas included the 
following: (a) social-interpersonal; (b) educational- 
vocational; (c) child-rearing; (d) sexual-marital; 
(e) confrontation of counselor. The excerpts were 
arranged in such a way that each affect area was 
matched with each content area. In addition, a 
silence was included as a stimulus for eliciting 
Ss’ responses. 

All 251 Ss were presented with the 16 client state- 
ments and asked to write out their own best re- 
sponses to each of the client expressions of feeling 
and content. Communication ratings of Ss’ written 
responses were obtained from two experienced raters 
who had established high intra- and interrater relia- 
bilities (ranging from .80 to .99) in previous research 
employing individual scales for the counselor dimen- 
sions (empathy, respect, genuineness, concreteness, 
self-disclosure, confrontation, and immediacy). In the 
present research project, these raters were asked to 
employ the gross ratings form for assessing inter- 
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some are not. itative level. 


In addition, 


some are com- simultan- 
municated eously and 
fully. continually 


personal functioning (see Figure 1), the same form 
that Ss were to employ in making their discrimina- 
tions. The rate-rerate reliabilities of the two expeti- 
enced raters employing the gross ratings form were 
-95 and .93, and the interrater reliability was .89. 

Taking into consideration only Ss’ own mean levels 
of communication (as rated by the two experience 
raters), the 10 highest and 10 lowest rated 5s ¥ 
each level of experience were selected for the fina 
S groups. h 

Discrimination. A second tape, comprised of t 4 
16 client stimulus expressions and four possible 
counselor responses to each expression, was eae 
to present for Ss’ ratings of counselor responses. ts 
counselor responses, representing counselor statemen! j 
from live counseling, were designed so that the fou 
responses to each client expression demonstrati 7 
varying levels of facilitative interpersonal COM 
munication (Carkhuff, in press). x by 

The Ss’ discrimination scores were established i 
determining the mean absolute deviation of an H 
ratings from the consensus ratings of two asi 
(different from the raters of Ss’ communication) W3% 
in previous outcome studies, had demonstrat 


DISCRIMINATION OF FACILITATIVE CONDITIONS 191) 
= TABLE 1 
3 MEANS AND STANDARD DEVIATIONS OF COMMUNICATION AND DISCRIMINATION SCORES 
až 
Communication Discrimination 
x Ratings on 5- Deviations from 
Group Ss point scales experts 
t M SD M SD 
a 
I High-level experienced counselors and 2.79 AS 45 1S 
therapists 
Ir Low-level experienced counselors and 1.44 .28 -18 .20 
therapists 
TI High-level graduate trainees 2.93 .23 49 al, 
IV Low-level graduate trainees 1.58 09 1.03 23 
V High-level experienced undergraduates 2.25 :20 94 .20 
VI Low-level experienced undergraduates 1.18 .06 1.14 Sy 
EVI High-level inexperienced undergraduates 2.13 .08 94 16 
VIII | Low-level inexperienced undergraduates 1.17 05 1,19 .08 
yi All low-level Ss 1.34 .23 1.04 .24 
All high-level Ss 2.52 44 71 29 
All Ss 1,93 -69 87 31 
O 


lote.—N = 10 for each group; totals, all low-level Ss, N 


fh degree of predictive validity in their use of the 
ting scales (Carkhuff, 1968, 1969, in press; Carkhuff 
Berenson, 1967). 


RESULTS 


oup means for both communication (Ss’ 
of facilitative interpersonal functioning) 
iscrimination (deviations from criterion 
) appear in Table 1. It can be seen 
at each experience level, the low-level 
oup was functioning approximately one to 
and one-half communication levels lower 
the high-level group. 

The 4 X 2 analysis of variance (see Table 
dicated that Ss’ level of functioning, Ss’ 
ence, and the interaction between these 
Variables all were significant sources of 
lance in discrimination scores. 

l ple effects of level of functioning were 
uated with £ tests, and it was found that 
every level of experience, the high-level 
oning Ss were significantly (p < .05) 
aa in their ratings than the low- 
Ss. 

order to control for the confounding of 
ence with level of functioning, an analy- 
‘Covariance was conducted to assess the 
ndent effect of experience on discrimi- 
(covaried for level of functioning). 
computed from this analysis was highly 


o 


= 40, all high level Ss, N = 40, all Ss, N = 80. 


significant ( < .001), and, therefore, sepa- 
rate analyses of covariance and ¢ tests based 
on the adjusted means and adjusted error 
terms were conducted at each of the two 
levels of functioning. 

The results of these analyses revealed the 
following: (a) At both high and low levels 
of functioning, experience had a significant 
(p < .01) effect on accuracy of ratings; (b) 
among high-level functioning Ss, there were 
significant (p< .05) differences in rating 
accuracy between more experienced Ss 
(Groups I and III) and undergraduate Ss 
(Groups V and VII), but no significant dif- 
ferences either between the experienced coun- 
selors and the graduate students (I and IIT) 


TABLE 2 


SUMMARY OF OVERALL ANALYSIS OF VARIANCE 
OF DISCRIMINATION SCORES 


Source df | MS F 
Experience 3 | .9492 | 27.63** 
Level of functioning 1 | 2.2005 | 64.05** 

(communication) 
Experience X Level 3| .1168| 3.40* 
Within cells 72 | .0344 
Total 79 
=p Sci. 
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TABLE 3 


DISCRIMINATION DIFFERENCES BETWEEN HIGH-LEVEL FUNCTIONING SUBJECTS AND 
Low-Levet FUNCTIONING SUBJECTS (VALUES OF £) 


I IV VI VII 
Group Low-level Low-level Low-level Low-level 
experienced graduate experienced | inexperienced 
counselors trainees undergraduates | undergraduates 

I High-level experienced counselors —3.98%**+ —7.00*** —8.32*** —8.93*** 

III High-level graduate trainees —3.50°** —6.51*** —7.84*** —8.44"* 
V High-level experienced undergraduates +1.93 —1.09 —2.41* —3.02* 
VII High-level inexperienced undergraduates +1.93 —1.09 —2.41* —3.02** 


Note.—Negative numbers indicate high-level group more accurate than low-level group, Positive numbers indicate low- 


level group more accurate than high-level group. 
*p <05. 


+p < 01. 
> < 001. 


or between the experienced and inexperienced 
undergraduates (V and VII); (c) among low- 
level functioning Ss; experienced counselors 
(Group IT) were significantly ( < .05) more 
accurate in their ratings than all other groups, 
with no significant differences in rating accu- 
racy between graduate students (IV), experi- 
enced undergraduates (VI), and inexperienced 
undergraduates (VIIT). 

Considering the results of the simple-effects 
tests for both experience and level of func- 
tioning, it appears that the significant effect 
of the interaction of Experience X Level of 
Functioning (Table 2) reflects the differential 
effects of experience at the two levels of func- 
tioning rather than differential effects of level 
of functioning at the four levels of experience, 

Such an interpretation provides tentative 
support for the assumption that level of 
functioning is a more critical indicator of 
rating accuracy than level of experience; and, 
in order to further investigate the relative 
effects of experience and level of functioning, 
each group of high-level functioning Ss was 
compared to every group of low-level func- 
tioning Ss. Table 3 presents the numerical 
values and probabilities of the £’s computed 
from these comparisons. Perusal of the entries 
in Table 3 reveals the following: (a) two 
groups of high-level functioning Ss (I and 
TII) were significantly (p < .001) more accu- 
rate in their ratings than every group of low- 
level functioning Ss; (b) each of the remain- 

ing two groups of high-level functioning Ss 
(V and VII) was significantly ($ < .05) more 
accurate than two of the groups of low-level 


functioning Ss (VI and VIII), and not sig- 
nificantly different from the other two low- 
level groups (II and IV); (c) none of the 
groups of low-level functioning Ss, regardless 
of experience, was significantly more accurate 
in rating than any of the groups of high-level 
functioning Ss, 

Finally, correlations were computed be- 
tween level of functioning (communication) 
and discrimination at both high and low levels 
of functioning with the following results: (a) 
The Pearson r between communication and 
discrimination for all low-level functioning 
Ss was —.26 (ns); (b) the r between com- 
munication and discrimination for all high- 
level functioning Ss was —.76 (p< .001). 
Due to the methods of scoring these variables, 
a negative correlation indicates that the higher 
an S’s level of functioning, the more accurate 
are his ratings. 


Discussion 


The present findings indicate that rater 
level of interpersonal functioning and rater 
level of experience have significant effects 
upon the accuracy of ratings of interpersonal 
processes, with rater level of functioning ap- 
pearing to be the more critical indicator of 
rating accuracy. At this point in the develop- 
ment of process research, the results lend 
support to proponents of the criterion of 
meaning who suggest that the validity of 
rating scales is largely a function of the par- 
ticular raters who employ them (Carkhuff, 
1968; Lehmann, Ban, & Donald, 1965; 
Marsden, 1965). 


af 


DISCRIMINATION OF FACILITATIVE CONDITIONS 


The results appear to be inconsistent with 
those of Shapiro (1968) and others who indi- 
cate that neophytes rate as effectively as 
trained raters, and do so with gross rating 
scales that involve minimal effects of experi- 
ence, However, the present findings may ac- 
count for the results of such previous studies 
with the possibility that both the neophytes 
and the experienced raters may have been 
functioning at the same levels, whether high, 
moderate, or low. For example, in the present 
study, high-level experienced undergraduates 
were very similar in discrimination to high- 
level inexperienced undergraduates, and low- 
level experienced undergraduates were very 
similar in discrimination to low-level inexperi- 
enced undergraduates. 

On the other hand, the results of the pres- 
ent research also.imply that accuracy of dis- 
criminations may be largely determined by 
level of functioning of the rater regardless of 
whether he is a therapist, trainee, client, or 
undergraduate student; although, to be sure, 
one may be able to make diminishing proba- 
bility statements in this regard. For example, 
an experienced therapist in the low group 
(II) may discriminate neither as well as a 
graduate trainee (III) nor significantly better 
than experienced and inexperienced under- 
graduates (V and VII). 

The representativeness of the present 
samples is supported by the essential replica- 
tion of previous findings of average levels of 
interpersonal functioning at different experi- 
ence levels (Carkhuff, 1969; Carkhuff & 
Berenson, 1967). In addition, the differential 
findings concerning the intercorrelations of 
communication and discrimination within 
groups of high-level and low-level functioning 
Ss are consistent with those of Carkhuff, 
Kratochvil, and Friel (1968) indicating the 
independence of communication and discrimi- 
nation among low-level functioning Ss and a 
high degree of relationship between these 
variables among high-level functioning Ss. The 
situation, it appears, is not unlike the insight- 
action discrepancy often found with clients 
in counseling. That is, in general, the evidence 
indicates that nothing relates very much to 
anything else among low-level functioning Ss, 
while there are very consistent and stable 
relationships between relevant variables 
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among high-level functioning Ss (Anthony, 
1968; Carkhuff, 1969, in press; Greenbaum, 
1968; Greenberg, 1968). 

In summary, there is now empirical as well 
as experiential evidence to indicate that pros- 
pective raters of interpersonal processes 
should be selected on the basis of two criteria: 
(a) demonstrated high levels of functioning 
in the relevant area; and (b) experience in 
the relevant area. Of these two selection 
indexes, the evidence suggests that level of 
functioning is the more critical. 
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group-therapy patients on a psychiatric 


ward of a general hospital to the death of a group member who died in the 
hospital the day before the session in question took place. The circumstances 


of this patient’s death were ambiguo: 


us as to whether the death was due to 


suicide or accident. In view of the ambiguity of the situation, some members 
of the treatment team felt that group discussion would generate more emo- 
tional disturbance among the patients than it would reduce. However, it was 
concluded that discussion of death and open consideration of the feelings of 
guilt, anxiety, and hostility involved can be therapeutic in a group as well as 


in an individual setting. 


From its hush-hush status as a taboo topic, 
particularly with people who are dying or who 
are closely associated with the dying, death 
is beginning to emerge as a respectable subject 
for communication, A number of studies of 
attitudes toward death suggest that frank and 
open discussion of the subject by both doc- 
tor and patient is therapeutically desirable 
(Feifel, 1959; Kayton & Freed, 1967; Mar- 
golis, Meyer, & Louw, 1965; Shneidman, 
1963; Weisman & Hackett, 1961). The pres- 
ent study is concerned with the value of such 
discussion for a therapy group which had 
recently lost one of its members due to a 
sudden and violent death. The incident took 
place on the psychiatric ward of a general 
hospital. The body of one of the group mem- 
bers, a Mr. K, had been found in the base- 
ment of the hospital at the foot of a stair- 
case from which he had apparently fallen. 
There was no way to determine whether the 
fatal fall had happened through accident or 
otherwise. Thus the circumstances surround- 
ing the death were ambiguous. This study dif- 
fers from those previously mentioned insofar 
as it deals with a therapy group rather than 
an individual therapeutic situation and the 
death discussed had occurred under ambiguous 

1A brief version of this paper was presented 
at the Annual Meeting of the American Psycho- 
logical Association in San Francisco, September 1968. 
Requests for reprints should be sent to Joseph M. 


Sacks, Veterans Administration Hospital, 2615 Clinton 
Avenue, Fresno, California 93703. 


conditions. This element of ambiguity ap- 
peared to be very important in determining 
the reactions now to be described. 

The question of the therapeutic value in 
group therapy discussion of the recent death 
of a group member became an issue dividing 
the treatment team. Some staff members felt 
that, in view of the ambiguity of the situa- 
tion, such discussion would create further 
emotional disturbance among the patients and 
that the best way to handle the matter would 
be simply to drop it. Others felt just as 
strongly that the subject should be discussed 
openly. This latter approach is congruent with 
that of Feifel (1963) who maintains that 


Not only patients, but most of us feel less apart 
when we can share feelings and thoughts about 
dying and death ... the opportunity to discuss 
their feelings helps subdue irrational fears and guilt 
sentiments associated with thoughts of death [p. 17]. 


While Feifel is here concerned with patients 
with terminal illness, we observed similar 
reactions in psychiatric patients following the 
death of a therapy-group member. 

The group meeting to be discussed took 
place on the morning following the mishap, 
at which time the County Coroner’s Office was 
still investigating the case. However, later 
that same day the coroner’s office stated that 
the cause of the fall would be listed as 
“ynknown.” K had been in the hospital ap- 
proximately 6 weeks and most of the patients 
in his group knew him personally, having 
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been on the ward with him for at least 2 
weeks, On the morning following K’s death 
the group therapy session was late in starting. 
Patients had to be rounded up and seemed 
to be dragging their feet coming in. Six pa- 
tients were present, two staff psychologists 
(Dr. O and Dr. J), a psychologist-in-training 
(Dr. D), and a nursing assistant (Mr. F). 
When the session began the entire room 
was permeated with an air of gloom and no 
one seemed to be willing to initiate any 
activity. After a few moments of silence, Dr. O 
commented that staff as well as patients were 
feeling very badly about K’s death. This 
seemed to lift the lid of silence which had 
been covering the group. When the patients 
began talking it became apparent that most 
of them believed that K had committed 
suicide. They felt that the staff knew exactly 
what had happened, that there had been a 
suicide, that it occurred because the staff 
had been negligent, and that the staff was 
trying to cover up its mistakes by passing 
. off the death as accidental. As one patient, 
a Mr, T, stated, 


This whole thing stinks like shit and you guys are 
trying to bury it in sugar. You think that'll make 
you look better, but shit still stinks no matter 
how much sugar you dump on it, so you're not 
going to hide anything. You think you’re protecting 
yourselves, but you’re not. You're letting yourselves 
down worse than you are us because everybody’s 
going to know what happened anyhow, so lay off 
with all that sugar. 


The group’s expression of hostility toward 
staff was considerable, although T was far 
more loquacious and direct in his attacks than 
any other group member. Only two patients 
appeared to consider seriously the view that 
K’s death had been an accident. The rest of 
the patients indicated an uncomfortable degree 
of doubt regarding staff competence and com- 
mitment. In effect they said that the staff was 
not genuinely interested in patient welfare for, 
if the staff were so interested, the tragic death 
of K could have been prevented. In other 
words, had the staff helped K more, or even 
if they had kept closer watch over him, he 
would probably still be alive. When Dr. J 
questioned whether one can be prevented from 
committing suicide when he is bound and 
determined to carry out the act, the patients 
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became highly irritated and said that the staff 
was merely making up excuses for itself. It 
should be pointed out that these patients 
were on an open ward. This meant that pa- 
tients were free to go to their various clinic 
appointments, to the dining room on another 
floor, to occupational therapy, recreation hall, 
etc. They definitely wanted to keep it this 
way, yet they seemed to be saying that they 
also wanted constant surveillance and super- 
vision by staff. The dilemma inherent in pro- 
tection of the suicidal patient in this type of 
open therapeutic setting has been discussed 
by Margolis et al. (1965). 

The belief that K had committed suicide 
was not unfounded as he had recently made an 
abortive attempt to shoot himself. K’s reac- 
tions generally were such as to predispose him 
to this type of resolution of his situation. K 
was 59 years of age, although he looked con- 
siderably older. He was becoming increasingly 
worried about his health and financial prob- 
lems. He had been suffering recurrent episodes 
of depression with anxiety for at least 5 years. 

After a period of apparent improvement, K 
was scheduled to meet with the staff to discuss 
a possible discharge. K was seemingly in great 
conflict over whether he should or should not 
be discharged. He had confided to other pa- 
tients that he was very anxious about leaving 
the hospital because he did not feel that he 
was as yet up to managing the frustrations his 
job would be imposing on him, and when he 
asked Dr. O for his opinion in the matter, Dr. 
O told him that he felt K was not yet ready to 
leave. However, there were numerous times 
when K verbalized a desire to leave anyway. 
On the morning of the weekly staff meeting 
he had apparently decided he wanted to be 
discharged, but this decision had ‘yet to be 
discussed with the staff in conference. When 
K’s name was called to enter the conference 
room, he could not be found. He was nowhere 
on the ward. Later he was found dead in a 
pool of blood lying at the foot of the stairs 
leading to the basement. His skull was frac- 
tured in several places and he had apparently 
died instantly from the injuries received in 
his fall. 

The patients in the group, especially T, 
angrily continued to support the view that K 
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had committed suicide and seemed to resent 
any other interpretation, such as that the 
death had been accidental or simply that the 
circumstances leading to death were unknown, 
the latter actually being the most conserva- 
tive and reasonable judgment in light of the 
facts then available. Then T challenged those 
staff members present by asking them whether 
they felt they had done all they could have 
done for K and the staff members replied 
that they had been searching their consciences 
and asking themselves if there was anything 
they had done or failed to do which might 
have prevented the incident. They recalled 
occasions on which K approached them when 
they had appointments with other patients. 
He could have felt rejected when they ex- 
changed only brief comments with him or told 
him they would see him later. 

Then rather suddenly group activity turned 
in a different direction when Dr. O asked the 
patients whether they felt that they had done 
all they might have in order to help K. Had 
they themselves any guilt regarding their own 
treatment of K? When the patients were thus 
confronted, their hostility began to dissipate 
and a considerable amount of guilt was ad- 
mitted to, especially by Mr. T and Mr. S. The 
patients generally stated that they felt rather 
badly because they did know they might have 
been a little more helpful to K than they ac- 
tually were, but S seemed to feel worse than 
the others, excepting T. S said that on the 
very morning of the mishap K had approached 
him in a friendly manner, apparently because 
he felt in need of a little companionship and 
conversation, but at the time S did not feel 
like talking; he was preoccupied with his own 
problems and simply wished to be left alone. 
Consequently, he turned his back on K and 
walked away from his without saying one 
word in answer to K’s friendly advance. Fur- 
thermore, T, who up to this time had been the 
most vociferous in attacking the staff, stated 
that he had been seriously troubled about 
something he said to K during an earlier group 
session, the session during which K described 
his attempted suicide. On that particular oc- 
casion T expressed himself in a pessimistic 
and defeatist manner while K was in the midst 
of relating his feelings during and after the 
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attempt. T said, “Well, if that’s how a guy 
feels, he ought to go ahead and do it. I can’t 
blame you. If you feel that way about life, 
there’s nothing to do but blow your brains 
out. Hell, you can’t win ’em all.” On the day 
of K’s death, a short time after the death 
was announced, T recalled this bit of mis- 
directed advice and became agitated and pre- 
occupied with the thought that maybe these 
very words had some part to play in helping 
K come to the point where he could make the 
fatal decision. He went to see Dr. O to talk 
over his feelings in the matter. In the group 
meeting the following day T stated that he 
still felt a high degree of guilt on account of 
what he had said to K, though at this time he 
said that he did not think it had any real 
influence on K’s behavior. Thus, it was possi- 
ble that T’s strong hostility stemmed from an 
acute and intense guilt reaction projected 
upon the staff. Such projection was character- 
istic of T’s general behavior. 

Aside from realistic concern and guilt, a 
great deal of anxiety seemed to underlie the . 
hostility which had been expressed in the 
group. A Mr. J, who appeared to be the most 
openly anxious of the group members, began 
toward the end of the session to talk about 
how his mother might react to K’s death if she 
were to learn of it. J was afraid that if he told 
his mother of the incident she would become 
highly upset, as K might have been a member 
of the family and she was always disturbed 
by the death of a family member. However, 
J’s concern in this matter seemed to be a 
curious concern since, if K was related to him, 
it was only in a distant way and, furthermore, 
J’s mother did not know K at all nor had she 
ever seen him. Nonetheless, J did appear to 
be disturbed and it seemed that the talk about 
his mother was merely a groping effort on his 
part to find some sort of rationalization to 
help bind his anxiety. Anxiety was suggested 
by patients’ verbalizations generally. It was as 
though they were continually saying, “If 
something horrible like that could happen to 
K, then what might happen to me in this 
place? What kind of treatment am I getting 
anyway? Whatever K got sure didn’t do him 
any good and maybe it even hurt him.” In 
other words, K’s mishap had shaken the pa- 
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tients’ credibility in the efficacy of the ward’s 
treatment program, of which previously they 
had probably held a higher opinion. Their 
new and unexpected perceptions of their situa- 
tion may have produced conflict and anxiety 
regarding possible modes of reaction to the 
new experience. 

By the end of the session the group still 
did not appear to have reached adequate 
closure with regard to the events leading up 
to K’s death. Hardly any of the patients 
were satisfied with the “unknown” theory 
which was essentially the position put forth 
by the staff. At this point, Dr. D presented 
the hypothesis that K’s death may have been 
an unconsciously motivated suicide. Even if 
K had died accidentally, there still may have 
been a “self-destructive” element at work. 
Everyone knew that K was still depressed and 
had previously thought of suicide. Even 
though K had recently been trying to over- 
come his depression, this effort may have been 
taking place at the conscious level only, while 
unconsciously self-destructive impulses may 
have been actively at work. K’s excessive 
apologizing about his attempted suicide sug- 
gested a reaction formation to the original 
consciously felt death wish. Assuming that K 
did have an accident, factors such as the 
above could have led him to be more accident- 
prone than he otherwise, under more favor- 
able conditions, would have been. Thus, the 
interpretation, somewhat paradoxical at first 
glance, but perhaps less so after further in- 
spection, was that the mishap had been an 
“accidental suicide.” Patients generally, as 
well as staff present, seemed to find this view 
plausible and a fair degree of closure on the 
matter was thus obtained. This interpretation 
is consonant with Shneidman’s concept of sub- 
intention. In seeking to establish a more 
scientific classification of the modes of death 
than the currently prevalent one of “natural, 
accidental, suicidal, or homicidal,” Shneidman 
(1964) speaks of basic attitudes toward cessa- 
tion of life as “intentioned, subintentioned, 
and unintentioned.” He relates the subinten- 
tioned orientation to instances in which the 
individual plays an indirect, covert, partial, or 
unconscious role in his own demise. Such was 
the thesis presented to the group by Dr. D 


when toward the end of the session the group 
still did not appear to reach adequate closure 
with regard to the events leading to K’s fatal 
fall. 

On the next meeting of this group 2 days 
later, patients made no mention of K. K’s 
name was not brought up until the end of the 
session and it was Dr. O who mentioned it. He 
asked the patients whether they still had feel- 
ings about K’s tragedy and expressed a degree 
of surprise that no one had anything to say 
about it. Some of the patients said they had 
been trying not to think of the incident and 
were still somewhat bothered by it, but gen- 
erally the group did not appear to be willing 
to launch into another discussion on K. Either 
they had sufficiently ventilated their feelings 
during the previous session and no longer had 
any need to talk them out or they were con- 
trolling their anxiety through attempting to 
repress the whole incident or the newspaper 
report concerning the coroner’s judgment that 
the mode of the death was accidental and 
circumstances leading to death were simply 
unknown (a judgment which prior to the coro- 
ner’s report they would not accept from the 
staff) had given them final closure on the sub- 
ject. To some extent all of these factors were 
probably present. 

The study of Kayton and Freed (1967) of 
the effects of a suicide in a psychiatric hos- 
pital showed strikingly similar reactions to 
those of our patients. In both a closed ward of 
disturbed patients and in an open, milieu- 
oriented ward, patients were angry with staff 
for invoking the pain of insecurity by not 
preventing the suicide. This reaction appeared 
to be aggravated on the closed ward when the 
usual staff-patient meeting was delayed sev- 
eral days. On the ward on which the suicide 
actually occurred, the psychiatric resident con- 
ducted a meeting with the patients that same 
evening. Interestingly, in this situation the 
physician’s manifest anxiety evoked protective 
reactions by the patient group who reassured 
him that nothing could have been done to 
prevent the suicide. Unlike our patients, and 
those of the other two wards described, these 
patients focused on the deceased as the cause 
of their insecurity and utilized mechanisms of 
denial in resisting any further discussion about 
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the incident. In their therapeutic recommenda- 
tions, Kayton and Freed suggest that a pa- 
tient-staff meeting should be convened as 
rapidly as possible following such an incident, 
and that the group leader should supply the 
specific facts of the incident to the group. 
They conclude that “as it becomes apparent 
that the stress is dissipating, no further pa- 
tient group ventilation is indicated unless it 
is expressly desired by the patients [p. 194].” 
Contrary to the opinions of some staff 
members, open group discussion of the death 
did not generate any increase in emotional 
disturbance. If anything, the opportunity to 
express feelings and to approach a consensus 
about the cause of death, seemed to have pre- 
vented a build-up of tension in this group. 
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Schizophrenic and normal groups were compared on their speed in recognizing 
the identity of pictorial stimuli which gradually came into sharper focus from 
an initially blurred appearance. The chronic schizophrenic group showed slower 
recognition than the acute schizophrenic or normal groups, but the groups did 
not differ in amount of prerecognition hypotheses or time of first response. The 
differences in perceptual recognition speed seem related to certain clinical 
differences between the acute and chronic schizophrenic, particularly with re- 
gard to the impact of external perceptual stimuli vs. internally generated idea- 


tion, 


The nature of the cognitive disorder in 
schizophrenia has been extensively described 
and studied from many vantage points. How- 
ever, the specific mechanisms and mediational 
processes contributing to this dysfunction are 
still poorly understood. Several recent con- 
ceptualizations of schizophrenia (McReynolds, 
1960; Silverman, 1964) have pointed to the 
possibility that basic systems, or parameters 
of information processing, may be altered in 
schizophrenia in ways that predispose the 
schizophrenic toward his distorted and bizarre 
interpretations of external and internal stim- 
uli. McReynolds (1960), for example, pro- 
poses that the schizophrenic’s cognitive anom- 
alies may result from inability to assimilate 
incoming perceptual input rapidly enough, 
with consequent disruptive anxiety and break- 
down of reality-based perception and cogni- 
tion. Silverman (1964) reviewed studies of 
various “cognitive control” dimensions and 
proposed a schizophrenic deficit in a “sensory 
input processing-ideational gating” mecha- 
nism which involves alterations in modes of 
information reception and evaluation. Al- 
though the specific conceptualizations offered 
by these writers still remain to be empirically 
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evaluated, they are of importance in focus- 
ing experimental interest on the operating 
characteristics of central perceptual-cognitive 
processes, qua process, in contrast to the 
more traditional attempts to identify mainly 
peripheral performance differences between 
schizophrenic and nonschizophrenic groups. 

Clinical observations have provided vivid 
examples of the extent to which the schizo- 
phrenics’ anxieties and fears of being over- 
whelmed are intensified and perhaps activated 
by the distortions of meaning which are 
assigned to various environmental occurrences 
—or, in other terms, to aberrant perceptual- 
cognitive information processing and categori- 
zation. Thus, it would appear that a crucial 
aspect of these processing systems is the stage 
in which meaning, or object-event identity, 
is assigned to perceptual input; that is, when 
a perceptual array is “recognized” in terms 
of the individual’s categories of reference. 
Furthermore, since perceptual activity can 
almost always be thought of as a temporally 
extended sequence of events (Smith, 1958), 
even when the temporal interval is no longer 
than in tachistoscopic exposures, it seems 
important to study the pattern or sequence 
through which various final perceptual re- 
sponses pass (see Flavell & Draguns, 1957, 
and Werner, 1956, on the “microgenetic” 
aspects of perception). This paper describes 
a study in which the perceptual recognition 
process in schizophrenics was investigated 
with the aim of determining characteristic 
features in the temporal pattern of process- 
ing perceptual input which might further 
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elucidate modes of cognitive dysfunction in 
schizophrenia. 

Under normal conditions ‘of perceiving, 
however, the steps in the temporal develop- 
ment of a perceptual response are difficult to 
isolate, partly because of the rapidity of the 
process and the presence of highly redundant 
cues. In tachistoscopic threshold procedures, 
it is possible, of course, to slow down and 
control the recognition process but only by 
transforming the naturally occurring continu- 
ous perceptual influx into a series of discon- 
tinuous, discrete stimulus presentations. In 
this study the aim was to “slow down” the 
perceptual recognition process by controlling 
the presentation of input, but, at the same 
time, to maintain continuous input during the 
experimental exposures. For these reasons a 
perceptual recognition technique developed by 
Bruner and Potter (1962, 1964) which allows 
the presentation of increasing amounts of 
stimulus information in a continuous manner 
was employed. In this procedure, S views 
photographic slides which are initially pre- 
sented extremely out of focus, and which are 
continuously brought into sharper focus over 
a given time interval during which S attempts 
to identify the content of the slide. The vari- 
ables measured were the point of correct 
recognition of the picture, the number of re- 
sponses given prior to correct recognition, 
and the point of first response. These vari- 
ables may be regarded as aspects of the medi- 
ating steps and transformations of data 
between perceptual input and response, 
and thus are pertinent to the activity of 
perceptual organizing processes. Previously, 
Davis and Cullen (1958) studied perceptual 
recognition in schizophrenic, neurotic, and 
normal groups using a somewhat similar pro- 
cedure in which photographs were presented 
in increasing clarity by manipulating illumi- 
nation level. They found significantly ele- 
vated recognition thresholds for schizophrenics 
versus neurotic and normal groups. These 
data, however, are based upon responses to 
only two photographs with unsystematically 
varied rate of illumination change, so that 
the stimulus presentations are not strictly 
comparable. 

More recently, Draguns (1963) studied the 
responses of schizophrenics to perceptually 
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ambiguous material by requiring recognition 
of drawings which were blurred in varying 
amounts and then presented in a series from 
very blurred to sharp. He found that chronic 
schizophrenics gave earlier recognition re- 
sponses and more perceptual recognition errors 
than acute schizophrenics or normals, and 
concluded that chronic schizophrenics tend 
toward more premature and idiosyncratic 
structuring of incomplete perceptual data. It 
should be noted, in line with the above 
comments regarding the discrete nature of 
tachistoscopic procedure, that Draguns’ task 
also involved discrete stimulus presentations, 

While the present study is addressed to 
similar questions as the Draguns (1963) and 
Davis and Cullen (1958) studies, the method- 
ology employed here allows for more control 
over the perceptual information presented the 
S, while retaining the feature of continuous 
perceptual stimulation during each trial. Both 
chronic and acute schizophrenic groups were 
studied. 


METHOD 
Subjects 


Chronic schizophrenic and acute schizophrenic 
groups were chosen from patients in a state mental 
hospital, and a training and research hospital. Pa- 
tients were included only if they had a staff diag- 
nosis of schizophrenic reaction, at least eighth-grade 
education, and revealed no evidence of organic fac- 
tors in their clinical condition. Almost all the patients 
were receiving psychoactive medication which it was 
not possible to discontinue for purposes of the study. 

Two control groups were formed, one consisting 
of state hospital nursing assistants and maintenance 
personnel. This group, however, did not seem a fully 
appropriate control group for the acute schizophrenic 
group which tended to be younger and more highly 
educated. Accordingly, a second control group of 
university students was formed. Thus, the four 
groups which were finally seen were as follows: 

Group 1. Chronic schizophrenic: 20 male patients 
between 21 and 40 years of age (mean age, 33.2 
years), mean educational level of 10.8 years, and 
mean length of hospitalization of 7.8 years. 

Group 2. Acute schizophrenic: 20 male patients 
between 18 and 42 years (mean age, 27.4 years), 


‘mean education of 13.1 years, and mean length of 


hospitalization of 1.9 months. 


8 Appreciation is extended to Calvin Chen, Director 
of Research, Northville State Hospital, Northville, 
Michigan, and his staff for their cooperation in pro- 
viding facilities and Ss for this study. 
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Group 3. Hospital control: 20 male Ss between 18 
and 40 years (mean age, 28.7 years), and mean 
education of 12.7 years. 

Group 4. College control: 20 male Wayne State 
University students between 18 and 44 years (mean 
age, 21.8 years) and mean education of 13.1 years. 


Apparatus 


The stimuli were presented from a Kodak Carousel 
slide projector which was modified by attaching a 
gear system and motor to the shaft of the focusing 
mechanism on the projector. This system made it 
Possible to control the speed of the forward or 
backward movement of the lens to focus the picture. 
A rheostat attached to the motor allowed fine adjust- 
ment of the lens movement. In this experiment, the 
lens moved at a constant speed from grossly out of 
focus to the point of sharpest focus in 120 seconds. 

The stimuli consisted of ten 35-mm, color slides of 
the following objects: fire hydrant, female fashion 
model, telephone, highway, books, statue (Rodin’s 
Thinker), pieces of silverware, baseball player, man 
and woman in a dancing pose, several carpentry 
tools, For each S the slides were presented in the 
above order, and were arranged in the slide tray 
with a cardboard blank between each slide. 


Procedure 


The S was seated at one end of a long table facing 
a rear projection screen immediately in front of him. 
He was given the following instructions: 


We are trying to find out more about the way 
people see things and learn to Tecognize what they 
see. In order to do this, we need to have the help 
of many different people. That’s why we are asking 
people in this hospital to help us with this experi- 
ment, The results of this experiment will not go 
on your record and will not in any way influence 
your stay in the hospital. 

This is a screen and behind it are a slide pro- 
jector and some equipment to operate it. I’m going 
to show some pictures on this screen, At first they 
will be very blurred and out of focus and you 
probably won't be able to tell what they are, but 
they will gradually get clearer and clearer until 
you can definitely tell what they are. What I would 
like you to do is to look at the picture and try 
to identify it as it is coming into focus, You may 
try to guess what it is at any time, even if you 
are not sure. Also, while the Picture is coming 
into focus I would like you to describe what you 
see even though you can’t tell what it is. You 
may talk about it in any way and remember that 
you may guess what it is at any time. Do you 
have any questions? I’m going to give you a 
Practice picture first, Be sure to say all your ideas 
about the picture while it is coming into focus. 
When I say “ready,” start looking at the screen 
and I'll put on the first picture. 


As soon as each slide appeared on the screen, the 
projector lens began moving toward sharper focus. 
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The S’s responses were tape-recorded and measured 
in terms of seconds elapsed from stimulus onset until 
the response was spoken. It was also Possible to 
note the amount of lens movement to the time of 
the response. Each slide was viewed for the full 
2-minute exposure time even when it had been 
recognized at some intermediate point in the exposure 
interval. 


ReEsULTS 


The data were analyzed with regard to the 
variables of time of correct recognition, time 
of first response, and number of responses 
given prior to the correct recognition response. 
In computing the statistical distributions, it 
became apparent that there was marked het- 
erogeneity of variance among the groups on 
the various pictures, which precluded an ef- 
fective factorial ANOVA across all groups 
and pictures simultaneously. Since variance 
transformations did not sufficiently reduce the 
heterogeneity, it was decided to separate those 
Pictures showing the smallest variance among 
the groups from the remaining pictures, and 
to perform separate variance analyses for 
these two groups of pictures. The justification 
for this procedure is that because the pictures 
differed markedly in their perceptual proper- 
ties and complexity as they were coming into 
focus, one can consider them as comprising 
subsets of differing perceptual difficulty. 
Grouping the pictures according to the amount 
of variation in performance which they pro- 
duce would appear to be a meaningful way 
of recognizing these different perceptual 
characteristics. In this way, separate analyses 
were done on Pictures 1, 3, 4, 6, and Pictures 
2, 5, 7, 8, 9, 10 for each dependent variable. 
Table 1 shows the mean performance scores of 
each S group, and Table 2 contains the results 
of the analysis of variance for each group. 

Recognition response. The results of the 
analyses for both picture groups (A and B) 
were consistent in revealing significant (p< 
01) differences among the four S groups in 
mean recognition time, as well as marked 
differences between pictures ($ < .01) in both 


*The picture content of these two groups is as 
follows: Group A—Picture 1: fire hydrant, Picture 
3: telephone, Picture 4: highway, Picture 6: statue. 
Group B—Picture 2: female fashion model, Picture 
5: books, Picture 7: pieces of silverware, Picture 8: 
baseball player, Picture 9: couple dancing, Picture 
10: carpentry tools, 
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Picture Groups A and B. This large variation 
among the pictures is mainly attributable to 
the quite varied perceptual qualities of the 
separate pictures. The interaction of Pictures 
x S Groups reaches significance (< .05) 
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TABLE 1 


MEAN Scores on RECOGNITION RESPONSE Time, 
First RESPONSE TIME, AND NUMBER OF 
PRERECOGNITION RESPONSES 


only with Pictures 1, 3, 4, 6. Individual com- Recognition | First | recognition 

parisons showed that the chronic schizophrenic pA in seconds) | (in seconds) | (humber) 

group differed significantly from each of the 

other three S groups which in turn were not A |B] Al] B |] Al B 

appreciably different among themselves. Al- Chronic 

though the significant interaction term in Auton 94.62 | 95.84 | 30.43 | 36.18 | 3.21 | 2.82 

Picture Group A suggests that the S groups’ ésnitat'controt |6407 |2313] ene | Sens | 2:82 | 373 
College control 54.22 | 81.43 | 9.41 | 15.03 | 3.44 | 4.60 


recognition performance should be considered 
in relation to specific pictures, the magnitude 
of this interaction effect was clearly over- 
shadowed by the variation attributable to the 
diagnostic classifications and the general be- 
tween-picture differences. The most note- 
worthy aspect of this analysis would appear to 
be the wholly consistent pattern that on each 
of the 10 pictures the chronic schizophrenic 
group had the longest mean recognition time. 

First response time. Analysis of this vari- 
able disclosed a pattern similar to the recogni- 
tion response data. The S groups differed sig- 
nificantly in time of first reponse in both 
picture groups (p < .05 for Picture Group A, 
p < .01 for Picture Group B) with individual 


Note.—Entries in A columns refer to mean scores over 
Pictures 1, 3, 4, 6. Entries in B columns refer to mean scores 
over Pictures 2, 5, 7, 8, 9, 10. 


comparisons indicating that the mean first 
response time of the chronic schizophrenic 
group was significantly longer than in each 
of the other three groups. 

Prerecognition responses. There were no 
significant differences among the S groups 
in mean number of responses given prior 
to correct picture recognition, although 
the number of prior responses given to spe- 
cific pictures did show significant variation 
(p< .05 for Picture Group A and p< .01 
for Picture Group B). 


TABLE 2 
ANALYSIS OF VARIANCE OF RESPONSE VARIABLES 
Recognition response First response time aaen 
Source df 
MS F MS F MS F 
Picture Group A j 
Between Ss 9 
Diagnosis 3 27597.27 | 32.62** | 5930.59 3.99* 5.14 21 
P Error between 76 846.27 1485.53 24.75 
Withi 240 
Bea 3 23442.57 | 37.82** | 4097.53 10.75** 11,70 3.48* 
Diagnosis X Pictures 9 1309.52 2.11* 264.79 -69 2.73 81 
Error within 228 619.85 
Picture Group B y 
Betw 9 
Teel 3 7324.83 | 21.85** | 9810.88 3.04* 90.70 2.32 
ae Error between 76 335.12 3223.57 39.07 
i Withi 400 
Paine 5 6846.92 | 42.58** 532.06 1.50 15.39 5.09 
Diagnosis X Pictures 15 249.21 159 670.30 1.89 4.74 1.72 
Error within 380 160.82 355.42 
* 
#5 Sot 
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Discussion 


The attainment of correct recognition in 
this experimental procedure is dependent 
upon the continuous reception and matching 
of incoming perceptual data, or cues, against 
the internal organizations and categories in 
terms of which past experience is represented 
and stored. This cue-category matching proc- 
ess (Bruner, 1957) presumably eventuates 
in the formation of hypotheses which express 
the S’s momentary estimate of what the per- 
ceptual object is. The hypotheses are based 
upon some weighted combination of immedi- 
ately impinging stimulus cues and attributes 
drawn from memory sources. It is plausible 
to consider that the longer the stimulus 
must be viewed before recognition occurs, 
the influence of the immediate stimulus fac- 
tors in the hypotheses chosen becomes greater 
and more compelling as the features of the 
picture become sharper. With this in mind, 
the finding in the chronic schizophrenic 
group of significantly longer recognition times 
—which can be taken as an index of the 
amount of stimulus information required 
for accurate perceptual categorization—sug- 
gests that the chronic phase of the schizo- 
phrenic disorder may be characterized by a 
relatively higher threshold for assimilation of 
cognitively meaningful material, or in other 
terms, a diminished capacity for utilizing in- 
complete perceptual cues. The finding of sig- 
nificantly longer first response times in the 
chronic schizophrenic group would lead to a 
similar interpretation. Moreover, since the 
chronic schizophrenics did not give fewer pre- 
recognition responses than the other groups, 
it would seem difficult to account for their 
delayed recognition times on the basis of such 
factors as general response inhibition, apathy, 
or inability to attend to the stimuli. While it 
is conceivable that the slower recognition 
time of the chronic schizophrenic group might 
be partially attributed to lessened contact 
with the objects in the slides due to their 
institutionalization, there is little ground for 
considering this to be a crucial variable. In 
fact, one might consider that although there 
may be less actual contact with these materi- 
als in the hospital environment, the great 
amount of time that these patients spend 
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watching television provides them with a 
variety of object stimulation. It is also impor- 
tant to note that observation of the chronic 
schizophrenic Ss while doing the task seemed 
to indicate a high degree of interest, involve- 
ment, and sustained attention on their part. 
Consequently, it appears consistent and valid 
to interpret the differences in recognition re- 
sponse time, first response time, and number 
of prerecognition responses as arising from 
variations in the manner in which ambiguous 
perceptual information is treated by the S 
groups. 

The significant main effect for pictures in 
both Picture Groups A and B indicates that 
perceptual recognition time varied signifi- 
cantly among the pictures in each group, 
among all S groups. This variation is to be 
expected in view of the marked differences 
in perceptual properties of the 10 pictures as 
they are coming into focus, in such factors 
as colors, contours, and spatial distribution 
of the elements of the picture. However, the 
various picture differences did not seem to 
exert any strongly selective effect upon the 
S groups, as seen by the generally nonsignifi- 
cant interaction terms. 

It is of interest that the acute and chronic 
schizophrenic groups fell in marked contrast 
to each other on recognition time, with the 
acutes showing the most rapid recognition of 
all groups on six of the 10 stimulus pictures. 
This pattern, although not entirely consistent 
across all stimuli, warrants the suggestion 
that the acute schizophrenic can more readily 
assimilate and match incoming perceptual 
cues to reference categories, while the chronic 
schizophrenic cannot as efficiently transform 
incomplete perceptual data into accurate 
object representations. 

Consideration of certain clinical differences 
between acute and chronic schizophrenics may 
make these differences in perceptual perform- 
ance more understandable. Specifically, in the 
acute stage the schizophrenic is typically in 
a state of intense anxiety as he experiences 
the new and alarming phenomena of the psy- 
chotic disorganization, with greater fluidity 
of perception and loss of stability of the 
previously familiar and reliable object world. 
Tn this profoundly frightening condition, the 
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individual is still trying to regain control of 
the object world and is sensitively attuned 
and responsive to perceptual input, which 
must be immediately evaluated in terms of 
its aversive or nonthreatening qualities. In 
contrast, the chronic schizophrenic has typi- 
cally reached a somewhat stable resolution 
in which anxiety is decreased, bizarre per- 
ceptual and ideational material is familiar, 
well established, and no longer terrifying, and 
the “patient seems to have accepted his ill- 
ness [Arieti, 1955, p. 380]” and turned away 
from external reality into fantasy preoccupa- 
tion. From these clinical considerations, it 
would be expected that the acute schizo- 
phrenic—still trying to master external real- 
ity, and therefore still more attuned to 
external input, would show shorter perceptual 
recognition times, and that the chronic schizo- 
phrenic, because of the preoccupation with 
internally generated ideation would require 
more intense or compelling external stimula- 
tion to bring about perceptual recognition. 
This formulation does not necessarily imply 
that the chronic schizophrenic would show 
general deficits in thresholds for awareness 
of less cognitive sensory input (e.g., light, 
sound, touch) but it does imply that the 
processing of cognitive input—the evaluation 
and matching of cues with cognitive cate- 
gories—seems differentially affected in the 
two phases of schizophrenia. This formulation 
of the perceptual recognition deficit in the 
chronic schizophrenic bears some similarity 
to the “schizophrenic defense mechanism” of 
“sensory input processing-ideational gating” 
proposed by Silverman (1964) in which the 
schizophrenic tends to “gate out” or mini- 
mally respond to disturbing ideational input 
while being able to deal more adequately with 
sensory input. 

While it would seem desirable to compare 
the findings of this study with Draguns’ 
(1964) study of perceptual recognition in 
schizophrenics, particularly since he reports 
earlier recognition judgments by chronic 
schizophrenics than acute schizophrenics or 
normals, several methodological differences 
render comparison difficult. For one thing, 
Draguns’ “recognition response” includes both 
correct and incorrect recognition statements 


so that his measure is essentially one of 
initial responsivity regardless of perceptual 
accuracy. Further, his conceptualization prin- 
cipally concerns the schizophrenics’ “erroneous 
idiosyncratic structuring of ambiguous materi- 
als.” His findings were that “chronic schizo- 
phrenics made earlier perceptual judgments as 
well as more perceptual recognition errors,” 
which suggested “a preponderance of endoge- 
nous factors in such judgments over exogenous 
object determined ones [p. 26].” However, 
with regard to the point in the series at which 
veridical object recognition would have been 
attained, his chronic schizophrenic group 
may have required more data than the other 
groups. It is not possible to directly determine 
this from his data since he allowed only one 
recognition response per picture series. In the 
present study, the perceptual recognition re- 
sponse refers only to veridical recognition of 
the object, in which the “exogenous object 
determined factors” take precedence over 
idiosyncratic endogenous ones. Further, the 
differing modes of stimulus presentation—in 
Draguns’ study, discrete trials, and in this 
study, a continuously viewed and changing 
perceptual array—present somewhat different 
information-processing situations. It would 
be of interest to employ Draguns’ stimulus 
materials in the continuous presentation 
procedure to assess the importance of this 
difference. 

The results of this experiment suggest that 
the particular methodology employed is well 
suited to the study of the temporal process- 
ing of perceptual information. However, fur- 
ther refinements are necessary to more clearly 
identify parametric aspects of the perceptual 
recognition process. For example, it would be 
highly useful to have some quantitive mea- 
sure of the perceptual information available 
at various stages of focus, perhaps with cer- 
tain standard pictures. Also, it would be of 
great interest to know the manner in which 
the stimulus display is scanned prior to correct 
recognition by studying eye-movement pat- 
terns. Studies such as these would afford a 
more detailed examination of the component 
processes in perceptual recognition and of 
the mechanisms through which the schizo- 
phrenics’ anomalous cognitions may operate. 
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PSYCHOTHERAPY PROCESS AS A FUNCTION OF THE 
TIME SEGMENT SAMPLED 


NORMAN J. KARL 1 ann NORMAN ABELES 
Michigan State University 


A content analysis of individual psychotherapy tapes of 25 female college stu- 
dents was employed to determine the relative frequencies of certain response 
measures during the first, second, third, fourth, and fifth 10-minute segments 
of the therapeutic hour, An attempt was made to test the assumption made 
by many investigators that the occurrence of a given phenomenon is evenly 
distributed over the course of therapy. The results indicate that such an assump- 
tion is grossly inaccurate—given variables were found to appear consistently 
more frequently in certain segments of the hour than they do in others, It is 
suggested that examining the variable where it occurs most often will increase 


the power of future investigations. 


Several authors (cf. Bordin, Cutler, Ditt- 
mann, Harway, Raush, & Rigler, 1954; Kies- 
ler, 1966) have pointed to the methodological 
factors which need to be considered in psy- 
chotherapy process research. Among others, 
sampling decisions ought to be tied to the 
theoretical framework to which one’s mea- 
sures are related, the size of the segment to be 
sampled must be determined, the location of 
the sample taken from the therapy hour has 
to be chosen, and the rating systems used 
should ideally represent unidimensional scales. 
Most of these questions need to be put to em- 
pirical test, and decisions regarding the valid- 
ity of the assumptions underlying much psy- 
chotherapy research must be based on data 
rather than left at the level of speculation. 

A number of studies of psychotherapy proc- 
ess have utilized an analysis of tape-recorded 
samples of therapeutic sessions (Kiesler, 
Mathieu, & Klein, 1964; Tomlinson & Hart, 
1962; Truax & Carkhuff, 1967; Van der Veen, 
1967). These investigators have employed the 
selection of random time samples from a 
number of therapy sessions. This method 
tends to assume that the form of a given 
phenomenon is essentially the same during 
the first third of the hour (or the first 4 
minutes) as it is during the last third of the 
hour (or the last 4 minutes). While it is often 
assumed that random sampling provides the 


1Now at Counseling Center, University of Mary- 
land. Requests for reprints should be sent to Norman 
Karl, Counseling Center, University of Maryland, 
College Park, Maryland. 


least biased data, one can legitimately raise 
the question whether these techniques actually 
“wash out” true differences. Van der Veen 
(1967), in fact, suggests the possibility that 
segment selection may have been in part 
responsible for his inability to replicate results 
by previous investigators. 

Kiesler, Klein, and Mathieu (1965) pro- 
vide strong evidence to support the notion 
that the time segment sampled represents a 
critical variable for the content obtained. 
Further, such variation seemed to be influ- 
enced, in their investigation, by the particu- 
lar diagnostic group to which an S$ belonged. 
Some interesting issues are raised as a result 
of that investigation: 

1, Are assumptions regarding variance as a 
function of segment sampling applicable to a 
variety of process variables? 

2. Can such variance be demonstrated over 
the course of the entire therapy process? 

The essential questions under consideration 
are whether or not changes occur in the form 
of the therapeutic process, when client and 
therapist populations and process variables 
differ from those employed in other investiga- 
tions. Are certain response categories more 
typically employed (across all clients and 
over the course of therapy) during various 
phases of the therapeutic hour? At issue is 
the tenability of the assumption concerning 
the invariance of the process for a given re- 
sponse characteristic during or across the 
therapeutic interview. 
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METHOD 
Subjects 


The clients selected for inclusion in the present 
study were all female undergraduate students who 
completed a minimum of six therapy interviews at 
the Michigan State University Counseling Center. 
They were self-referred clients seeking help with 
personal-social problems. An intake interview de- 
termined that a program of therapy was warranted. 
A total of 25 Ss comprised the present sample. 

The Ss included in the study represent 25 of the 
total of 54 randomly selected clients who were asked 
to participate in the research program being con- 
ducted at the Counseling Center. A sound-recording 
tape library was developed on the psychotherapies 
of these 54 clients, and it is from this library that 
the current sample was drawn. 


Therapist 


Clients were assigned to therapists on the basis of 
matching availability of hours. Meetings were gen- 
erally conducted weekly, except in the case of prac- 
ticum therapists, who met twice a week with their 
clients. Final assignment of a case depended upon 
the therapist’s willingness to work with the client. 
Such judgments were made on the basis of intake 
interview data and the personal impressions of the 
intake interviewer, 

The psychotherapists in the sample represented 
several experience levels: six staff counselors were 
PhD counseling or clinical psychologists with 3 to 
20 years of psychotherapy experience; 14 interns 
were advanced counseling or clinical doctoral candi- 
dates with a minimum of 1 year of intensive psycho- 
therapy supervision; five practicum students repre- 
sented a group of less advanced graduate students 
who were involved with their first cases as part of 
an introductory practicum course. 


Sampling Procedures 

All therapy tapes for each of the 25 Ss were in- 
cluded in the present investigation, The sample rep- 
resented a total of 330 tapes. Each tape was ran- 
domly assigned to be sampled during its first, second, 
third, fourth, or fifth 10-minute portion. A content 
analysis was performed on the 330 10-minute tape 
segments, 


Coding Procedure 


_. The content analysis systems employed 
derive, in part, from two sources: (a) the 
method developed by Bandura, Lipsher, and 
Miller (1960), and modified by Winder, 
Ahmad, Bandura, and Rau (1962), was ap- 
plied to measure client “Dependency” and 
“Hostility” response categories, and therapist 
“Approach” and “Avoidance” behavior; and 
(6) Lennard and Bernstein’s (1960) pro- 
cedure was employed to identify client and 


therapist “Primary” and “Secondary” role 
systems, and “Descriptive,” “Evaluative,” and 
“Prescriptive” therapist statements. 


Description of Terms 


Dependency statements include expressions 
of need to depend on others; allowing others 
to assume the initiative; to be told what to 
do; to be “cured” by an external source; de- 
scription of dependent, approval seeking be- 
havior; dependent agreement and nurturant- 
accepting behavior. 

Hostility statements include description or 
expression of unfavorable, critical or sarcastic 
remarks; oppositional, antagonistic, and argu- 
mentative attitudes; expressions of dislike, dis- 
agreement, resentment, resistance, irritation, 
annoyance, and anger; expressions of aggres- 
sive domination. 

Therapist Approach responses represent any 
verbalizations which seem designed to elicit 
further expression or elaboration of client 
verbalizations. This category includes thera- 
pist endeavors such as exploration, reflection, 
labeling, interpretation, support, and informa- 
tion giving. 

Therapist Avoidance responses include any 
verbalization which seems designed to inhibit, 
discourage, or divert further expression of 
client statements. Subsumed under this cate- 
gory are therapist disapproval, topic transi- 
tion, ignoring a client statement, mislabeling, 
and inappropriate silence. 

Primary role system statements refer to 
verbalizations in which the client or therapist 
allude to their respective roles during the 
therapeutic process and to the purposes, goals, 
and accomplishments of treatment. 

Secondary role system statements include 
verbalizations pertinent to the interpersonal 
interaction between the client and therapist. 
Transference phenomena are encountered here. 

Therapist Descriptive propositions seek or 
offer information. They similarly offer or ask 
for clarification, orientation, or reiteration. 

Evaluative therapist statements convey ap- 
praisal, value judgments, and interpretations. 
They inquire about or offer opinion, expres- 
sion of affect, or analysis. 

Prescriptive propositions occur where the 
therapist seeks or conveys directives, includ- 
ing giving or asking for suggestions. 
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The above-mentioned categories represent 
discrete measures. In other words, a unit 
either fit a particular response category or it 
did not; such determinations and examples 
are provided in more extensive scoring manu- 
als (Bandura et al., 1960; Kopplin, 1965; 
Lennard & Bernstein, 1960; Winder et al., 
1962). 


Scoring 


The basic model subsumed under both scor- 
ing procedures is that the focus is turned to 
the interaction sequence. Scoring is as follows: 
a client statement, a therapist response, and 
the subsequent response on the part of the 
client. By definition, a scoring unit is repre- 
sented by the entire verbal content of a 
speaker with the preceding and succeeding 
verbalizations of the other person acting as 
the boundary for that unit. 

The scoring sequence used may be sum- 
marized as follows: The judge listened to a 
client verbalization and classified it first ac- 
cording to the Dependency, Hostility, or 
Other content response category. Then the 
judge scored the client statements according to 
the Primary or Secondary role system 
involved, if any. Subsequently, therapist 
responses were scored as Approach or Avoid- 
ance. The therapist’s verbalization was simi- 
larly scored for presence of Primary or Sec- 
ondary role statements, and then according to 
the type of proposition employed (i.e., De- 
scriptive, Evaluative, or Prescriptive). 


Scoring Reliability 


Two raters scored response categories— 
the senior author, Judge A, and a reliability 
scorer, Judge B, a graduate student in clinical 
psychology. The two judges initially scored 
several tapes together in order to train them- 
selves in the use of the system. This pro- 
cedure obtained until the raters achieved an 
.80-.85 product-moment correlation coeffi- 
cient, which took about a week of training 
time. Training tapes were not included in the 
data analyzed as part of the study. 

The subsequent sample on which reliability 
was determined included approximately one- 
half of the total. The first 75 and last 74 tapes 
were rated by both scorers in an attempt to 
determine whether any consistent changes in 


TABLE 1 


InTERJUDGE RELIABILITY COEFFICIENTS AND THE 
STANDARD SCORES AND p VALUES ASSOCIATED 
WITH THE DIFFERENCES BETWEEN A PAIR 

OF CORRELATIONS 


Correlation co- 
efficients* 
Measure ey dis], mC | rece 
First 75 | Last 74 
tape seg- | tape seg- 
ments ments 
Dependency 83 -80 -53 | .56 
Hostility .92 .89 1.00 | .31 
Client Other .98 98 Sail ica 
Client Primary 91 96 2.51 | .01 
Client Secondary 84 84 re Kiina 
Client Other Role .99 .98 2.09 | .04 
Approach 99 99 can Voay 
Avoidance 19 83 .64 | .52 
Therapist Primary 93 97 2.60 | .01 
Therapist Secondary 92 95 1.40 | .16 
Therapist Other Role 98 97 1.23 | .22 
Descriptive 91 93 +78 | 43 
Evaluative 98 97 1,23 | .22 
Prescriptive 81 74 1,06 | .29 


a All coefficients, p < .002. 


Rater A’s scoring occurred over the course of 
coding. Judge A rated the remainder of the 
330 10-minute segments. The necessity of es- 
tablishing reliability estimates for scoring 
units was eliminated since Judge A indicated 
to Judge B where a particular client or 
therapist unit began by transcribing the first 
few words of each sequence to be scored. 
Judge B stayed within these confines when 
scoring the client or therapist response. 

Interjudge coefficients and the significance 
of the differences between pairs of correla- 
tions are presented in Table 1. The table 
entries represent the correlations of the sum- 
mated client-therapist-client units in a given 
10-minute segment. 

It is evident that the scoring categories rep- 
resent highly reliable measures. In most cases 
where reliability was lower, this fact could be 
attributed to the relatively smaller frequencies 
observed in the particular category under 
consideration. Nevertheless, interjudge reli- 
ability coefficients estimated for both the first 
75 and the last 74 segments sampled were all 
highly significant with p < .002 for all mea- 
sures. Similarly, the Z scores obtained for 
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TABLE 2 
ToTAL FREQUENCIES FOR MEASURES GROUPED BY TAPE SEGMENT SAMPLED 


Tape segment 
Measure 
1 2 3 4 5 sa 
Dependen 106 96 103 123 149 15.42** 
Hostility cd 78 106 110 110 85 9.50* 
Client Primary 142 89 96 58 156 59.82*** 
Client Secondary 16 27 22 16 29 6.59 
Approach 1195 1207 1203 1197 1187 .20 
Avoidance 21 9 14 19 30 12,95* 
Therapist Primary 166 164 124 98 254 86.18*** 
Therapist Secondary 24 62 56 63 59 19.98*** 
Descriptive 291 168 142 159 217 74.29%** 
Evaluative 891 1006 1044 1016 886 23.05*** 
Prescriptive 35 43 30 42 114 89.70*** 
*p <.05. 
ee 
#45 S001. 


difference between two coefficients for a given 
measure indicate the high degree of con- 
sistency over time in the categorization of 
verbalizations, While some of the Za scores 
achieved statistical significance, which would 
suggest that a change in scoring procedures 
between the two judges did in fact occur over 
time, it can be seen from Table 1 that these 
differences occurred where correlations were 
so high, that a difference in r of .01 or .02 
was critical enough to yield a significant Za. 

Since a maximum of two judges was em- 
ployed the question of possible interconfound- 
ing of the response categories becomes an 
important consideration. Kiesler (1966) in- 
dicates that process scales ought to tap single 
dimensions. He further suggests that low inter- 
rater reliability generally reflects the presence 
of several dimensions, and that attempts 
should be made to establish face validity as a 
further measure of unidimensionality. 

The classes of Dependency and Hostility 
seem to meet some of the minimum require- 
ments of face validity. Kopplin (1965 ) indi- 
cates that these two categories represent styles 
of interpersonal interactions which appear in 
common to most major theories of personality 
and psychotherapy. These concepts are simi- 
larly used on a common-sense basis, such that 
the lay person would classify as Hostility the 
statement, “that man was angry or resentful.” 

The evidence in terms of reliability reports 


of other investigators seems to further estab- 
lish the unidimensionality of the response 
categories. Winder et al. (1962), using two 
sets of two judges each, report the following 
respective reliabilities—Hostility, .89 and .83; 
Dependency, .91 and .92; Approach, .83 and 
98; and Avoidance, .44 and .81. Alexander 
(1967), employing three raters, found the fol- 
lowing coefficients—Hostility, .88; Depen- 
dency, .80; Approach, .97; Avoidance, .44; 
Client Primary, .87; Client Secondary, .84; 
Therapist Primary, .89; and Therapist Sec- 
ondary, .86. 

While reliability data from other studies 
does not exist for the Descriptive, Evaluative, 
and Prescriptive categories, the high correla- 
tions obtained in the present study again seem 
to point toward independent categories. 

The frequency with which each response 
measure appears during any given 10-minute 
segment is presented in Table 2. The table en- 
tries include the sums of the response cate- 
gories found in each 330 10-minute segment. 
Thus, Table 2 indicates the total frequencies 
of a given response for all clients over the 
entire course of their therapies. 

It is evident that major differences occurred 
in the relative frequencies for a given response 
category, on the basis of the segment of the 
therapeutic hour under investigation. De- 
pendency statements tended to be manifested 
most frequently in the last segment of the 
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therapy hour and least often during the mid- 
dle portion. Conversely, the middle sessions 
appeared to have the greatest degree of Hos- 
tility statements, with the least amount ap- 
pearing in the initial and final stages. Client 
Primary role statements occurred more fre- 
quently during the next-to-last segment. No 
significant differences obtained for either the 
Client Secondary role statements or the thera- 
pist Avoidance categories, Therapist Primary 
role statements did, however, show a steady 
decline through the fourth 10-minute segment 
with a sharp rise during the last 10 minutes. 
Therapist Secondary role communications, on 
the other hand, were least frequently ex- 
pressed during the first 10-minute portion of 
the hour, and then increased and leveled off 
to a relatively stable incidence of occurrence 
for the remainder of the session. Therapist 
Descriptive propositions appeared with the 
greatest frequency during the first and last 
portions of the hour, with a lower stable func- 
tion during the middle part. Evaluative propo- 
sitions appeared in a direct inverse relation- 
ship to Descriptive propositions. Prescriptive 
propositions seemed to occur with equal fre- 
quency during the first four parts of the ses- 
sion, and increased sharply to reach their peak 
during the final phase. 

The stength of the fluctuations during the 
hour was a rather marked and consistent one, 
a phenomenon which was maintained across 
several measures, 


Discussion 


The sampling technique chosen was specifi- 
cally designed to determine whether the time 
segment-process phenomenon reported by 
Kiesler et al. (1965) could be demonstrated 
when all therapy sessions were sampled. The 
former investigators sampled five consecutive 
10-minute segments from the same therapy 
hour for each client. While they did include 
early and late interviews in their design, they 
did not study the entire therapy process. The 
Present study permitted such analysis, and 
points to the pervasiveness of the function. 
Had the sampling included a single interview 
for each client, the ability to generalize to 
the entire therapy process, for this sample, 
would have been severely limited. 


211 


The results clearly reflect a marked ten- 
dency for certain segments of therapy to be 
characterized more by some content categories 
than others. Confirmation is presented for the 
assertion that time must be either controlled 
or considered when studying psychotherapy 
process, 

The strength of the effect of time on 
process is impressive. While the present in- 
vestigation uses a different client population 
(limited to females) and different measures 
from those which Kiesler et al. (1965) em- 
ployed, essential confirmation of their results 
obtained. 

One caution in interpreting these findings 
was suggested by the work of Kiesler et al. 
(1965), which indicated differential effects 
among different diagnostic groups. They re- 
ported the greatest effect of time on the 
process variable within the neurotic group. 

The sample under investigation in the 
present study was compared to a normative 
group of Michigan State University females, 
The number of deviant MMPI scales was 
greater for Ss in psychotherapy (X, = 2.00, 
Xo = 1.03, t = 3.59, df = 133, p < 01). No 
further attempt at differential diagnosis was 
made, Generalizations, therefore, should be 
limited to a group of female college under- 
graduates seeking psychotherapy. 

Similarly, generalizations beyond the par- 
ticular process variables dealt with in this 
study (i.e., Dependency, Hostility, etc.) 
should be made with considerable caution. It 
is important to emphasize that the current set 
of results are related only to the process 
variables under investigation. Replicability of 
these findings with other clients and other 
therapists remains to be put to the test, as 
does the generality of these findings with other 
response measures. It should be noted, how- 
ever, that the present investigation demon- 
strates the applicability of the time-process 
function to variables emerging from theoreti- 
cal frameworks different from those used by 
other investigators. 

While evidence is presented that the time 
segment sampled affects the process, no answer 
is given to the question of what elements are 
to go into the decision of choosing which seg- 
ments should be investigated. Kiesler (1966) 
maintains that such determinations must be 
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made in relation to the theoretical framework 
in which one is working. Where Rogerian 
measures, for example, are involved, random 
sampling may be more justified since the 
pervasiveness of the phenomena is empha- 
sized. Freudian variables, on the other hand, 
might require sampling techniques more in- 
tentionally focused on specific problem areas. 

Given that the above-mentioned theoretical 
implications are included in an experimental 
design, the present findings suggest some im- 
portant considerations for further sampling 
techniques. If, for example, a particular in- 
vestigation is primarily concerned with de- 
pendency phenomena, the characteristic mode 
of operation would be to sample a portion of 
the total tapes available for N clients. This 
procedure apparently results in a partial ex- 
ploration, at best, since the entire process 
(i.e., all interviews) of therapy is not being 
sampled for a given client. 

While random selection of time points 
within the hour would still be valid if the 
number of samples extracted was large, maxi- 
mum efficiency would seem to be obtained by 
sampling the segment in which the variable 
(e.g., “Dependency”) appears in its most 
potent form. Such a timesaving procedure 
would possibly permit all therapy tapes for a 
given client to be included in the sample, as- 
suming that such functions are found to ob- 
tain for other investigations. 

Pilot data may have to establish, for each 
investigation, the portion of the therapeutic 
hour which most dramatically characterizes 
the phenomenon under consideration. Once 
such determination has been achieved, a more 
thorough detailed analysis of the variable is 
possible through selection of the segment in 
which it is most powerfully exhibited. 
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Fifty-three Peace Corps trainees were administered a self-disclosure question- 
naire during a pretraining assessment program. Trainees who demonstrated a 
readiness to confide personal information to others were found to be more well 
liked by the other trainees and the training staff after 6 weeks of training than 
trainees who were reluctant to disclose personal information. Hypotheses con- 
cerning the relations between self-disclosure, integrative complexity, and au- 
thoritarianism were supported. The accessible trainees tended to be more inte- 
gratively complex and less authoritarian than the inaccessible trainees. Implica- 
tions of self-disclosure for interpersonal openness and effectiveness are discussed. 


Social accessibility or self-disclosure refers 
to the process of communicating information 
about oneself to other persons. The readiness 
to confide personal information has been 
shown to contribute to the development of 
social relationships. Jourard and Lasakow 
(1958), and Jourard (1959) found that the 
degree of liking another person is correlated 
with the amount of self-disclosure to that per- 
son, while Jourard and Landsman (1960) 
indicate that receiving self-disclosure inputs 
from another person leads to greater dis- 
closure toward that person. Newcomb (1961) 
has also noted that the exchange of personal 
information leads to the establishment of 
friendships. 

Self-disclosure is also related to behavioral 
measures of interpersonal competence and 
openness. Colson,? Frankfurt (1965), and 
Taylor (1965) have all found that persons 
reporting high self-disclosure revealed more 
about themselves in social interaction situa- 
tions than low scorers. Jourard (1961) found 
that nursing students who scored high on 
a self-disclosure questionnaire tended to be 
rated a year later as high in the “ability to 


1 Based on a paper read at the Eastern Psycho- 
logical Association meetings in New York, N. Y., 
April 1966. This research was conducted at Syracuse 
University and was supported by funds from the 
Peace Corps. The authors wish to thank David E. 
Hunt and Harold M. Schroder for their cooperation. 

2 Requests for reprints should be sent to Charles F. 
Halverson, Jr., National Institute of Mental Health— 
Building 15-K, 9000 Rockville Pike, Bethesda, Mary- 
land 20014. 

3W. N. Colson. Self-disclosure as a function of 
social approval. Unpublished manuscript, 1965. 


establish and maintain a communicative rela- 
tionship with patients” as well as showing a 
high degree of openness with the nursing fac- 
ulty. Presumably persons who are socially 
open to others are seen as more interperson- 
ally competent than individuals who are 
guarded and closed in their personal affairs. 

Although other research has been carried 
out on self-disclosure, the two groups of 
studies above appear to be the main lines of 
direct construct validity evidence so far. The 
present study had three aims: (a) to extend 
the construct validity of self-disclosure further 
by placing it in a larger theoretical con- 
text, or “nomological network” (Cronbach & 
Meehl, 1955), (b) to utilize the “nomo- 
logical network” to predict further aspects of- 
interpersonal competence, and (c) to test di- 
rectly whether reported self-disclosure is 
related to being liked by others. 

Concerning the first aim, that of placing 
social accessibility in a larger theoretical 
context, the theories of authoritarianism 
(Adorno, Frenkel-Brunswik, Levinson, & 
Sanford, 1950) and conceptual systems (Har- 
vey, Hunt, & Schroder, 1961; Schroder, 
Driver, & Streufert, 1967) seem particularly 
relevant, as does Tomkins’ (1964) notion of 
good-bad “polarity.” 

The theory of authoritarianism (Adorno et 
al., 1950) implies that nonauthoritarians are 
“trusting and open” in their interpersonal 
relations, while highly authoritarian persons 
manifest distrust and suspicion of others, 
Since authoritarian persons are purported to 
view people as threatening, they should be less 


213 


214 


disclosing of personal information than non- 
authoritarians. In other words, authoritarian 
individuals should rate themselves as less 
socially accessible than nonauthoritarian indi- 
viduals. 

Conceptual systems theory (Harvey et al., 
1961; Schroder et al., 1967) is also relevant 
to self-disclosure. This theory postulates that 
cognitive functioning varies along a dimension 
of information-processing ability. Interper- 
sonal cognitive functioning at low levels of 
complexity is characterized by a relatively 
simple and inflexible view of others, while 
more complex functioning represents a more 
flexible and open interpersonal orientation. 
Because their simple, inflexible structures for 
processing interpersonal information are in- 
adequate for dealing with many interpersonal 
situations, low-complexity persons presumably 
often react in a defensive manner. More com- 
plex persons, whose information processing is 
adequate to deal with most interpersonal 
situations, can relate in a more open, informa- 
tional manner, Therefore, a positive relation 
between conceptual complexity and interper- 
sonal competence and openness was expected. 
Relative to conceptually simple individuals, 
complex individuals should rate themselves as 
more socially accessible. 

Tomkins (1964) designed a “Polarity 
Scale” to measure the degree to which people 
believe that human nature is basically good 
or evil. If a person views human nature, and 
thereby presumably fellow persons, as basi- 
cally evil, he will probably behave in a more 
guarded and defensive manner towards others, 
It was predicted that Polarity (with “man 
is basically evil” scored high) would be 
negatively related to social accessibility. 

The above discussion leads to the second 
aim of the study, that of predicting behavioral 
flexibility from self-disclosure. If it is assumed 
that conceptual complexity, authoritarianism, 
and polarity of beliefs play integral roles in 
the development of social accessibility, one 
would expect socially accessible persons to 
behave not only more openly, as Jourard 
(1961) found, but also more flexibly and 
adaptively. Thus it was predicted that self- 
disclosure would ‘be positively related to be- 
havioral ratings of “interpersonal flexibility” 
_and “general adaptability.” 
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As to the third aim, although research has 
shown that confiding personal information 
plays a role in the development of social 
relationships, there has not been a direct test 
of whether individual differences in self- 
disclosure are related to being liked by others. 
In addressing that question it seemed impor- 
tant to distinguish the formation of initial 
impressions from the development of stable 
social relationships, for the same variables 
may not be important in both processes. Social 
accessibility would seem to play an important 
role in the latter, but its role in impression 
formation seems ambiguous: Reluctance to 
disclose personal information may not neces- 
sarily be noticed or salient in primary impres- 
sion formation, It was predicted that social 
accessibility would be positively related to a 
sociometric index taken after the group had 
had extensive interactions, but no prediction 
was made concerning the same index taken 
after only a brief interaction period. 

The Peace Corps training program, in which 
this study was undertaken, was well suited 
for gathering the sociometric and other inter- 
personal behavior data necessary to test the 
hypotheses. The Peace Corps trainees (PCTs) 
arrived at the training site from all parts of 
the country, and, except for a few married 
couples, did not know each other. However, 
this soon changed. The PCTs spent the next 
12 weeks as a very compact and closely-knit 
group, being together 12 to 14 hours a day, 
eating, attending classes, and utilizing their 
little free time together. This setting provided 
an opportunity to see who would come to be 
the best liked trainees in the group, and who 
would be judged the most interpersonally 
competent and flexible by the assessment and 
training staff. 


METHOD 
Subjects 


The Ss were 53 PCTs being trained for teaching in 
East Africa and all were college graduates. Each of 
the PCTs participated in a 2-day pretraining assess- 
ment program in which they were administered the 
experimental materials. 


Measures 


Self-disclosure scale. The self-disclosure scale was a 
modifed form of the one used by Jourard and 
Lasakow (1958). The scale consisted of 30 state- 
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ments of information about the self such as, “What 
I think and feel about religion: my personal reli- 
gious views,” “All present sources of income-wages, 
fees, allowance, etc.,” and “My views on commu- 
nism.” The trainees were told to read each item and 
then indicate the extent that they had talked, or 
might talk about each item to three target persons: 
(a) a stranger, (b) an acquaintance, and (c) their 
best friend. Thus the PCTs made three ratings for 
each item. They gave a 0 when they would disclose 
nothing about the particular item to that target 
person, a 1 if they would talk in general terms, and 
a 2 if they would talk in full and complete detail 
about the item. These numerical entries for each item 
were summed, yielding totals which constituted the 
self-disclosure score.* 

Peer nomination. After the 2 days of pretraining 
assessment (during which they had only minimal free 
time to interact), and again after 6 weeks of training, 
each trainee evaluated the other trainees with a 
sociometric inventory. Specifically, each trainee was 
asked to indicate the person or persons he would 
most like to be assigned with in his overseas station. 
The trainee could write down as few or as many of 
his peers as he wanted. This peer nomination was 
quite involving for the trainees because they viewed 
this procedure as having an actual bearing on the 
assignment of trainees. They each spent some time 
and considerable effort in making their choices. The 
number of positive choices received by each trainee 
constituted his peer nomination score on both 
occasions. 

Assessment board ratings. The assessment and 
training staff met after 6 weeks of training in what 
was termed the midboard evaluations. Each trainee’s 
progress in the program was assessed, and a decision 
made as to whether each trainee was qualified to 
continue training. On the basis of aptitude tests, class 
performance, psychiatric interviews, and training 
staff evaluations, the midboard assessment team 
rated each trainee on several 5-point scales relative 
to suitability for Peace Corps service. The ratings 
were not contaminated by the trainees’ social acces- 
sibility scores, as they were not available to the 
raters. 

Relevant to the present investigation were the 
mid-board ratings of “interpersonal flexibility” and 
“general adaptability.” The raters were provided with 
definitions of the scales. The definition for the 
“interpersonal flexibility” scale stressed rating how 
well the trainee was able to cope with interpersonal 
situations, while the “general adaptability” rating was 
to indicate how well he could cope with problem- 
solving situations and stress. The assessment board 
arrived at a consensual rating for each trainee for 
each scale. 

Authoritarianism. A revised 100-item version of the 
California F Scale (Stern, Christie, Lane, Sanford, & 
Webster, 1960) was employed. The revised form was 


4Scores for the three target persons were summed 
and only the total score used in data analysis because 
the individual target person scores were highly inter- 
correlated (r > .95 in all cases). 
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designed to tap personality rather than ideological 
aspects of authoritarian functioning and had several 
types of item reversals built in to minimize the con- 
tribution of social desirability and acquiescence re- 
sponse sets and to capture adequately several varieties 
of nonauthoritarianism. 

Conceptual complexity. Conceptual complexity was 
measured by the Paragraph Completion Inventory 
(Schroder et al., 1967), a semiprojective test which 
requires Ss to write several sentences in reaction to 
and resolution of situations which imply uncertainty, 
structure, and conflict. In scoring the completions, 
the raters evaluated the number of alternative resolu- 
tions expressed or implied by the completion, as 
well as the complexity of the rules which S$ used in 
producing the response. The completions were scored 
along a 7-point scale representing a continuum from 
low to high complexity (see Schroder et al, 1967, 
Appendix 2, for scoring manual) by two experienced 
scorers, who had in the past consistently shown 
interjudge reliabilities in the .90s. The score for each 
trainee consisted of the average of the scores assigned 
to the six responses by the two judges. 

Polarity Scale. The Polarity Scale (Tomkins, 1964) 
consisted of 56 forced-choice format items adminis- 
tered with the instructions used by Tomkins that S 
could check either, both, or neither of the two alter- 
natives for each item. The forced-choice format 
minimized the possibility of responding to the alter- 
natives on the basis of greater social desirability of 
one of the items. A trainee’s score was determined 
by dividing the number of “human nature is 
basically evil” choices checked by the total number of 
choices checked. 

Aptitude tests. The School and College Aptitude 
Test (SCAT) verbal scores and General Aptitude 
Test scores were available on all Ss, and were used 
in this study to check on whether the finding of 
hypothesized correlations among other variables 
could be attributed to a joint relationship to intel- 
ligence. This check can also serve as one indication 
of the discriminant validity of the social accessibility 
measure. 


RESULTS AND Discussion 


The product-moment correlations between 
self-disclosure and the other measures are 
shown in Table 1. As can be seen in Table 1 
nearly all the predicted correlations with self- 
disclosure were significantly different from 
zero in the expected directions. 

These data offer support for the generaliza- 
tion that trainees’ self-reported social acces- 
sibility relates to how well they are liked by 
peers and how interpersonally effective and 
adaptive the midboard assessment team sees 
the trainees to be. It is also interesting to 
note that self-disclosure scores were not re- 
lated to the initial peer nominations but were 
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TABLE 1 
CORRELATIONS AMONG ALL THE VARIABLES OF THE STUDY 


Item 2 | 3 4 | 5 al 6 | 7 | 8 | 9 | 10 
Self-disclosure (1) 133" —34* —.26 .06 .30* -36** 4i** Al  —.09 
Conceptual 

Seni a (2) 133* 15 24 23 A5** — .40** .21 .00 
Authoritarianism (3) 00  —.02 —.10 .12 .26 .04 .19 
Polarity (4) 124" a17 =26 4 —.30* —.20 
Peer nomination 

(initial) (5) 238%" | 48% 48** —07  —.02 
Peer nomination 

(6-week) (6) .80** S00% 47  —.05 
Interpersonal flexi- 

bility (7) re RA e07 
General adaptability (8) —.11 ape 
SCAT—verbal (9) 31 


General Aptitude 
Test (10) 


* Dd <.05. 
**p <01. 


related to the 6-week peer nominations. It 
suggests that other impression-formation 
type variables account for initial attitudes 
toward others, but that self-disclosure assumes 
importance in the development of more stable 
and less superficial interpersonal relationships. 
The relationship of social accessibility to the 
“interpersonal flexibility” and “general adapt- 
ability” ratings supports the idea that there is 
a process dimension underlying social acces- 
sibility which has to do with behavioral plas- 
ticity as well as openness. The correlations of 
conceptual complexity with both social acces- 
sibility and the adaptability ratings suggests 
that conceptual complexity may be tapping 
that dimension. 

As predicted, self-disclosure was negatively 
correlated with authoritarianism and posi- 
tively correlated with conceptual complex- 
ity, while the negative correlation with the 
Polarity Scale was of marginal significance 
(p = .06, two-tailed). The fact that the more 
authoritarian individual reports that he dis- 
Closes less about himself suggests, consistent 
with the theory of authoritarianism, that one 
with relatively closed belief systems acts so 
as to minimize the challenge to his values and 
beliefs. A similar interpretation can be made 
of the correlation between conceptual com- 
plexity and self-disclosure. A person of a 
higher level of complexity presumably inter- 


acts in an interdependent manner (i.e., assum- 
ing mutuality and equality in relationships; 
see Harvey et al., 1961), whereas one at a 
low level presumably interacts unilaterally 
(e.g., dominant and submissive roles). It 
follows that there should be more openness 
in communicating to others in an interdepen- 
dent rather than a unilateral interaction. 

Authoritarianism, low conceptual complex- 
ity, and the belief that human nature is evil 
can be viewed as three distinct cognitive- 
motivational bases for defensiveness towards 
others, It is interesting to note that together 
they account for about 25% of the total vari- 
ance in self-disclosure (multiple R = .51). 

While self-disclosure was related to con- 
ceptual complexity, authoritarianism, and peer 
nominations, the latter three variables did not 
correlate with each other. Thus self-disclosure 
is related to three aspects of interpersonal 
functioning which are operationally and con- 
ceptually independent, yet which all have to 
do with interpersonal openness and effective- 
ness. Those findings plus the relations of self- 
disclosure with the behavior ratings of inter- 
personal flexibility and adaptability, indicate 
a substantial degree of convergent construct 
validity for the interpretation of self-dis- 
closure within a framework of interpersonal 
flexibility and openness. 

Although it might be Possible to interpret 
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some of the present results as reflecting the 
contribution that social desirability or other 
response sets make to the paper-and-pencil 
measures, this possibility is rather remote for 
several reasons: The measures of authoritari- 
anism and polarity had specific formats de- 
signed to control for response sets, and they 
were not intercorrelated, which suggests they 
were measuring different aspects of interper- 
sonal functioning in this study. Also response 
sets seem to be minimized in the Paragraph 
Completion Inventory because of the semi- 
projective nature of the responses. 

The fact that neither the SCAT verbal 
scores nor the General Aptitude Test scores 
correlated with self-disclosure is one bit of 
evidence for the discriminant validity of self- 
disclosure as well. 
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o hundred Ss matched for sex, in- and outpatient status, and for medical- 
Pate dar nonpatient status were administered a self-rating daydreaming ques- 
tionnaire which was scored for frequency of daydreaming, content, and reac- 
tions to daydreams. Strong sex differences in content emerged, with males 
engaging in flamboyant, action-oriented daydreams, while the daydreams of 


men were passive, reality-oriented, and somewhat less frequent. Very few 
Tires Meads a function of hospitalization, but striking differences 
emerged on the illness dimension, with psychiatric Ss engaging in more improb- 
able, ideational, and highly affect laden daydreams, but reacting with little con- 
cern. Although daydreaming less than psychiatric Ss, medical Ss were more 
frightened by and absorbed in their daydreams, In general, psychiatric Ss had 
a greater tolerance for emotional fantasy than did medical patients. 


Experimental studies of daydreaming have 
dealt with a narrow range of Ss, largely 
middle-class college and graduate students. 
Although it is well documented that for this 
group daydreaming is a common daily experi- 
ence, Singer (1966) in reviewing the day- 
dream literature stated: “We need, therefore, 
to obtain some kinds of information about the 
true extent of inner mental activity in various 
samples . . . [p. 6].” 

Three factors have been virtually ignored 
in the empirical daydream literature: sex dif- 
ferences,’ relationship to illness, and the effect 
of hospitalization. Singer (1966) reported no 
significant sex differences in his daydreaming 
studies, Pytkowicz, Wagner, and Sarason 
(1967) reported a sex difference that ap- 
proached significance with males reporting 
more daydreaming. The present research in- 
vestigated sex differences in both content and 
frequency of daydreaming and in attitudes 
toward daydreaming. 


1 This research was supported by a grant awarded 
to the senior author from the State of Washington 
Initiative 171 funds for research in Biology and 
Medicine, 

2 Requests for Teprints should be sent to Ann 
Pytkowicz Streissguth, Child Development and Men- 
tal Retardation Center, Behavior Science Unit, Uni- 
versity of Washington, Seattle, Washington 98105. 

3 Since completion of the present study, Wagman 
(1967) has published a study of six differences in the 
content of daydreams, employing college student Ss, 
in which findings similar to those Presented in this 
study are reported, 


There are also no reported empirical studies 
on the relation of illness to daydreaming and 
little research on the effects of situational 
factors on daydreaming. A recent study by 
Antrobus, Singer, and Greenberg (1966), 
however, indicated that the presence of a 
highly distressing announcement preceding a 
discrimination task increased the production 
of spontaneous fantasy during task perform- 
ance. Presumably, through fantasy, the person 
attempts to integrate, deal with, and adapt to 
the distressing information. One might postu- 
late that similarly hospitalized patients might 
attempt to cope with the stress of illness 
through spontaneous daydreaming. 

The present study, then, investigated fre- 
quency of daydreaming, content of day- 
dreams, and attitudes toward daydreaming, as 
a function of sex, illness, and hospitalization. 
It was anticipated that men would have a 
greater frequency of daydreaming than 
women, that hospitalized patients would have 
a greater frequency of daydreaming than non- 
hospitalized patients, and that Psychiatric and 
medical patients would have qualitatively dif- 
ferent daydreaming patterns, 

Daydreaming was defined, following Singer 
(1966), as any cognitive activity Tepresenting 
a shift of attention away from an ongoing 
task. Loose associations, well-developed fanta- 
sies, plans for the future, memories of the 
Past, worries, wishes, fear: 


S, etc., were all 
subsumed under the term “daydreaming.” 
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TABLE 1 
CHARACTERISTICS OF SAMPLE 
Age in years Education 
Group 
20-34 35-49 50+ —11 12-14 15+ 

Psychiatric inpatient 

Male 9 10 1 2 9 9 

Female 9 9 5 2 11 7 
Psychiatric outpatient 

Male 12 7 1 3 8 9 

Female 9 9 2 3 9 8 
Medical inpatient 

Male 10 9 1 3 10 7 

Female 9 10 1 2 10 8 
Medical outpatient 

Male 8 10 2 2 8 10 

Female 8 11 1 2 9 9 
Nonpatients 

Male 9 10 1 3 10 7 

Female 9 10 1 2 10 8 
Total 92 95 13 24 94 82 


Note.—N = 20 for each group; total N = 200. 


METHOD 
Subjects 


The Ss were selected from three Seattle, Washing- 
ton, hospitals, and were equally divided within the 
following subgroups: psychiatric and medical patients 
and controls (nonpatients), in- and outpatients, and 
men and women. The psychiatric Ss were obtained 
first and the medical Ss and controls were then 
matched according to age and education. A purpose- 
fully wide range of Ss were selected, with age ranging 
from 20 to 73 years with a mean of 35.9 years and 
education ranging from 7 to 23 years with a mean of 
14.1 years. 

Certain unavoidable differences occurred between 
subgroups that reflect actual differences between the 
services from which Ss were selected. 

Length of hospitalization varied for each sub- 
group according to treatment procedures on each 
service. Psychiatric inpatients ranged in length of 
hospitalization from 3 to 120 days with a mean 
of 33.83 days. Medical inpatients’ length of hospitali- 
zation ranged from 1 to 48 days with a mean of 
5.05 days. 

Diagnoses of the psychiatric Ss included: 35% 
schizophrenics, 32% neurotic reactions, 24% character 
disorders, and 9% affective psychoses. There were 
somewhat more schizophrenics in the inpatient group 
and more neurotic reactions in the outpatient group, 
reflecting the nature of inpatient psychiatric patients 
compared to those seen in outpatient clinics, The 
diagnoses of medical Ss reflected the services from 
which they were obtained: 23% cardiology, 19% 
gastrointestinal, 12% arthritis, 11% chest and in- 
fectious diseases, 9% metabolic and endocrine, 6% 
Tenal, 5% dermatology, 5% hematology, 5% allergy, 
and 5% general medical. When tested, the control Ss 


were not patients and had no history of psychiatric 
treatment. They were recruited predominantly 
from hospital personnel and, where necessary, from 
acquaintances of the Es. 

The sample as a whole had the following marital, 
ethnic, and religious characteristics which do not 
appear to differ between subgroups: 65% were 
married, 35% were single, divorced, or widowed; 
29% were from British backgrounds, 28% from 
Western European, 16% from Scandinavian, 6% 
from Eastern European, and 21% from American or 
“other”; 68% were Protestant, 13% Catholic, 4% 
Jewish, and 15% “other” or had no religious prefer- 
ence. The sample is remarkably representative of the 
Seattle area. 


Procedure 


Each S was given the Singer Daydream Question- 
naire, which contains 120 specific daydreams to 
which an S responds on a 1-5 scale according to the 
frequency with which he has a particular dream.’ 
In addition, there are 142 statements about day- 
dreams to which Ss respond on a 5-point scale 
ranging from emphatically true to emphatically false. 

The Ss were given the test either individually or in 
small groups. In each case, the study was explained 
as a research project concerning what people day- 
dream and how they feel about their daydreams. The 


4 Now revised and published in a new form entitled 
“Tmaginal Processes Inventory” by Singer and An- 
trobus, published by the City College of New York. 

5 Although this study used the form of the Day- 
dream Questionnaire used by Singer in his early 
studies, the directions were those described by 
Pytkowicz et al., 1967. 
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examiner was available for questioning, but answered 
questions in a nonspecific but supportive manner. 


Experimental Design 


The Ss formed a 2X23 factorial design with 
the following independent variables: sex (male vs. 
female) ; hospitalization (outpatients vs. inpatients) ; 
and illness (psychiatric vs, medical patients vs. con- 
trols), Since there were, of course, no inpatient con- 
trols, this resulted in 10 subgroups of 20 Ss each, 
Scores were derived from the Singer Daydream Ques- 
tionnaire, Thirty dependent variables were studied, 
including one scale of daydreaming frequency, 11 
scales of daydream content, and 18 scales of attitudes 
about and attributes of daydreams. 

The necessary absence of the inpatient controls 
meant that the usual three-way analysis of variance 
had to be modified to allow for the unbalanced design 
(Dixon, 1965, p. 543; Scheffé, 1963). In particular, 
it was necessary to adjust the means of the control 
group to make them comparable with the psychiatric 
and medical groups which included inpatients. The 
adjusted control mean is given by: 


Yeoutror + Beg ius 
where Yeontro: is the mean for all control Ss while 
Yin and Fout are the means for inpatients and out- 
patients, respectively, in the psychiatric and medical 
groups. The comparison of inpatients and outpatients 
involved only the use of psychiatric and medical 
Ss while the sex comparison used all Ss. 


TABLE 2 


ANALYSIS OF VARIANCE OF MALE VERSUS FEMALE 
DIFFERENCES ON THE DAYDREAMING SCALES 


Variables Mean scores 
(Daydream scale) ti 
Male | Female 

Sexual 17.48 | 9.60 | 18,58* 
Heroic 6.30] 3.40 | 14.02" 
Modified aggressive 6.62) 4.55 | 10,94+* 
Acceptance of daydreams 31.48 | 25.92 | 10.70** 
Repetitiveness of day- 

dreams b 15.90 | 12.59 | 9,19%* 
Achievement-oriented 8.27 | 6.52 | 8,83** 
Factor H 55.30 | 47.24 | 8:79** 
Probable 10.33 | 12.85 | 8:28** 
General emotional response | 16.32 | 14.13 6,76** 
Present-orientation 14.92 | 12.77 | 5.98* 
Frequency 107.67 | 90.52 | 3.74 
Passivity 21.06 | 22.86 | 3.69 
Conscience 5.05} 4.03 | 3.66 
Problem solving 8.37 | 6.76 | 3.38 
Attempt to control 19.47} 1734 | 2.20 


—— rg ee 
Note.—N = 20, 
Poa ee in order of magnitude of difference. 
> < 01. 
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Psychiatric, medical, and adjusted control means 
were compared by Duncan’s multiple-range test with 
special modifications for the unbalanced design 
(Duncan, 1955). Because the adjusted mean for the 
control group had a larger variance than the psychi- 
atric and medical means, comparisons involving the 
control mean were less likely to reach statistical 
significance.® 


RESULTS 
Differences as a Function of Sex 


Overall sex differences in frequency of day- 
dreaming did not attain statistical significance 
as shown in Table 2. The trend of the sex 
difference in frequency replicates a previous 
finding (Pytkowicz et al., 1967) in which male 
college students also reported somewhat more 
daydreaming than women. The Ss in this 
study, spanning a much wider age and diag- 
nostic range, have a lower overall frequency 
of daydreaming compared with those in the 
earlier study, although the male-female rela- 
tionship remains the same, 

The content of men’s daydreams -differ 
significantly from the daydreams of women 
along a dimension of activity-passivity. Day- 
dream scales endorsed significantly more fre- 
quently by men include sexual daydreams, 
heroic daydreams, achievement-oriented day- 
dreams, and modified aggressive daydreams. 
The only content scale endorsed significantly 
more often by women than men is probable 
daydreams, which includes a selection of 
rather concrete, practical daydreams such as 
planning what to cook for dinner or what 
clothes to wear. Women also tend to score 
higher than men on a scale of passivity in 
daydreaming, which includes a selection of 
fairly static, unimaginative, inactive day- 
dreams (this latter difference, however, does 
not achieve significance at the .05 level). 

In attitudes toward daydreaming, men 
score significantly higher than women on the 
following scales: acceptance of daydreaming, 
Tepetitiveness of daydreams, general emo- 
tional response to daydreaming, and present- 
orientation in daydreams. Scales on which 


©The authors are most grateful to Ethel Schaeffer, 
Department of Preventive Medicine, University of 
Washington, for her assistance with the statistical 
paid ga ato. wish to acknowledge the assist- 
of James nd and Marjori in i 
collection of the data. Co gee 
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there is a trend for men to endorse more than 
women are: scales of conscience daydreams 
and problem solving through daydreaming. 
Many items on the latter scale are suggestive 
of daydreams being used actively in the 
creative process (such as to find new ways 
of approaching a problem). 

A noteworthy finding in terms of sex differ- 
ences in daydreaming is the absence of any 
differences on such scales as anxious day- 
dreams, improbable daydreams, overt-aggres- 
sive daydreams, pregenital daydreams, and 
need-satisfaction daydreams. 

Men had higher scores than women on 
Factor H, a factor with substantial loadings 
on neuroticism and other pathological non- 
daydreaming scales (Singer & Antrobus, 
1963). There is no reason to assume higher 
scores for males reflect increased neuroticism, 
raising questions, therefore, about the inter- 
pretation of this factor. 

In summary, the daydreams of men differ 
along a number of dimensions from those re- 
ported by women, The primary difference oc- 
curs in terms of the action-oriented and imagi- 
native content of men’s daydreams compared 
to the passive, reality-oriented daydreams of 
women. Men also are more accepting of their 
daydreams, more emotionally involved and ap- 
pear to use daydreaming for new approaches 
to problem solving. 

Although it is possible that the male-female 
differences reported here are primarily a func- 
tion of the patient status of the majority of 
Ss in this study, the lack of any significant 
interaction effects contraindicates this inter- 
pretation. Further, in a study (Wagman, 
1967) which appeared after this study was 
completed, sex differences of college students 
were strikingly consistent with the findings of 
our sample. 


Differences as a Function of Hospitalization 


This study reveals few differences in the 
daydreaming patterns of hospitalized com- 
pared with nonhospitalized patients (see 
Table 3). Hospitalized patients are signifi- 
cantly more accepting of their daydreams and 
report significantly more problem-solving day- 
dreams. On the other hand, hospitalized pa- 
tients report significantly fewer sexual and 
angry-emotional daydreams than do outpa- 
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TABLE 3 


ANALysIs OF VARIANCE oF INPATIENT VERSUS 
OUTPATIENT DIFFERENCES ON THE 
DAYDREAMING SCALES 


Variables Mean scores. 
(Daydream scale) z F 
Inpatients | Outpatients 


Acceptance of day- 


dreams 32.05 26.68 13.00** 
Sexual 11.35 15.93 7.86** 
Angry-emotional 6.34 8.54 6.91%* 
Problem solving 8.83 7.35 4,46* 
Frequency 91.15 107.74 3.62 
Physical sensation 

in daydreaming 22.25 20.60 2.80 
Visual imagery 9.24 8.09 2.68 
Anxious 6.35 7.46 2,40 

Note.—N = 200. 


A ged in order of magnitude of difference. 
*p <.05. 
=p <01. 


tients. No significant differences in frequency 
of daydreaming occurred, although the trend 
for hospitalized patients to daydream less than 
nonhospitalized patients was surprising. 

To explore this further, each subgroup of 
hospitalized patients, medical and psychiatric, 
were rank-ordered separately -according to 
length of hospitalization at time of testing. 
Each subgroup was divided by a median split 
into long and short duration of hospitaliza- 
tion. Male and female Ss were distributed 
evenly throughout long and short subgroups 
for each diagnostic category, thus ruling out 
possible interaction effects between sex, 
length of hospitalization, and frequency of 
daydreaming. 


TABLE 4 


FREQUENCY OF DAYDREAMING AS A FUNCTION 
OF LENGTH OF HOSPITALIZATION 


4 Length of hospitalization 
¥ 
Medical Psychiatric 
Hap inpatients inpatients 


Short | Long | Short | Long 


Mean number of days 


in hospital 2.1 | 98} 11.8} 51.9 
Mean score: frequency 
of daydreaming 69.5 | 98.3 | 114.1 | 95.4 
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As Table 4 indicates, length of hospitaliza- 
tion has clearly opposite effects on frequency 
of daydreaming for psychiatric compared with 
medical patients. Medical patients daydream 
more the longer they are hospitalized while 
psychiatric patients start out with a higher 
level of daydreaming, and daydream less as 
length of hospitalization increases. These find- 
ings are obviously confounded by the lack of 
equivalence between the short and long cate- 
gories of hospitalization for the medical com- 
pared with the psychiatric patients. It cannot 
be ascertained from these data whether hos- 
pitalization has a differential effect on day- 
dreaming according to the type of illness 
manifested by the patient, or whether length 
of hospitalization has a curvilinear effect on 
daydreaming frequency irrespective of type of 
illness. Research controlling for length of hos- 
pitalization seems warranted to clarify this 
point. 


Differences as a Function of Illness 


By far the most striking differences in day- 
dreaming patterns occur as a function of type 
of illness. Table 5 shows the analysis of vari- 


TABLE 5 


ANALYSIS OF VARIANCE OF PSYCHIATRIC PATIENTS 
versus MEDICAL PATIENTS VERSUS 
CONTROLS 


Mean score * 


Variable (Day- 
dreaming scale) 
Psychiatric | Control | Medical 

Anxious 8.63 5.19 
Angry-emotional 9.43 5.45, 
Modified aggressive 7.12 4.16 
Overt aggressive 5.50 3.00 
Frequency 117.99 80.90 
Sexual 16.89 10,39 
Conscience 6.01 3.30 
Frightened reaction 7.44 11.57 12.48 
Absorption in day- 

dreams 17.49 18.10 21.19 
Improbable 5.76 4.08 3.79 
Pregenital 5.43 3.00 3.11 
Achievement- 

oriented 8.50 7.98 6.00 
Heroic 6.08 5.30 3.40 
Repetitiveness 13.01 13.50 15.85 
Future oriented 18.75 15.05 18.36 
Factor H 48.14 48.85 52.06 


Note—N = 200. 
„^ Means connected with a single line are not signifi 
different from each other at thi aaa o 
Duncan's multiplerangetest. a > ~ > evel according to 
*p <05. 
wD <01. 
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ance of the three categories of illness (psychi- 
atric, medical, and controls) for the day- 
dreaming scales on which near-significant 
overall differences were obtained. Due to the 
difñculty in interpreting this three-way analy- 
sis, a Duncan’s multiple-range test, with 
special modifications as described above, was 
employed. 

In terms of frequency of daydreaming, 
psychiatric patients clearly report more day- 
dreaming than either controls or medical pa- 
tients. Athough medical patients report less 
daydreaming than controls, this latter dif- 
ference is not significant. 

A large cluster of scales having a highly 
affective content are endorsed most frequently 
by psychiatric patients and least frequently 
by medical patients. These include: improb- 
able daydreams, anxious daydreams, con- 
science daydreams, sexual daydreams, overt- 
aggressive daydreams, modified-aggressive 
daydreams, and angy-emotional daydreams. 
This cluster might be regarded as daydreams 
most highly characteristic of the psychiatric 
patients and least characteristic of medical 
patients. 

The only scale characteristic of medical 
patients only (the scale on which medi- 
cal patients score highest and psychiatric 
patients score lowest) is absorption in day- 
dreaming. Medical patients also score signifi- 
cantly higher than psychiatric patients on 
the following scales: Factor H, frightened 
reaction to daydreams, and repetitiveness of 
daydreams. 

It is clear that psychiatric and medical 
patients not only engage in different kinds of 
daydreams, but they have markedly different 
reactions to the experience of daydreaming. 
Psychiatric patients not only daydream more 
frequently than medical patients and nonpa- 
tients, but their daydreams are more improb- 
able (including more fantastic and flamboyant 
themes and an existence in impossible roles), 
more conscience-ridden (including themes of 
punishment and censure for personal wrong- 
doing), and include more themes of a pre- 
genital and sexual nature (including narcis- 
sistic, self-agerandizing, and self-gratifying 
themes such as owning expensive clothing, 
eating at great banquets, and being handsome 
and irresistible). Psychiatric patients also have 
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more highly affective and action-oriented day- 
dreams, including overtly aggressive disfigure- 
ment daydreams as well as more modified 
aggressive or nonphysically aggressive day- 
dreams and more angry-emotional daydreams. 
They also have more anxious daydreams, 
more heroic, and more achievement-oriented 
daydreams. Despite the highly affect-laden 
content of the daydreams of psychiatric pa- 
tients, they are less frightened by their day- 
dreams, less absorbed in daydreaming, and 
experience less repetitiveness of daydreaming. 

By contrast, medical patients (who do not 
differ from nonpatients in terms of overall fre- 
quency of daydreaming, nor in the amount of 
improbable, sexual, or pregenital daydreams) 
do report significantly less of the highly affect- 
laden daydreams than even the controls. 
Medical patients admit to significantly less 
aggressive, anxious, heroic, angry-emotional, 
and achievement-oriented daydreams than 
either psychiatric patients or controls. 

Despite the relative absence of themes of 
a highly ideational or affective nature in the 
daydreams of medical patients and the smaller 
frequency of daydreaming in medical patients, 
it is notable that medical patients are appar- 
ently more bothered by their daydreaming 
activity. They have a greater emotional re- 
sponse to daydreaming, are more absorbed in 
their daydreams, experience the most repeti- 
tive daydreams, and are most frightened by 
their daydreams. 

In summary, the reported fantasy life of 
psychiatric patients is characterized by a 
strong ideational component in which specific, 
highly affective, and fantastic themes are 
dominant. Although the nature of their day- 
dreams is highly affect-laden, psychiatric pa- 
tients report little emotional reaction to day- 
dreaming. Medical patients, on the other hand, 
experience little specific ideational or affective 
content in their daydreams, but do report a 
strong emotional reaction to daydreaming in- 
cluding being more absorbed in and frightened 
by their fantasy life. 


DISCUSSION 


This study supports a concept of day- 
dreaming as a personality characteristic on 
which a wide variety of human experiences 
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can be played. Sex differences in reported 
patterns of daydreaming seem to reflect quite 
directly the cultural stereotypes of male- 
female interests in the United States. Men 
engage in action-oriented, imaginative fanta- 
sies modeling the cultural role expectations of 
the male, while the women engage in probable, 
reality-oriented, and passive daydreams, re- 
flecting the accepted feminine roles of passiv- 
ity, attention to detail, and practicality. Thus, 
the respective daydreams of adult men and 
women appear as replications of the sex-role 
play of young children where the girls play 
house and have tea parties while the boys 
play pirates and take rocketships to the moon. 

Two questions should be raised regarding 
any such study of daydreaming as the present 
one: first, to what extent the reported day- 
dreams of Ss are representative of their actual 
fantasy life, and to what extent their reported 
daydreams may be related to their perceptions 
of what is socially desirable. There is little 
point in belaboring the first point, as it is 
difficult to imagine any way other than self- 
reports to measure an individual’s daydream- 
ing, so all that would ever be measurable 
would be reported daydreaming activity. Re- 
garding the second question, both Singer 
(1966) and Pytkowicz et al. (1967), report 
no significant relationship between the fre- 
quency of daydreaming and the construct of 
social desirability. 

The hospitalization effects are particularly 
interesting in view of other studies dealing 
with the relationship of decreased sensory 
stimulation to imagination. Studies of the 
fantasy life of blind children, for example, 
indicate that although they do engage in day- 
dreams, night dreams, and internal monologue, 
the entire fabric of their fantasies is less rich 
and imaginative and more concrete, simple, 
and reality-bound (Singer & Streiner, 1966). 
Similarly, children raised in institutions have 
a notable paucity of imaginative resources 
(Goldfarb, 1949). Singer (1966) points to 
the necessity for both a richness of external 
stimulation and an opportunity for privacy in 
the development of an active fantasy life. 
In this respect, the tendency for hospitalized 
patients to not only daydream significantly 
less about highly affect-laden sexual and ag- 
gressive themes as well as to be more accept- 
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ing of daydreams and to use daydreaming for 
problem solving is interesting. 

The anticipation that hospitalized patients 
might use daydreaming as an effective device 
for coping with the stress of hospitalization is 
not supported by these data. Procedural 
factors may be more at fault than the proposi- 
tion itself. It is quite possible that the Singer 
Daydream Scale is not sensitive to short-term 
changes in daydreaming activity and may be 
measuring more enduring personality charac- 
teristics, It is also possible, indeed suggested 
by this study, that length of hospitalization 
may be a more salient factor than the experi- 
ence of hospitalization per se. 

This is further complicated by the fact that 
psychiatric hospitalization is almost invariably 
longer than medical hospitalization due to the 
different goals of treatment during the hos- 
pitalization. Fantasy as a mechanism for 
coping with illness may well have been acti- 
vated at the outset of hospitalization or at 
the time of diagnosis and planning. Certain 
characteristics of the hospital system which 
encourage the relinquishment of responsibility 
and self-control on the part of the patient 
may actually inhibit spontaneous fantasy. 
Daydreaming may well be a manifestation 
of an individual’s feeling of personal choice 
and responsibility in a given situation—an 
attempt to work through possible alternatives 
and ideas, rather than an aimless flight of 
ideas in an apathetic state. Differential use 
of daydreaming as a function of length of 
hospitalization in medical compared with psy- 
chiatric patients is suggested by this study. 
Length of hospitalization, severity of illness, 
and individual differences of Ss are all factors 
heeding further investigation to understand 
the effect of hospitalization on spontaneous 
fantasy. The restitutive and protective func- 
tions of fantasy do not appear to have been 
adequately meaSured in the present study. 

The striking atterences between the day- 
dreaming patterns of medical and psychiatric 
patients as well as their marked differences in 
reaction to the experience of daydreaming are 
provocative. While obviously no cause and ef- 
fect interpretations can result from these data, 
it is interesting to speculate to what extent 
these distinctive daydream patterns are char- 
acteristic of the cognitive life styles of patients 
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who develop psychiatric symptoms compared 
with those whose symptoms are medical. Not 
only do the psychiatric patients apparently 
involve many of their worries and conflicts 
in their active fantasy life, but they do so 
with relatively little concern. Anxiety and af- 
fect is focused in the theme of the daydream 
rather than the experience of daydreaming. 

By contrast, medical patients deny active 
imaginative fantasies. Their daydreams, rather 
than being highly ideational, are loosely orga- 
nized, ruminative, repetitive, and egocentric. 
While denying affectively-toned themes, the 
medical patients report much reactivity and 
concern over the process of daydreaming, ap- 
parently attempting to consciously control 
and diminish this distasteful and frightening 
activity. The repressive and suppressive qual- 
ity that pervades their fantasy life may reflect 
the fear of individuals facing crucial health 
problems, or it may reflect a style of life 
that results in physical illness as a response 
to stress (Grace & Graham, 1952; Hinkle & 
Wolff, 1957). 

The view that daydreaming is in itself 
pathological is not supported by these data. 
Consistently, the mean scores of nonpatient 
control Ss fall between the mean scores of 
the psychiatric patients and the mean scores 
of the medical patients. If the spontaneous 
fantasy of the control Ss are taken as a norm, 
then the psychiatric and medical patients re- 
peatedly, on almost all of the 30 scales, de- 
viate in opposite directions from the norm. 
Although psychiatric patients report more 
daydreaming, there is no evidence from this 
study to suggest that the process of day- 
dreaming is uniquely associated with mental 
illness. Rather it appears more meaningful to 
conceptualize daydreaming as a cognitive ac- 
tivity, the quantity and quality of which is 
associated with a variety of personal dimen- 
sions of which two are sex and type of illness. 
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USE OF SPEECH IN THE INHIBITORY CONTROL OF MOTORIC 
BEHAVIOR OF SCHIZOPHRENIC PATIENTS* 


BEATRICE A. ASHEM 2 
University of Waterloo 


The conditions under which speech influences the speed of a tapping response in 
schizophrenics (N = 48) and nonschizophrenics (N =48) were experimentally 
investigated. The Ss verbalized three words (“goads”) under varied pretask 
instructions (Minimal and Slow Instructions) and two sources of delivery (self- 
delivered “Internal” verbalizations and other-delivered “External” verbaliza- 
tions). The results indicated that all groups inhibited speed of tapping for a 
Slow Instruction. The schizophrenics under the Internal goad condition inhib- 
ited response as well as the control group; under the External goad condition 
these patients were less able to inhibit response. For the Minimal Instruction 
condition the schizophrenics responded to the meaning inherent in the goads 
“faster” and “slower” for the Internal but not the External goad condition. 
The converse was true for the control group. 


Several papers in the recent literature have 
been concerned with the relationship between 
the slow performance which characterizes 
schizophrenic patients on simple motor tasks 
and situationally relevant variables. The 
search for relevant dimensions influencing 
performance separates into two categories on 
the basis of (a) identification of stimulus 
components and (6) those concerned with 
measures of response-mediated feedback. 

The experiments in the first category have 
indicated that deficit performance of schizo- 
phrenics on a reaction-time (RT) task can 
be eliminated by increasing the intensity of 
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the discriminative stimulus (Wienckowski, 
1959), or reducing distracting elements of 
the stimulus configuration (Payne & Caird, 
1967); those of the second category have 
found improvement on an RT task to be 
associated with increased information relative 
to the adequacy of the response (Lang, 1959). 
Buss and Lang (1967), following an extensive 
review of the literature, have suggested that 
if the schizophrenic is given more information 
relevant to his performance than is usually 
required by normal Ss his performance is 
adequate. This holds true for verbal condi- 
tioning and cognitive learning as well as 
motor performance. The authors hypothesize 
that the greater reliance of the schizophrenic 
upon external cues may be due to a break- 
down in self-instructional set. 

The import of isolating the variables influ- 
encing performance of the schizophrenic on 
any task is clear. The problem remaining to 
plague the investigators is whether the im- 
proved performance gained in the laboratory 
can be maintained outside of the experimental 
setting. 

Luria (1966) maintains that self-control 
(or internalized control) required for appro- 
priate responding to varying environmental 
demands in humans is attained only following 
the development of speech mechanisms. One 
might reasonably ask whether self-speech 
might be used to organize and control motor 
behaviors of schizophrenics. Two studies 
(Birch, 1966; Lovaas, 1964) dealing with 
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the symbolic representation of speech to in- 
fluence the motor behavior of children are 
relevant here. Both demonstrate that self- 
speech plays a central role in the inhibition 
and facilitation of simple motor responses. 

Lovaas (1964) had children bar-press while 
saying the words “faster” and “slower.” It 
was shown that there was a relationship be- 
tween the facilitory and inhibitory properties 
of the words and developmental level; the 
older children responded to the meaning com- 
ponent of the words, while younger children 
did not. It should be noted that pretraining 
was used, with reward contingent upon count- 
ing fast or slow in the presence of a light, to 
establish a “meaning” effect to the bar-press 
response. 

The purpose of the present study was to 
examine the conditions under which speech 
controls the speed of a simple continuous 
tapping response in schizophrenics and non- 
schizophrenics, The experiment differs from 
other studies attempting to delineate the vari- 
ables influencing motor performance of 
schizophrenics because it uses as a dimension 
of study S’s own speech as well as that of 
others, Further it examines the effects on the 
response of a pretask instruction which is con- 
sistent, irrelevant, or contradictory to words 
(“goads”) spoken during actual performance 
of a task. A preliminary investigation sug- 
gested that the meanings of the words “faster” 
and “slower” influenced the tapping perform- 
ance of schizophrenics and normals when Ss 
themselves spoke these words (Internal condi- 
tion). When Æ spoke the words (External 
condition), normals modified performance 
according to the meaning of the words but 
the schizophrenics did not respond to the 
meaning component of the words. Further, 
performance appeared to be modified to an 
important extent by a pretask instruction. For 
a Fast Instruction condition (e.g., Ss were 
asked to tap as fast as they could while 
verbalizations were occurring) no differences 
between the schizophrenics and normals were 
obtained; for a Slow Instruction condition, 
however, schizophrenics were significantly 
less able than normals to tap slowly when Æ 
spoke the goads (External condition) even 
though these patients were able to do so 
when they spoke the goads themselves. 
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The present study is a controlled examina- 
tion of the parameters involved in verbal 
control over a simple motor response in 
schizophrenics and nonschizophrenics. As the 
Fast Instruction condition had not yielded 
significant differences between schizophrenics 
and normals in the preliminary investigation, 
this condition was not repeated, 


METHOD 
Subjects 


The 48 Ss of the schizophrenic (S) group were 
selected from admission wards of four state hos- 
pitals in New Jersey according to the following 
criteria: a diagnosis of schizophrenia with no history 
of complicating neurological or physical disorders, 
male, between the ages of 23 and 45 years, length 
of hospitalization not over 3 months, no more than 
three previous hospitalizations, any previous hos- 
pitalizations not to exceed 3 months, not about to 
be discharged, no EST therapy within a 10-week 
period, nonassaultive and able to cooperate. All pa- 
tions were on phenothiazine drug treatment at the 
time of testing. Dosages were noted, and no patient 
with unusually high dosages was included. The mean 
age was 30.8. The mean IQ derived from a WAIS 
prorated vocabulary subtest was 105.4. 

The 48 Ss of the nonschizophrenic (C) group were 
21 convalescent patients at the Newark Beth Israel 
Hospital, 19 alcoholics who were voluntary admis- 
sions to the New Jersey State Hospitals, and 8 hos- 
pital attendants. The criteria for selection of Ss were: 
hospitalized at time of testing, between the ages of 
23 and 45 years, male, no known history of psychi- 
atric disorders, able to sit up freely, in no physical 
pain at the time of testing. Due to the difficulty of 
obtaining hospitalized patients to satisfy the estab- 
lished criteria eight hospital attendants were added 
to complete the sample. The mean age was 34.0; the 
mean IQ, 103.8. 

There was no danger of a possible response bias 
resulting from communication to one another of the 
nature of the task, as general hospital patients were 
from widely separated wards, To avoid such bias 
among the remainder of the Ss each was separated 
from the untested portion of the group as soon as 
testing was completed. 


Selection Tests 


All Ss were administered the Vocabulary subtest 
of the Wechsler Adult Intelligence Scale (WAIS) ; 
the Ullmann-Giovanni Self-Report Scale (Ullmann & 
Giovanni, 1964) and the Zigler Social Competence 
Scale (Zigler & Phillips, 1962) just prior to the 
experimental test. The Ss within the two major S 
groups were then matched on both the Ullmann- 
Giovanni Self-Report Scale (Ullmann) and the Zigler 
Social Competence Scale (Zigler). The purpose of 
the test was to ensure that differences obtained be- 
tween groups for the different experimental condi- 
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tions could not be attributed to bias deriving from 
individual differences in intellectual and social abili- 
ties. Separate analyses of variance of scores obtained 
on the Zigler and on the Ullmann showed the S 
group scored lower than the C group on both scales 
(F= 26, df=1/88 and F= 24.6, df = 1/88, respec- 
tively). The differences between the groups were 
significant beyond the .001 level. The differences in 
age and IQ were not significant. 


Apparatus 


The equipment consisted of a tapping board, an 
attached power supply and automatic counter, and a 
tape recorder, The tapping board had a wooden 
base, 94 inches X 94 inches, to which was attached a 
metallic plate, 5 inches square. A pencil, with a dual 
electric cord to the metallic plate and to the power 
supply, was attached. When the pencil touched the 
metallic plate it automatically activated the counter. 

The experiment took place in an office on the 
wards where the individual patients were located. 


Experimental Design 


There were three independent variables: level of 
pathology, source of goad, and pretask instruction. 
The level of pathology variable was divided into two 
conditions: schizophrenics and nonschizophrenics; the 
source of goad variable was divided into the Internal 
and External conditions; the pretask instruction vari- 
able was divided into the Minimal Instruction and 

_Slow Instruction conditions. For each condition, Ss 
faced verbal commands (goads) which were of three 
levels of consistency with the pretask instruction. 
One goad condition (the word “slower”) had high 
consistency with the Slow Instruction condition; the 
second condition (the word “letter”) was irrelevant; 
the third (“faster”), contradictory. A fourth condi- 
tion within each group was a control condition using 
no word (instructional set only). There was a total 
of eight S groups with 12 Ss in each group. Assign- 
ment of Ss to the experimental conditions was 
random. 

The goads were used by all Ss within each treat- 
ment condition in counterbalanced order. One-half 
of the Ss in each experimental condition was given 
the goads in the order “letter—faster-slower-letter”; 
the remaining half, in the order “letter-slower-faster— 
letter.” The “no-word” condition was used to com- 
Plete each series, that is, the presentation of the 
within-conditions was in the order “letter-(faster) — 
(slower)-letter-no word.” 

The dependent variable was the number 
averaged over two 30-second intervals, cope 
oe asanti the 7 were placed into a six re- 

measurements (including the operant level of 
tapping), 2X 2X 2 between-withi i i 
ance design. Se ects 90 vari- 

The number of times the verbal goad was pre- 
sented in a 30-second tapping trial was, Beie 
Internal condition, determined by the Ss themselves. 
For the External condition the number of times the 
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verbal goad was presented was determined by yoking 
each S with a “partner” in the Internal condition. 

The goads “letter,” “faster,” and “slower” were 
rated on a 7-point rating scale by each S at the 
end of the experimental session to evaluate the speed 
connotations which each of the goads possessed for 
the two S groups. 


Pretraining 


To ensure that Ss would understand that the goads 
to be used actually had speed connotations, all Ss 
received pretraining on the goads. 

The Ss were asked to say a word over and over 
for 15 seconds. The goad “letter” was given first. 
Four trials were given, and the number of repetitions 
of the goad were recorded by Æ. “Faster” or “slower” 
in counterbalanced order was given next. The cri- 
terion established for “faster” was a minimum of 
24 repetitions per 15-second interval; for “slower” 
the criterion was a maximum of eight repetitions per 
15-second interval. The S was told to say the word 
according to the meaning of the word. 


Procedure 


At the beginning of the experimental session, Ss 
were told that they were participating in an investi- 
gation of finger movements and muscular coordina- 
tion. The equipment was pointed out to them and 
briefly explained. 

The S was then told “I would like you to tap 
until I tell you to stop. Be sure to raise the pen 
from the board after each tap, and move across the 
board this way. Any questions? Ready? Begin.” Two 
trials were obtained, which served as the operant 
level of responding. The treatment to which S had 
been assigned was then applied. 

Minimal Instruction—Internal goad condition: 
Each S was told, “Now I would like you to tap 
while you are saying a word. Say the word all the 
while you are tapping—the word is ‘letter’ (S was 
shown the written word, and asked to enunciate it). 
Say the word ‘letter’ aloud all the while you are 
tapping. Ready? Begin.” The S was then told, “Now 
I would like you to say another word. The word is 
‘faster’ (‘slower’). You know what the word ‘faster’ 
(‘slower’) means, I am going to ask you to tap as 
you are saying the word ‘faster’ (‘slower’). Ready? 
Begin.” These instructions were repeated for the 
second word, “slower” (or “faster”), whichever 
occurred in the counterbalanced order. 

On the next trial Ss were asked to tap while saying 
the word “letter” again, This trial was included to 
assess any effects due to fatigue or practice. On the 
final trial, “no word” condition, Ss were asked to 
tap without saying anything. 

Minimal Instructions—External goad condition: 
The procedure and instructions were identical except 
for the appropriate changes. For instance, “Now I 
would Jike you to tap while I am saying a word. I 
am going to say the word all the while you are 
tapping—the word is ‘letter. I am going to say the 
word ‘letter’ aloud as you tap. Ready? Begin.” 
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TABLE 1 
MEAN TRIALS TO CRITERION ON PRETRAINING 
Minimal instructions Slow instructions 
Group Internal External Internal External 
Faster | Slower | Faster | Slower | Faster | Slower | Faster | Slower 
Schizophrenics 1.66 3.90 1.59 5.09 2.51 4.09 2.52 5.09 
Nonschizophrenics 1.84 4.50 1.53 5.50 2.09 4.51 1.61 3.73 


Slow Instructions—Internal goad condition: Each 
S was told, “I would like you to tap as slowly as 
you can while you are saying a word. Say the word 
all the while you are tapping—the word is ‘letter’ 
(demonstration and enunciation). Say the word 
‘letter’ aloud as you tap. Now tap as slowly as you 
can while you are saying the word ‘letter.’ Ready? 
Begin.” Then S was told, “Now I would like you to 
tap as slowly as you can while you are saying an- 
other word. The word is ‘slower’ (‘faster’). You 
know what the word really means. Say the word 
‘slower’ (‘faster’) aloud as you tap. I want you to 
tap as slowly as you can while you are saying 
‘slower’ (‘faster’). Ready? Begin.” 

On the next trial Ss were asked to tap as slowly 
as they could while saying the word “letter.” On the 
final trial, “no word” condition, Ss were asked to 
tap as slowly as they could without saying anything. 

Slow Instructions—External goad condition: The 
procedure and instructions were identical to those 
under the Slow Instruction—Internal goad condition, 
except for the appropriate changes as described under 
Minimal Instructions, External condition. 


The goads spoken by Æ were pronounced in the 
same manner throughout the experiment. A monoto- 
nous tone was used rather than an expressive one 
which might have conveyed a meaning to S. 


RESULTS 
Pretraining Data 


Table 1 presents the mean trials to criterion 
required by the schizophrenic (S) and control 
(C) groups on pretraining in the pacing of 
the words “faster” and “slower.” As there 
were no order effects from counterbalancing 
the words for either of the groups, the data 
were combined for further analysis, Separate 
analyses of the trials required to criterion for 
the words “faster” and “slower” revealed 
no differences between the S and C groups 
(F = 0.09, df = 1/88 and F = 0.9, df = 1/88, 
respectively). Both groups required twice as 


TABLE 2 


MEANS AND STANDARD DEVIATIONS OF TAPPING SCORES FOR 
Att EXPERIMENTAL CONDITIONS 


Internal External 
Instructions 
o L F S L N oO L iy S L N 
Minimal 
ges 115.1 | 68.0 | 81.7 | 63.5 | 72.6 | 125.7 | 125.4 | 120.3 | 126.1 | 117.2 | 120.7 | 133.5 
SD 41.3 | 35.4 | 40.4 | 40.8 | 38.1 | 47.4] 33.5) 42.8) 43.7) S15] 43.1) 35.1 
apse 126.4| 79.1 | 84.1 | 68.7 | 82.1 | 114.2] 115.5 | 114.9 | 127.8 | 105.5 | 127.0 | 135.6 
SD 34.3 | 48.3 | 48.5 | 50.0 | 54.8 | 59.1] 29.3) 47.0] 41.2) 554] 44.6] 41.9 
Slow 
wae aes 112.5 | 25.7 | 20.7 | 19.6 | 21.0 | 15.9 | 108.1) 40.9] 38.1] 34.0) 39.0) 34.0 
SD 249| 13.1 | 13.4 | 11.1 | 12.5 | 9.0] 28.8} 19.0] 26.5] 21.2] 23.0] 19.0 
S bag 112.2| 17.7 | 16.1 | 13.7 | 15.1 | 12.6 | 115.1] 17.4] 148] 125) 13.2] 113 
SD 380! 58| 65| 51| 7.2} 61| 289) 65| 60| 48| 61) 47 
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TABLE 3 


REPEATED MEASURES ANALYSIS OF VARIANCE OF 
TREATMENT EFFECTS ON TAPPING SCORES 


Source df MS F 
Between subjects 
Subject groups (A) 1 4,057 0.73 
Instructions (B) 1 | 696,737 | 126.15*** 
Delivery (C) 1 53,495 9.68" 
AXB 1 5,571 1.01 
AXC 1 3,923 0.73 
BXC 1 24,583 4.45* 
AXBXC 1 319 0.05 
Error (B) 88 5,523 
Within subjects 

Trials (T) 5 49,694 | 132.94*** 
TXA G 382 1.02 
TXB 5 | 26,409 | 70.65*** 
TXC 5 3,103 8.30"** 
TXSXI 5 517 1.38 
TXSXD 5 164 0.43 
TXIXD 5 1,894 5.078** 
TXSXIXD 5 667 1.78 
Error (W) 440 374 


many trials to reach the criterion for “slower” 
than for “faster.” 


Experimental Data 


Table 2 shows the mean scores for tapping 
for S and C groups for all conditions. A 
repeated-measures analysis of variance was 
performed on the data to evaluate the effects 
of counterbalancing the words (faster-slower 
and slower-faster). As there were no order 
effects for the C group (F = 1.42, df = 1/40) 
or for the S group (F = 0.85, df = 1/40) the 
data were combined for further analysis. 

Three variables were considered in this 
analysis: two levels of pathology (schizo- 
phrenics and nonschizophrenics), two sources 
of delivery (Internal goad and External goad), 
and two pretask instructions (Minimal In- 
structions and Slow Instructions). The results 
of this analysis are shown in Table 3. 

The table clearly shows that for both 
groups performance on the tapping task was 
modified as a result of the pretask instruc- 
tion. For a Slow Instruction both the S and 
the C group significantly decreased tapping 
speed (p < .001). However, it was noted that 
there was also a modification in speed of 
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tapping which was determined by whether the 
goads were Internal or External ( < .001). 
The significant interaction of main effects, 
Instructions X Delivery, indicates qualifica- 
tions of these effects, however. 

To delineate more precisely the controlling 
factors involved in the interaction between 
Ss, and the number of significant interactions 
within Ss, Trials X Delivery, Trials x In- 
structions, and Trials X Instructions X Deliv- 
ery, Duncan’s multiple-range test was used to 
make relevant comparisons. The results will 
be described by noting the effects on the tap- 
ping performance of S groups of the treat- 
ments and then examining the effects of the 
goads on the groups. However, before describ- 
ing these effects it must be noted that a com- 
parison of the mean difference in operant 
levels of tapping performance for all groups 
and conditions was not significant. 


Instruction and Delivery Effects on S and C 
Groups 


Both S and C groups significantly inhibited 
speed of tapping for a pretask Slow Instruc- 
tion (p < .001). However, while the schizo- 
phrenics tapped as slowly as the controls when 
self-verbalizing (Internal goad condition) this 
was not true for the External goad condition. 
Speed of tapping of schizophrenics under the 
External condition was significantly faster 
than the controls (probabilities ranging from 
05 to .01 for all goad conditions). The dif- 
ference was most pronounced when there was 
no External goad, that is, the “no word” con- 
dition (p < 001). The schizophrenics in the 
External condition did not tap significantly 
faster than the schizophrenics in the Internal 
goad condition, however. Further analysis 
showed this was due to the fact that the goad 
“slower” in each of these conditions did not 
reach an acceptable level of significance; all 
other goads were significantly separated. 
When the C groups were instructed to slow 
down, the source of delivery of goads, Internal 
or External, was not a factor influencing 
performance. 

For the Minimal Instruction condition there 
was a significant decrement in speed of tap- 
ping from the operant level which occurred as 
a function of self-verbalization (Internal goad 
condition) for both groups (p< .001). No 
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such inhibition of tapping occurred in either 


of the S groups when the goads were External. © 


Word Meanings and Performance 


The effects of the meanings of the goads on 
performance is best observed in the Minimal 
Instruction condition. Rate of tapping of 
schizophrenics for the goads “faster” and 
“slower” differ significantly for the Internal 
goad condition (p< .05), but not for the 
External goad condition. There was some in- 
hibition of response for the goad “slower.” 
However, this was only significant when com- 
pared with the upsurge of tapping that oc- 
curred for the “no word” condition (p < .05). 

The goad difference for the C group in the 
Internal condition did not reach an acceptable 
level of significance. However, for the Exter- 
nal condition these Ss responded to the 
meanings implied in the goads “faster” and 
“slower” by tapping faster and slower, respec- 
tively (p < .02), when the Æ spoke the words. 

A pretask Slow Instruction was sufficient to 
eliminate the influence that the goad meanings 
themselves might have exerted over tapping 
for both the S and C groups (p < .001). Little 
or no fluctuation in tapping occurred as a 
function of the goads themselves when Ss had 
been given an instruction to tap slowly. 

There is additional evidence that an in- 
struction is superior to a goad in the inhibi- 
tion of response. A comparison of the effec- 
tiveness of the goad “slower” for Minimal 
Instructions and Slow Instructions revealed 
that both S and C groups were better able 
to inhibit speed of tapping for the Slow 
Instructions (p < .001). 


Rating Scale 


A rating scale was used to determine 
whether the words “faster” and “slower” 
might have differing speed connotations for 
the two S groups. Pearson’s chi-square of 
association was used to analyze the differences 
in rating these words by the two groups. 
There were no significant differences for the 
words “letter? or “slower.” However, schizo- 
phrenics rated the word “faster” significantly 
More “rapid” than the C group (x*= 26, 
p < 001). 
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DISCUSSION 


Relative to the questions asked concerning 
the parameters of verbal control of nonverbal 
behaviors of schizophrenics and nonschizo- 
phrenics the results of the present study indi- 
cated that: (a) Source of delivery of goads, 
Internal or External, is an important param- 
eter of verbal control, but its effects are re- 
lated to pretask instructions and to level of 
pathology. When no pretask instruction was 
given the externally delivered goads appeared 
to “instruct” the C group, but not the S 
group. It was noted, however, that this was 
true only for the External goad “slower,” 
which implicitly demands response inhibition 
by the Ss. For the goad “faster,” which de- 
mands no such inhibition, performance was 
practically the same for the two groups. The 
Internal goads had the interesting effect, for 
both groups, of markedly decreasing tapping 
speed from the operant level. There was also 
a significant decrease in tapping performance 
from the External goad condition. That is, 
while Ss were saying the goads they seemed 
unable to tap at the rapid pace which other- 
wise characterized their performance. When a 
pretask Slow Instruction was introduced both 
the S and C group were “instructed” in their 
performance. However, the degree to which 
schizophrenics were able to slow down their 
response, following the pretask instruction, 
was dependent upon source of goad delivery; 
the C group responded to the Slow Instruction 
in the same manner whether goads were Inter- 
nal or External. Specifically, schizophrenics 
in the External goad condition tapped con- 
siderably faster than the schizophrenics in 
the Internal goad condition, The C group 
tapped at the same speed whether the goads 
were External or Internal. (b) The use of 
goads consistent, irrelevant, or contradictory 
to a pretask instruction is not a relevant di- 
mension in verbal control of motor responses 
of schizophrenics or nonschizophrenics. While 
the External goads had a disinhibitory effect 
on speed of tapping of the schizophrenics, 
relative to other groups, there was no evidence 
that this disinhibitory effect was related to 
the verbal inconsistencies of the Instruction 
goad condition. There was little variation in 
speed of tapping for any of the goad condi- 
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tions whether these were consistent, irrelevant, 
or contradictory to pretask Instructions. 

The similarity of schizophrenics and non- 
schizophrenics for the Internal goad condition, 
in the present study, suggests a reevaluation 
of the role of external verbalizations and, 
possibly, external stimuli, in the modification 
of the behavior of schizophrenics. Paradoxi- 
cally, while schizophrenics have been shown 
to improve their performance when given dis- 
tinctive external cues concerning response per- 
formance (Lang, 1959), these patients are 
also characterized by their poor ability to 
utilize external cues in the modification of 
performance. Following a review of the litera- 
ture, Buss and Lang (1965) report that on 
verbal conditioning tasks schizophrenics were 
consistently less able than normals to make 
use of verbal cues, for example, “right” or 
“wrong,” to improve their performance. 
Letchworth (1966) reports that even with a 
pretask “focusing” instruction, which is 
known to facilitate the subsequent condition- 
ing of normals, schizophrenics appear unable 
to make use of external verbal conditioning 
cues. 

The findings that schizophrenics perform as 
well as normals for the Internal goad condi- 
tion suggests that the most effective method 
by which to modify maladaptive behaviors of 
schizophrenics might be one which utilizes 
“self-speech” to produce voluntary control of 
responses, However, while “self-speech” may 
be beneficial in providing the schizophrenic 
with a means of inhibiting responses at will, it 
would scarcely have generality unless a con- 
comitant ability to respond to the external 
cues was also acquired. The relevant differ- 
ences which characterized the S and the C 
groups related to the ability of the C group 
to be guided by External goads in the inhibi- 
tory control of response. Conceivably, to de- 
velop in schizophrenics a_ self-instructional 
control which takes cues for control from 
external sources it might be necessary to con- 
dition the external control cue to the “self- 
speech” of the patient. 

In this respect, it is of interest that per- 
formance of schizophrenics, in the present 
study, for the “no word” condition of the 
Slow Instruction treatment was markedly 
similar to tapping speeds in the goad condi- 
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tions preceding it. Schizophrenics inhibiting : 
response under the Internal goad apparently. 
acquired a self-instructional control available i 
to them in the “no word” condition. k 
The failure to find that goads which were 
irrelevant or contradictory to the instruction 
disrupted performance of the schizophrenics 
on the tapping response is of interest. There 4 
is a theoretical formulation of schizophrenia, ~ 
the double bind theory (Bateson, Jackson, 
Haley, & Weakland, 1956), which hypothe- 
sizes that the schizophrenic disorder is the% 
end product of a faulty communication sys- 
tem which pulls the patient in opposing direc- 
tions as he is required to respond to incon: 4 
sistent and contradictory messages conveyed ~ 
to him by a dominant figure or figures in his — 
life. The present study provides no evidence 
that schizophrenics, or nonschizophrenics, are ~ 
disturbed in their performance by the con- f 
flicting messages they receive. However, this 
study alone cannot be taken as an indictment © 
of the double bind theory. The theory assumes ~ 
that on one level the message to the schizo- — 
phrenic involves emotional rejection. The de- | 
sign of the present study made it impossible” | 
to evaluate such components of the verbal 
message. However, as Schuham (1967), fol- 
lowing a review of the literature, has con- 
cluded, there is little or no empirical support 4 
for the hypothesis, at present the very exis- 
tence of the phenomenon appears open to 
serious question. g 
The ability of the schizophrenics, in the ~ 
present study, to maintain a response set, 
unaffected by irrelevant stimulus elements, is” 
contrary to results reported by other investi- 
gators (Payne & Caird, 1967; Rodnick & — 
Shakow, 1940; Shakow, 1962). The hypothe- 
sis of a failure in response set put forward 7 
by these investigators has been based primar- $ 
ily upon RT tasks in which S is penalized for ~ 
slight® delays in responding to irregularly 
placed, or unusually distracting, stimuli. The — 
differences in the results obtained in the 
present study and those of others may be due 
to the nature of the tasks which were used, 
or to the pretask instruction. The most 
plausible explanation would appear to be that 
the pretask instruction overcame the “set” — 
deficit described by Shakow (1962). A 
Why the schizophrenics were less able to 
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inhibit response for the externally delivered 
goads than for the internally delivered goads 
is not immediately clear. It appears to this 
author that this deficit behavior represents a 
failure in self-control which occurred because 
these patients were less able to use the exter- 
nal discriminative stimulus to control re- 
sponses. (When no element of control was 
involved, that is, no inhibition of response, 
as for the goad “faster,” the schizophrenics 
responded as the controls.) Possibly, when 
schizophrenics were told to tap as slowly as 
they could and then heard Æ saying the goad 
“letter,” the task was more confusing than 
when Ss verbalized the goad “letter.” Such an 
interpretation would be in agreement with the 
position taken by Lang and Buss (1965) 
that schizophrenic deficit performance is 
an increasing function of task ambiguity or 
complexity. 

However, it could be argued that the dif- 
ferences observed between the groups for the 
Internal and External conditions were due to 
a confounding of the frequency of goad pres- 
entation. Goads presented at a rapid rate 
could, conceivably, serve to confuse S in his 
assessment of task requirements. It will be 
recalled that the number of goads spoken by 
Ss in the Internal goad condition determined 
the frequency with which the external goad 
was presented. The mean frequency of the 
goad “faster” for schizophrenics was 57 
spoken words for 30 seconds, for the control 
group it was 45.5; for the goad “slower” the 
mean was 46.5 for the schizophrenics and 
36.5 for the control group. 

There are several arguments which mitigate 
against the possibility that the differences 
between the two groups were due to fre- 
quency of goad presentations. First, the 
difference score between “faster” and “slower” 
was not different for the S group (mean 
difference = 11) and the C group (mean dif- 
ference= 9.1). Second, the pilot data indi- 
cated a similar inability of schizophrenics to 
vary a tapping response according to the 
meaning of the External goads when the 
goads had been paced at a constant 2-second 
interval for schizophrenics and controls alike. 
Finally, under the Slow Instruction condition 
there was little difference in the frequencies 
of the goads for the schizophrenic and control 
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groups. (The means for “faster” were 23.9 for 
the schizophrenic group and 20,2 for the con- 
trol group; for “slower” the mean was 17.5 
for the schizophrenic and 15.8 for the control 
group. These differences were not significant.) 
If the S group had tapped at a rate which 
corresponded with the number of External 
goads presented there should have been no 
differences between this group and the C 
group. It was noted that the C group never 
tapped at a faster rate than the rate of 
External goad presentation. 

While it could be argued that the present 
findings indicate, experimentally, the autistic 
style of control characteristic of schizo- 
phrenics, analogous findings with children 
have shown that self-verbalizations exert con- 
trol over subsequent responses on concept re- 
versal tasks, normally beyond the abilities of 
these children (Silverman, 1966). With the 
schizophrenics, as well as with children, two 
aspects of self-verbalization, a focusing set 
and increased information from response 
mediated feedback, may interact to provide 
the inhibitory control necessary for appropri- 
ate response, The present study indicates the 
need for research into ways of influencing the 
“self-speech” of schizophrenics. 
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SELF-CONCEPT OF THE MARRIED PSYCHIATRIC PATIENT 
AND HIS MATE’S PERCEPTION OF HIM 
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Twenty-six married psychiatric patients and their spouses were administered 


a 50-item questionnaire to assess their concepts of themselves and of each other 
1 week after acute hospitalization and then again 7 weeks later. The patients’ 
initial self-images and their mates’ perception of them were both more negative 
than their own and their spouses’ images of the average person (p < .001). 
The patients and their mates viewed the patient as the most disturbed family 
member, with both seeing the spouse in a more positive light than the patient 
(p < 01). The patients’ self-images improved during hospitalization (p< .01). 
The spouses’ images of the patients also tended to become more positive 
(#:< .10), but the spouses still viewed the patients more negatively ` than 


concept in the recent literature (Wylie, 1961), 
there are relatively few systematic studies of 
the self-perception of hospitalized adult psy- 
chiatric patients and of changes in'their self- 
perception during hospitalization. In addi- 
tion, despite the current popularity of family 
treatment and of theories about the role of 
the family in psychopathology (Ackerman, 


| 
| 
| 
g themselves. 
Despite the increasing emphasis on the self- 


1966), there has been even less research on 
how family members perceive the hospitalized 
psychiatric patient. 

Our previous research with hospitalized 
adolescent patients and their families indi- 
cated that despite frequent family discord 
both patients and their parents viewed each 
other in a way that was generally in accord 
with the views of society (i.e, the patients 
were seen in a negative light and the other 
family members were viewed in a more posi- 
tive manner) (Harrow, Fox, Markhus, Still- 
man, & Hallowell, in press). The current re- 
search attempts to determine whether this 
trend is also true of marital families. 3 

The specific questions which the present 
study attempts to answer are: 

1. What is the self-perception of the mar- 
ried psychiatric patient shortly after his ad- 
mission to a psychiatric hospital? 

2. Do patients and/or their mates see only 

į the patient in a negative light (or as the most 

1 Requests for reprints or a copy of the 50 items 

used in the test may be obtained from Martin 


arrow, Department of Psychiatry, Yale University, 
33 Cedar Street, New Haven, Connecticut 06510. 


disturbed family member), or do they see 
both the spouse and the patient negatively? 
3. Does the patient’s self-concept change 
during the course of hospital treatment, and 
do other family members’ (e.g., his spouse’s) 
perceptions of the patient change? 


METHOD 
Subjects and Setting 


The'Ss were 26 married psychiatric inpatients and 
their husbands or wives. The mean age of the patient 
sample was 37.5 years and the mean educational level 
was 14 years, There were 9 males and 17 females 
among the patient population. Diagnostically the pa- 
tients included 10 schizophrenics (38%), 10 depres- 
sives (38%), 4 character disorders or neurotics 
(15%), and 2 of varied diagnoses (8%). 

The patients were all recent admissions to the 
Acute Psychiatrie Inpatient Division of Yale-New 
Haven Hospital. This ward, a short-term intensive 
treatment unit, emphasizes group, family, and social 
treatment (Astrachan, Harrow, & Flynn, 1968; 
Detre, Kessler, & Jarecki, 1963). 


Procedure and Data Collection 


The patients and their spouses were tested, indi- 
vidually, at two time periods: (a) Time Period 1 
(TP1), a week and a half after hospitalization, and 
(b) Time Period 2 (TP2), 7 weeks later. Complete 
sets of data were collected at both TPs from 23 
patients and their spouses, and partially complete 
data were collected from the other three patients and 
their mates due to the difficulty of testing both 
patients and their spouses at two TPs. 

During pilot work the original deck of 100 Q-sort 
items used by Butler and Haigh (1954) was reduced 
to 50 to eliminate items that would be difficult to 
understand by patients with below-average intel- 
ligence. Pilot work also suggested advantages in 
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TABLE 1 
MEAN ÅDJUSTMENT SCORES AT Time PERIOD 1 anp Tm Perrop 2 
: Mean adjustment scores laui 
People rating Concept rated TPI vs. TP2 
ee ee 
Patients Self 3.68** 
Patients Self r Ideal 4,240" 
Patients Ideal 2.20* 
Patients Average person 1,90 
Patients Spouse 2.77* 
Spouses Patient 1.73 
Spouses Ideal 1,83 
Spouses Self 1,35 
Spouses Average person 0.67 
Normal controls Self —_ 


Note.—TP1 = Time Period 1; TP2 = Time Period 2, 


s Reone mean correlation (using Fisher s transformation) between patients’ self-images and their ideals for themselves. 
<05. 


p 
** p < 01. 
rr > < .001. 


modifying the typical forced-choice distribution, due 
to difficulties encountered by disturbed patients and 
anxious mates in attempting to comply accurately 
with forced-choice instructions. The final test form 
was a 50-item paper-and-pencil test, with each item 
rated on 9-point Likert-type scales. 

At each TP the patient and his mate rated each 
of the following concepts, separately, on the 50 
items: (a) the patient, (b) the patient’s spouse, 
(c) the ideal for the patient, and (d) the ordinary 
average person. 

Each of the 50 items had been rated for “adjust- 
ment” and for social desirability in a previously 
reported study (Wiener, Blumberg, Segmen, & 
Cooper, 1959). Using these ratings, the 50 items 
(22 positive, 22 negative, and 6 neutral items) were 
combined and a total adjustment score was calcu- 
lated for each image of each patient and his mate. 
Scores for each image could range from +44 to —44. 
The major part of the data consisted of 16 adjust- 
ment scores for each family (one for each image 
at each of the two TPs) and correlations of all 
images within each family. Ratings of family inter- 
action and of the daily behavior and symptoms of 
all patients were collected from the patients’ thera- 
pists and from other staff observers at each testing 
period. 

RESULTS 


Table 1 presents the mean “adjustment” 
scores at both TPs and £ tests (Mair) com- 
paring the scores at TP1 with those at TP2. 
Table 2 reports the £ tests comparing the key 
concepts with each other at TP1, and also 
comparing these key concepts with each other 
at TP2. 


The initial adjustment scores derived from 
the self-images of the patients were signifi- 
cantly lower than their own and their spouses’ 
images of the ordinary average person (Tables 
1 and 2, p< .001). Their initial self-images 
were also significantly more negative than the 
self-images of a control group of 20 “normals” 
used in a previous study, employing the same 
test instruments (Tables 1 and 2, p < .001). 

In addition, the patients’ initial views of 
themselves were significantly more negative 
than their views of their spouses (Tables 1 
and 2, p < 01). The spouses tended to agree 
with this, seeing themselves in a more favor- 
able light than they saw their patient-mates 
(Tables 1 and 2, p < .001). Thus, both the 


patients and their spouses tended to see the — 


spouse in a relatively positive light. As fur- 
ther evidence for this, the patients’ spouses 
did not see themselves as significantly more 
negative than the average person (Table 2, 
p > .10), and the patients’ views were in gen- 
eral agreement with this (Table 2, p > .10). 

After 8 weeks of hospitalization the mean 
correlation between the patients’ self-images 
and their ideals had become significantly 
higher (Table 1, p< .001). Furthermore, 
their self-images had become significantly 
more positive (Table 1, » < .01), but were 
still lower than both their own and their 
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TABLE 2 


TESTS OF SIGNIFICANCE COMPARING THE KEY Concerts “AT Time PERIOD 1 
AND Time Peron 2 


t 
People rating | Concept rated vsa People rating | Concept rated 
Time Period | Time Period 
1 2 
Patients Self vs. Normal Controls | Self 4,37 1.31 
Patients Self vs. Patients Average person | 5.20*** 2.23* 
Patients Self vs. Spouses Average person | 4.92*** 2.29% 
Patients Self vs. Patients Spouse 3.39** 1,58 
Spouses Patient vs. Spouses Self 4.31%" 2.50* 
Patients Spouse vs. Patients Average person | 1.09 0.33 
Spouses Self vs. Spouses Average person 1.58 1,15 


oP all Fite the mean adjustment score for the second concept listed is higher than that for the first concept listed. 


spouses’ image of the average person (Table 2, 
p< .05). 

The spouses’ images of the patients also 
tended to become more positive 7 weeks later 
(Table 1, p< .10), but less than did the 
patients’ own views of themselves. Further- 
more, at TP2 the spouses still viewed the 
patient more negatively than themselves, and 
the patients showed some tendency to concur 
in this view (Table 2). 


Discussion 
Initial Self-Concepts of Patients 


The initially negative self-concepts of the 
Patients are in marked contrast to the images 
of the average person held by both the pa- 


tients and their spouses, and are significantly 


More negative than the self-concepts of the 


“formal” control group. In addition, when 


or 


h 


* 


we compare the patients’ initial views of 
themselves with their views of their mates, 
we also find a significant difference. Thus, 
these married patients see themselves as the 
More disturbed family member and this is in 
agreement with our results on hospitalized 
adolescent patients (Harrow et al., 1968). 
The dramatic impact of acute hospitaliza- 
tion, with its social stigma (Lamy, 1966), 
may have influenced the patient’s self-per- 
Ception. Although hospitalization frequently 
the patient in a more supportive setting 
than before, with some subsequent symptom 


reduction (Colbert & Harrow, 1967; Harrow 
et al., 1966), the complete removal from the 
home and family also implies a collapse of 
the patient’s previous adjustment. This im- 
plication of collapse probably lowers the 
patient’s self-concept. 

In addition, the duration of previous out- 
patient therapy by the patients was related 
significantly to their initial self-concepts 
(r = —.45, N = 26, p < .05), tentatively sug- 
gesting that more chronic patients who have 
been in treatment longer have more negative 
self-concepts. 


Spouses’ Initial Views of Patients and of 
Themselves 


The patients’ mates agreed with the pa- 
tient? negative self-evaluations, although 
their initial judgments were not as harsh as 
were the patients’. In contrast, both marital 
partners consistently saw the nonhospitalized 
spouses in a more positive light. These results 
occurred despite sharp marital discord in some 
families, intense ambivalence and overinvolve- 
ment, and acting out by some patients and 
their mates towards each other. This negative 
view of the patient by another family member 
is also in agreement with data we have re- 
ported on the views of families of hospitalized 
adolescents. 

The present results and those of our pre- 
vious work contain some interesting implica- 
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tions, since almost all patients and families 
studied were seen in family treatment. While 
the patient and his family may pay lip service 
to the notion that in family treatment both 
parties are “patients,” both of their moder- 
ately positive evaluations of the nonhospital- 
ized mate suggest only a partial acceptance 
of the idea that both are patients. 


Changes in Patients’ Self-Concepts and 
Spouses’ Views over Time 


The patients’ more positive self-concepts 
7 weeks later seem to reflect their clinical im- 
provement, In addition, the correlation of the 
patient’s ideal with his view of himself also 
became significantly higher over time. 

Since the patients’ ideals tended to become 
more positive at TP2, the higher correlations 
of patients’ self-images with their ideal images 
were not due to a lowering of ideal images 
(see Table 1), but may have been related to 
the patients’ self-images becoming more posi- 
tive at TP2. It should be noted in contradic- 
tion to some theories which postulate more 
realistic (and lower) patient ideals as clinical 
improvement occurs, the results with the pres- 
ent sample and two previous samples of ours 
suggest that as patients improve clinically and 
their self-images become higher, there is a 
slight trend for their ideals to become higher. 
Although this result may run counter to some 
theories, it would fit in with a model in which 
peoples’ expectations become higher as their 
competence increases. 

The spouses agreed with the more positive 
self-evaluations of the patients. At TP2, how- 
ever, the spouses still viewed the patients 
more negatively than themselves (p < .05), 
and there was also some tendency for the 
patients to see themselves more negatively 
than they saw their spouses. This occurred 
despite the more positive image of the pa- 
tients held by both parties. These results 
concur with the view of society at large, since 
the patient resides in a mental hospital and 
is still considered “sick,” despite his consider- 
able clinical improvement during the 7-week 
interim. The tendency for the spouses to view 
the patients more negatively than they see 
themselves, and the less optimistic view by 
the spouses concerning patients’ improvement, 
may create subsequent difficulties for the pa- 
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tients after their discharge from the hospital, 
as there is evidence that relatives’ attitudes 
and expectations bear some relation to pa- 
tients’ posthospital success (Freeman & Sim- 
mons, 1963). 

In relation to changes in the patients’ and 
spouses’ views, the present authors computed, 
at each TP, correlations of patients’ with 
spouses’ views of the patients, and also cor- 
relations of patients’ with spouses’ views of 
the spouses. These correlations indicated that 
there was significantly greater agreement as 
to what the patients (and the spouses) are 
like at TP2 as compared to TP1 (p < .02). 
The greater agreement about what the pa- 
tient is like may be related, in part, to a 
reduction in turmoil and conflict around the 
patient as time passes since acute hospitaliza- 
tion, and as the patient improves. The data 
indicating greater agreement about the spouse 
may also suggest some reduction in conflict 
concerning the spouse (who either assisted or 
was involved in the hospitalization of the 
patient) as time passes. In addition, the pres- 
ent hospital setting emphasizes family treat- 
ment, with patients and families being in- 
volved in conjoint family therapy (Satir, 
1964) and multiple family group therapy 
(Harrow, Astrachan, Becker, Detre, & 
Schwartz, 1967), and this may have played 
some role in the greater family agreement 
about what the patient and spouse are like 
at TP2. 
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EFFECTS OF ANXIETY ON TIMED AND 
UNTIMED INTELLIGENCE TESTS: 


ANOTHER LOOK a 
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The timed subtests of the WAIS were administered to 48 Ss, only half of whom 
were aware of being timed. The Ss were divided into four groups on the basis 
of their response to 15 worry items and 15 emotionality items taken from the 
Taylor Manifest Anxiety scale. Low-worry Ss who were “timed” were superior 
to those who were “untimed,” whereas high-worry Ss tended to perform more 
poorly under the “timed” than the “untimed” condition, This Worry X Timing 
interaction closely parallels the “Anxiety” X Timing interactions found by 
others. Emotionality was not found to interact with the task variables. 


The major formulations of the relationship 
of anxiety to performance (Mandler & Sara- 
son, 1952; Spence, 1958) have led to an ex- 
amination of various aspects of performance 
situations, some of which have differential 
effects on high- and low-anxious Ss. One factor 
to which attention has been directed in this 
regard is the timing of performance. Thus, 
Matarazzo, Ulett, Guze, and Saslow (1954) 
found anxiety, measured by Taylor Manifest 
Anxiety (MA) scale scores, to be negatively 
related to scores on a timed intelligence test, 
but unrelated to scores on untimed measures 
of intelligence. Similarly, Siegman (1956) ad- 
ministered the MA scale and the Wechsler 
Adult Intelligence Scale (WAIS), compared 
the performance of the high- and low-anxious 
Ss on the timed and untimed subtests, and 
found that “only the high MAS scorers had 
significantly lower scores on the time-limited 
subtests of the WAIS [p. 177].” And, in an 
experimental study, Sarason, Mandler, and 
Craighill (1952) found high- and low-anxious 
Ss, identified by Test Anxiety Questionnaire 
(TAQ) scores, to be differentially affected by 
different instructions involving the timing of 
a task. Specifically, Sarason et al. gave college 
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versity, Nashville, Tennessee 37203. 


students a task which was impossible to 
finish within the allotted time. Half of these 
Ss were told that they were expected to finish; 
the rest were told that they were not expected 
to finish. Low-anxiety Ss were found to per- 
form better under the stressful (i.e., “expected 
to finish”) instructions than under the non- 
stressful condition. Differences in the perform- 
ance of high-anxious Ss were slight, but, as 
predicted, they tended to perform less well 
under stress than otherwise. 

In an attempt to refine existing conceptuali- 
zations, the present authors have recently 
questioned the value of viewing anxiety 
as a unitary state and have proposed an em- 
pirical examination of the component feelings 
and cognitions which are popularly called 
“anxiety” (Liebert & Morris, 1967). Spe- 
cifically, we posited a distinction between 
cognitive and physiological-affective anxiety 
components. Thus, worry (W) was identified 
as cognitive concern about one’s performance, 
while emotionality (Z) referred to autonomic 
reactions which occur in response to the stress 
of the test-taking situation. Within this frame- 
work it has been demonstrated that (a) worry 
varies as an inverse function of one’s perform- 
ance expectancy in a test situation, while emo- 
tionality is unrelated to expectancy; and (b) 
emotionality rises sharply immediately before 
the test and decreases sharply immediately 
after leaving the test situation, whereas worry 
reaches a high level earlier and does not dis- 
sipate so rapidly (Liebert & Morris, 1967; 
Spiegler, Morris, & Liebert, 1968). 
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EFFECTS oF ANXIETY 


The present study attempted to apply the 
worry-emotionality distinction to an examina- 
tion of the relationship, of anxiety to intel- 
ligence test performance. We anticipated a 
partial replication of the findings cited above 
with respect to a Timing x Anxiety interac- 
tion and advanced two additional hypotheses 
derived from the worry-emotionality distinc- 
tion. Manipulation of task difficulty was also 
included (a) because this dimension has often 
been found to interact with other task ma- 
nipulations in their effect on the performance 
of high- and low-anxious Ss (e.g., Sarason & 
Palola, 1960), and (b) because difficulty is 
closely related to performance expectancy, 
which has been used in prior worry-emotion- 
ality studies. 

The major hypotheses of this study were 
as follows: 

1. High-anxious Ss should do better on un- 
timed than on timed tests, whereas low- 
anxious Ss should do better on timed than on 
untimed tests. This is based on the findings 
cited above. 

2. (a) Timing of the test was conceived of 
as an external, stressful aspect of the testing 
situation, and was thus expected to elicit pri- 
marily emotionality. (b) Assuming emotion- 


ality to be inhibitory in nature, we would’ 


therefore expect high-emotionality Ss to per- 
form poorly on timed tests relative to untimed 
tests. Like the low-anxiety Ss, low-emotion- 
ality Ss should perform better on timed than 
on untimed tests. 

3. (a) Presentation of a difficult test was 
expected to lower performance expectancy, 
thus causing worry to increase. (b) Assuming 
worry also to be inhibitory in nature, we 
therefore expected that high-worry Ss would 
be inferior to low-worry Ss on hard tests, but 
superior to them on easy ones. 

With respect to Hypotheses 2 and 3, it was 
assumed that the predicted differential effects 
of the anxiety components would occur only 
if the manipulations of timing and difficulty 
were effective in eliciting emotionality and 
worry, respectively. In other words, if Parts a 
of these hypotheses were not confirmed, then 
this experiment would be unable to provide a 
test of Parts b. 
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METHOD . 
Subjects 


The Ss were 48 students in introductory psychol- 
ogy at Vanderbilt University, all of whom volun- 
teered in order to receive the experimental credit 
required for completion of the course. They were 
grouped into four anxiety groups (high worry-high 
emotionality—-HH; high worry-low emotionality— 
HL; low worry-high emotionality—LH; low worry- 
low emotionality—LL) on the basis of their response 
to an anxiety questionnaire administered to all stu- 
dents (approximately 350) early in the semester. This 
“Experiences Questionnaire” consisted of 15 worry 
items and 15 emotionality items, all of which were 
taken from the MA scale. On the basis of indepen- 
dent judging by the authors, items were chosen for 
which there was agreement that they reflected either 
cognitive concern or autonomic arousal. Students re- 
sponded on the basis of a 5-point scale: “The state- 
ment is never true, seldom true, sometimes true, 
mostly true, or true for me.” W and E scales were 
considered separately, high and low scores being 
defined by the upper and lower thirds on each scale. 
Students falling into the anxiety categories outlined 
above were then allowed to volunteer as Ss. 


Materials 


The performance task which was presented to Ss 
consisted of five subtests of the WAIS which are 
normally timed subtests (arithmetic, picture comple- 
tion, block design, picture arrangement, and object 
assembly). Unlike Siegman (1956), who compared 
Ss’ performance on the timed versus the untimed 
subtests of the WAIS, the present study compared 
the performance of Ss who were aware of being 
timed versus Ss who were unaware of being timed 
on identical subtests. This latter procedure appears 
to reduce the likelihood of contamination by task 
differences. The difficulty manipulation was accom- 
plished by choosing items from the first part of the 
subtests (easy) versus items from the last part of 
each subtest (hard). 


Procedure 


The Ss entering the experimental situation were 
told that the aim of the experiment was to investi- 
gate the relationships between grades, feelings and 
attitudes, and IQ. They were told that they were 
selected as potential Ss on the basis of their present 
grade in introductory psychology. They were further 
told that they were going to be tested on a part 
of the Wechsler Adult Intelligence Scale, and that 
their scores could be transformed into IQ scores, thus 
providing a measure of their intelligence. Those in 
the Timed condition were told that the tasks would 
be timed, and that the time taken to reach a solution 
would be taken into account in determining their 
score. In ‘the Untimed condition, nothing was said 
about time limits. 

The Ss were then asked to complete a pre- 
examination questionnaire consisting of five W items 
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TABLE 1 
ANALYSES OF VARIANCE 


WAIS scores Emotionality change scores Worry change scores 
Source 
MS F MS F MS F 
Difficulty (A) 1140.75 18.29**** 9.19 2.31 46.02 6.42** 
Emotionality (B) 44.08 <1 1.69 <i 92 <1 
Worry (C) 1.33 <i 2.52 <1 2.52 <i 
Timing (D) 252.08 4,04* 02 <1 1.02 <1 
AXB .09 <1 1.69 <1 .02 py 
AXC 48.00 <i 19 <1 02 <1 
AXD 420.09 6.74" 2.52 <1 52 <1 
BXC 65.34 1.05 4.69 1.18 .52 <1 
BXD 24.09 <i 2.52 <1 22.69 3.17 
cxD 432.01 6.93" 11.02 2.77 7.52 1.05 
AXBXC 16.33 <1 7.52 1.89 2.52 <1 
AXBXD 2.07 <1 2.52 <1 13.02 1.82 
AXCXD 833.32 13-568" 02 <1 7.52 1.05 
BXCXD 47.99 <1 1.02 <1 1.02 <1 
AXBXCXD 61.21 <1 52 <1 9.19 1.28 
Subjects/groups 62.38 3.98 7.17 
* 
"p Sim 
eee p = L001. 
wert > = 0002. 


and five E items taken from Mandler and Sarason’s 
TAQ (cf. Liebert & Morris, 1967). The five subtests 
of the WAIS were then administered according to 
standard procedures, with the exception of differen- 
tial timing procedures. In the Timed condition, the 
experimenter held a stop watch in his hand, making 
an obvious effort to time S’s work accurately. In the 
Untimed condition, the tasks were timed without S’s 
knowledge by the use of a silent clock with a sweep 
hand, which was hidden from S’s view. Following 
completion of the task, Ss completed a postexamina- 
tion questionnaire consisting of the same 10 items 
from the TAQ. Finally, Ss were assured that IQ 
scores could not be obtained from these tests alone, 
and that the personal information was strictly 
confidential. 

Two dependent measures were used: (a) scores on 
the WAIS subtests, obtained by standard scoring 
procedures, and (b) W and E change scores, obtained 
by subtracting W and E scores on the preexamina- 
tion questionnaire from the corresponding scores on 
the postexamination questionnaire. 


RESULTS AND Discussion 
~ 


Analysis of variance of WAIS scores is pre- 
sented in Table 1. It is evident here that, 
as expected, there is a main effect for difficulty 
level. There is also an effect for timing that 
approaches significance, performance on timed 
tests being better than on untimed tests. Ad- 
ditionally, there is an interaction effect of 
Difficulty x Timing, due to inferior perform- 


ance by the Ss who were given hard, untimed 
tests. It is also clear from Table 1 that W 
interacts with the task variables of Timing 
and Difficulty x Timing, while Æ is involved 
in no significant interactions. 

Table 1 also presents analyses of variance 
of W and E change scores. The only signifi- 
cant effect is that of difficulty level on W 
change scores, which primarily reflects a de- 
crease in W scores when an easy test was pre- 
sented, There was also a tendency, though 
not significant, for W scores to increase as 
predicted when a hard test was presented 
(t = .73). Contrary to prediction, Æ scores did 
not increase as a function of timing the test. 
Tn fact, there were no significant effects in the 
analysis of variance of Æ change scores. 

The major hypotheses of this study can be 
evaluated in the light of the above data as 
follows: 

Hypothesis 1. High and low anxiety was 
defined by presence in the HH or LL group, 
respectively. Figure 1a shows the Anxiety X 
Timing interaction obtained by Siegman 
(1956), and Figure 1b shows the correspond- 
ing findings in this study. Here it is evident 
that the same pattern of results was obtained 
in both investigations. However, the difference 
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between performance on timed versus untimed 
tests is not statistically significant for either 
high- or low-anxious Ss in this study, though 
the tendency in both cases is in the predicted 
direction (é= 1.11 and t= 1,04, respec- 
tively). Likewise, an analysis of variance of 
WAIS scores revealed a nonsignificant Anxiety 
X Timing effect (F = 2.03, p= .17). There 
is, however, a significant three-way interaction 
(F = 7.39, p < .02), which indicates that the 
Anxiety X Timing interaction is more pro- 
nounced for hard tests than for easy ones, 

Hypothesis 2. This hypothesis received only 
partial support from the data: (a) the timing 
procedure failed to elicit emotionality, and 
(b) since this was prerequisite to the predic- 
tion concerning the performance of high-E 
Ss, the latter prediction could not be evalu- 
ated. Actually, WAIS scores for both high- 
and low-E Ss were ordered according to the 
prediction for the latter, their performance 
being better on timed than on untimed tests 
(Figure 1d). This finding seems reasonable, 
given the failure of the timing procedure to 
elicit emotionality. 

Hypothesis 3. (a) The first part of this 
hypothesis received partial support, since the 
W scores of Ss who were given hard tests in- 
creased significantly more than the W scores 
of those who were given easy tests. However, 
this difference was largely due to the de- 
creased W scores of the latter, rather than an 
increase upon presentation of the hard test. 
It is possible that the initial instructions led 
Ss to expect a hard test, so that those who 
were presented with an easy test experienced 
considerable relief from worry. On the other 
hand, presentation of a hard test aroused only 
a small amount of additional worry. Never- 
theless, this finding further corroborates the 
inverse relationship found between expectancy 
and worry scores in earlier studies. (b) There 
is little evidence to indicate that the different 
levels of worry had differential effects on 
Performance. Worry does not interact with 
difficulty alone; it does, however, enter into 
a three-way interaction with difficulty and 
timing, the two-way interaction between the 
task variables being accentuated for low- 
worry Ss. 

The most significant finding in this study 
appears to be the interaction of W X Timing. 
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Fic. 1, Mean WAIS scores as a function of Timing 
and MA scale, Worry, and Emotionality. 


Figure 1c portrays this interaction graphi- 
cally. It is important to note that this 
interaction follows the same pattern as the 
“Anxiety” X Timing interaction obtained in 
this and previous studies. Thus, these findings 
concerning worry are almost identical to those 
of Sarason et al, (1952) using “anxiety.” In 
both cases, there was a clear difference in the 
performance of low-anxiety (worry) Ss under 
different conditions, but only a tendency for 
the performance of high-anxiety Ss to differ 
as predicted. These findings appear to lend 
very strong support to the distinction between 
worry and emotionality, and to the usefulness 
of this distinction in anxiety-performance 
research, 

The evidence presented here in conjunction 
with the findings of previous studies clearly 
confirms the hypothesis that worry varies in 
a test situation as a function of the examinee’s 
expectation concerning his performance on 
the test. Thus, when a difficult task is pre- 
sented, S’s expectation decreases, causing him 
to be more worried (i.e., concerned about his 
performance). The findings of the present 
study further suggest that it is worry, not 
“anxiety,” which affects performance on 
intellectual-cognitive tasks and which inter- 
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acts with the relevant variables of the test 
situation. 

On the other hand, emotionality scores have 
been found to vary as a function of being in 
or out of a test situation. However, the failure 
of the Timed condition to elicit an increase in 
emotionality stresses a problem which requires 
further research, that is, the problem of 
determining what specific aspects of a testing 
situation are principally responsible for the 
elicitation of emotionality. It also seems likely 
that there are some types of tasks (e.g., motor 
skills) in which emotionality may be the 
component which affects performance. Further 
experiments are planned to attempt to address 
these issues. 

In contrast to the data of Siegman (1956), 
which suggested a “simple” interpretation 
concerning the inhibitory or disruptive effect 
of high anxiety on timed tests, the findings 
of this study are very similar to those of 
Sarason et al. (1952) in that they require 
consideration of differential effects of a 
stressor (timing) on high- and low-anxiety 
(or worry) Ss. Thus, timing the test seems 
to have an inhibitory effect on high-anxious 
Ss, and a facilitating effect on low-anxious 
Ss. In this study, the differential performance 
of low-worry Ss on timed and untimed tests 
seems best explained by assuming that the S 
with a low worry score is one who requires 
additional external motivation in order to 
perform well on this kind of task. The Timed 
condition is such a motivating condition, 
which enables the low-worry S to realize his 
potential. High-worry Ss are also motivated 
additionally by the Timed condition but, 
according to Mandler and Sarason (1952), 
have learned task-irrelevant anxiety responses 
which distract them from the task and thus 
disrupt their performance. Further studies are 
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needed to test this and other possible interpre: 
tations, and to further clarify the nature of 
the effect of both worry and emotionality on 
performance in various types of situations, 
Overall, the aim of this experiment 
been realized: that is, to show that 
worry-emotionality distinction is applicable ‘to 
analyses of the relationship of anxiety and 
performance and to suggest that by attendin 
to these components of anxiety, we may be 
better able to specify the effects of anxiety 
on different kinds of performance tasks. 
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PREFERENCE FOR COMPLEXITY IN ACUTE SCHIZOPHRENICS 
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A paired comparisons scaling technique was employed to assess the preference 
of schizophrenics for varying quantities of information as presented in geo- 
metric forms. Both good premorbid paranoid and poor premorbid nonparanoid 
schizophrenics showed their greatest preference for the least complex shapes. 
The results were discussed in terms of Munsinger and Kessen’s conception of a 
match between preference for environmental information and input-processing 


capacity. 


Recent research has indicated that indi- 
viduals show reliable preferences for different 
quantities of information as presented in geo- 
metric shapes (Munsinger & Kessen, 1964; 
Thomas, 1966). For example, Munsinger and 
Kessen found that Ss showed their greatest 
preference for figures of intermediate com- 
plexity (10 independent turns). Stimuli higher 
and lower in complexity were preferred less. 
Munsinger and Kessen have further theorized 
that the amount of information an individual 
prefers matches his information-processing 
capabilities. While some inconsistencies exist 
in this literature, it is clear that neither chil- 
dren nor adults prefer low-complexity shapes 
or, thus, low quantities of information. 

In contrast, a growing body of literature 
has indicated that schizophrenics may prefer 
a low level of input. Harris (1959) reported 
that schizophrenics are better able than nor- 
mals to tolerate the effects of sensory depriva- 
tion. McReynolds has reported that schizo- 
phrenics spent relatively less time viewing 
novel than familiar pictures and exhibited 
less stimulus seeking behavior on the Howard 
Maze Test (McReynolds, 1963; Sidle, Acker, 
& McReynolds, 1963). 

These investigations, however, have not 
given explicit consideration to the possibility 
that different subgroups of schizophrenics may 
respond differentially to different input levels. 
In support of this possibility, Mehl (1967) 
found that among chronic nonparanoid schizo- 
phrenics only patients with poor premorbid 
adjustment indicated a preference for sensory 
deprivation. Neale and Cromwell (1968) have 

1 Requests for reprints should be sent to John M. 
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proposed that among acute schizophrenics two 
“input processing styles” are present. Acute 
good premorbid and paranoid patients were 
hypothesized to process much information 
(operate a low level of redundancy), while 
poor premorbid and nonparanoid patients 
were hypothesized to process little input 
(operate at a high level of redundancy). 

The present study was designed to examine 
whether these hypothesized processing styles 
would be reflected in preference for complex- 
ity employing the Munsinger and Kessen 
paradigm. It was hypothesized that the good 
premorbid paranoids would show a super- 
normal preference for complexity and that the 
poor premorbid nonparanoids would show a 
subnormal preference for complexity. 


METHOD 


Nineteen good premorbid paranoid and 19 poor 
premorbid nonparanoid schizophrenic patients were 
studied. Premorbid adjustment was determined by 
the Phillips (1953) scale (Part I) and presence or 
absence of paranoid symptomatology from local 
psychiatric diagnosis. All patients were males be- 
tween the ages of 19 and 55 and had been hospital- 
ized for less than 3 years. Most were on standard 
dosages of phenothiazine medication. 

The stimuli were randomly generated asymmetrical 
shapes. There were four stimuli at each of five com- 
plexity levels (3, 6, 10, 20, and 40 independent turns). 
All complexity levels were paired and any given 
stimulus appeared once in the series of 10 trials. The 
order of the stimuli within the 10 trials was deter- 
mined from the procedures developed by Ross 
(1934), Each time a given level of complexity was 
paired with another level a different stimulus was 
employed. 

The Ss were presented with the 10 pairs and on 
each trial were asked to indicate which of the two 
stimuli they preferred. This basic design was 
replicated four times. 
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Fic. 1. Scale scores for judged preference for 
random shapes by the good premorbid paranoid and 
poor premorbid nonparanoid schizophrenics. 


RESULTS AND DISCUSSION 


The results were analyzed following Thur- 
stone’s Comparative Judgment Model. Scaling 
was accomplished under the assumptions of 
Classes II and III, Case C (Torgerson, 1958) 
and the results are presented in Figure 1. As 
may be seen, both the good paranoids and 
poor nonparanoids show their greatest prefer- 
ence for the least complex shapes. The num- 
ber of votes for each stimulus by the two 
groups was analyzed by a 2 (groups) x 5 
(levels of complexity) analysis of variance. 
Since the assumption of homogeneity of the 
variance-covariance matrix could not be met, 
the conservative test procedure suggested by 
Winer (1962) was followed in evaluating the 
Significance of the obtained F ratios. The 
analysis indicated a highly significant effect 
for levels of complexity (F = 25.33, df= 
1/36, p < .01) and a significant Groups X 
Complexity interaction (F = 3.73, df = 1/36, 
$ < .01). A breakdown of this interaction 
indicated that the good paranoids and poor 
nonparanoids differed reliably from each other 
at Complexity Levels 6 and 40. The poors 
showed less preference for the 6-turn figure 
than the goods. For the 40-turn figure this 
relationship was reversed. 

The results indicate that acute schizophre- 
nics show a clear preference for stimuli of 
low complexity. This result has not been 
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found with other populations who have been 
tested employing this procedure (e.g., various 
adult populations, Munsinger & Kessen, 1964; 
children at various ages, Thomas, 1966), In 
contrast to prediction, however, there was 
little difference between the good paranoid 
and poor nonparanoid patients, In terms of 
Munsinger and Kessen’s theory, the results 
may be interpreted to reflect a decreased 
capacity for information processing among 
acute schizophrenics. That is, schizophrenics 
prefer little information because they process 
information inefficiently. This reduction in 
the efficiency of information processing 
among acute schizophrenics has recently been 
confirmed in an investigation of span of 
apprehension in these patients (Neale, Mc- 
Intyre, Fox, & Cromwell, 1968). 
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Three groups of psychiatric patients in a veterans general hospital were tested 
under research, transfer, and discharge instructional sets. Differences in self- 
reported deviance among the groups are interpreted within an impression man- 
agement framework for the information they reveal about patients’ motivational 
orientations toward transfer and discharge as hospitalization outcomes. Patients 
were of greatest accord in wanting to avoid transfer. Orientation toward dis- 
charge varied among patient subgroups. Shifts in the level of presented deviance 
were similar for all dimensions but occurred significantly (p< .05) only for 
psychopathology. Differential levels of presented deviance among subgroups 
are interpreted in terms of corresponding differences in judged appropriateness 


for patienthood, 


The assertion that “psychopathological” 
behavior can be adequately understood as one 
way of attempting to exercise control over 
current interpersonal interactions (e.g., Haley, 
1963; Szasz, 1964) is a position with numer- 
ous implications. One of these is that the be- 
havior of mental patients and the “mentally 
ill” can be analyzed in terms of self-presenta- 
tion or impression management just as fruit- 
fully as the behavior of nonpatients or “nor- 
mals” can. The transactional nature of psy- 
chological testing provides a good example of 
the applicability of this view. Psychological 
tests are usually given in conjunction with a 
forthcoming decision to be made about a pa- 
tient. If psychological testing is considered to 
be a two-way communication situation as 
Rotter (1960) and others have contended, the 
patient’s perception of the nature of the de- 
cision must be considered an important de- 
terminant of his responses and not simply a 
biasing factor. The patient receives certain 
benefits from the hospital or clinic, and his 
presentation on psychological tests is one of 
the major means available to him for influenc- 
ing decisions in the direction of his goals. 

1The authors wish to thank Mary McIndoo, 
Jonathan Cummings, and the staff of the Veterans 
‘Administration Hospital, Washington, D. C., for 
their support. Appreciation is also due Edward Klein 
and Dorothea Braginsky for their critical reading 
of the manuscript. 4 

2Also at Vale University. Requests for reprints 
should be sent to Alan F. Fontana, Veterans Admin- 
istration Hospital, West Haven, Connecticut 06516. 


When the communication approach to 
psychological testing is applied, inferences 
about the types and strengths of patients’ 
goals can be made on the basis of response 
changes when the avowed purpose of the 
testing changes. Several investigators have 
reported that patients manifest differing 
amounts of pathology depending upon their 
understanding of the reasons for testing 
and the possible relevance of the results to 
the attainment of their goals (Braginsky & 
Braginsky, 1967; Braginsky, Grosse, & Ring, 
1966; Wilcox & Krasnoff, 1967; Young, 
1965). In a similar vein, Fontana and Klein 
(1968) categorized schizophrenic patients as 
healthy or sick presenters on the basis of their 
self-reports. When tested on reaction time, the 
deficit manifested by patients varied from 
none to a substantial amount and was found 
to be directly related to the adequacy of self- 
presentation and the characteristics of the 
hospital environment. Explanation of the re- 
sults included the assertion that patients in 
treatment buildings were motivated to main- 
tain the status quo; that is, they neither 
wanted to be transferred to custodial build- 
ings nor to be discharged. The present study 
was designed to test the generality of this 
assertion by varying the meaning of the test- 
ing conditions under which patients in a dif- 
ferent treatment hospital completed several 
self-report scales. The expectation was that 
patients would attempt to create the impres- 
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sion that they were too healthy to warrant 
transfer but too sick to deserve discharge. 

Psychopathology and adequacy-responsibil- 
ity are two dimensions considered to be of 
central importance to the transactions between 
patients and staff. Psychopathology should be 
expected to occupy a central position because 
hospitals are established for the treatment of 
pathology as their reason for being. The 
strong emphasis in clinical thinking on pa- 
tients’ need for self-esteem and the accept- 
ance of responsibility suggests adequacy- 
responsibility as an important dimension also. 
In the self-presentation view of psychopathol- 
ogy both psychotics and nonpsychotics are 
considered to be equally aware of the demands 
and norms of their environment. It is recog- 
nized that the mode of impression manage- 
ment is different for the groups, but it is 
expected that both groups attempt to manage 
their impressions. Three testing conditions 
were employed in which testing was explained 
in terms of research interests. Further infor- 
mation was provided in two conditions, one 
in which patients were told that the results 
might have personal relevance to them regard- 
ing transfer to another hospital, and the 
second in which they were told that the re- 
sults might have personal relevance to them 
regarding discharge from the hospital. 

The major hypotheses can be stated as 
follows: 

1. Compared to patients in the research- 
only condition, patients in the transfer- 
relevant condition would attribute less psycho- 
pathology to themselves, and patients in the 
discharge-relevant condition would ascribe 
more pathology to themselves. 

2. Compared to patients in the research- 
only condition, patients in the transfer- 
relevant condition would describe themselves 
as more adequate-responsible, and patients in 
the discharge-relevant condition would report 
that they were less adequate-responsible, 

3. Comparable effects would be found 
for patients diagnosed as psychotic and non- 
psychotic. 


METHOD 


Subjects and Setting 


Seventy-five male psychiatric patients from a 
Veterans Administration general medical and surgical 


ALAN F. FONTANA AND THEODORE GESSNER 


hospital comprised the S sample. Forty were diag- 
nosed nonpsychotic (68% assigned diagnosis of anx- 
iety reaction, depressive reaction, or alcoholism) and 
35 psychotic (89% schizophrenic). The average pa- 
tient was 414 years of age (SD=8.8), had 114 
years of education (SD =3.1); and had been hos- 
Pitalized currently for 5.4 months (SD = 3.8), Sixteen 
of the Ss were Negro and 59 were Caucasian. Assign- 
ment of Ss to testing conditions was balanced for 
diagnosis, race, and ward of residence. The Ss in the 
three conditions did not differ significantly in age, 
education, or length of hospitalization. Excluded from 
the sample were patients considered to be organically 
impaired, those 60 years of age or older, and persons 
currently hospitalized less than 1 month. 

The hospital is a new, attractive building located 
in the center of a metropolitan area. All wards are 
open, extensive facilities are available, and an active 
treatment program is in force. A psychiatric trans- 
fer for most patients would entail going to older 
hospitals, located in rural areas, where less extensive 
facilitites and treatment are available, Klein and 
Fontana (in press) found that patients evaluate hos- 
pitals of the former type much more positively than 
hospitals of the latter type. It is reasonable to con- 
clude that patients would wish to avoid being trans- 
ferred from a treatment hospital. 


Testing Conditions and Instruments 


The Ss were tested in groups and were told that 
the testing was being conducted for research pur- 
poses. For the 27 Ss in the research condition this 
was their only information. The transfer group 
(N = 24) was additionally informed, 


I should add that this is mainly a research study 
but that it may have personal relevance to you 
since one of the purposes of this study is to deter- 
mine which patients have received maximum bene- 
fit from this hospital and should be transferred to 

(a large psy- 
chiatric hospital). 


The discharge group (N =24) was given this in- 
formation with the exception that discharge was 
substituted for transfer. The phrase “maximum 
benefit” was employed because it is the language, 
familiar to both staff and patients, used to refer to 
a staff judgment that further treatment is unlikely 
to be productive and that the patient’s association 
with the hospital should be terminated. The research 
group was tested 1 week before the transfer and 
discharge groups, which were tested at the same time. 
In this way, there was no Possibility that the dis- 
charge or transfer instructional set might influence 
any testing condition other than the one intended. 
Psychopathology was considered to be the most 
prominent dimension for manipulation. A scale assess- 
ing the attribution of Psychopathology to oneself 
(SD1s) reported by Fontana, Klein, Lewis, and 
Levine (1968) and a scale for predicting chronicity 
(C) developed by Anker (1961), were used as 
indexes of Psychopathology. f 
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TABLE 1 
Means AND ANALYSES OF VARIANCE OF THE RESPONSES TO TESTING CONDITIONS 
BY PsycHotics AND NonpsycHorics 
Conditions Analyses of variance 
Variable Diagnosis 
Research Transfer Discharge | Conditions | Diagnosis | Interaction 
SDig Psychotic 11.31 13.67 12.23 t se 
Nonpsychotic 9.00 12.07 9,00 
Cc Psychotic 6.85 5.00 6.08 . 
Nonpsychotic 7.43 3.93 5.91 
K' Psychotic 9,31 9.22 10.54 . 
Nonpsychotic 5.79 8.67 6.18 
AR Psychotic 9.77 12.56 11.31 ” 
Nonpsychotic 6.93 9.20 8.45 
I-E Psychotic 10.38 9.33 9,54 
Nonpsychotic 12.29 9,53 9,91 
ET Psychotic 38.92 40.89 38.08 : * 
Nonpsychotic 46.36 39.47 37.82 


Note.—For psychotics in the research group, N = 13; in the transfer group, N 


= 9; in the discharge group, N = 13. For 


nanpiyeho riori the research group, N = 14; in the transfer group, N = 15; in the discharge group, N = 11. 
p <.05. 


HD < 01. 


In order to resolve an interpretive ambiguity sur- 
rounding pathology scales such as SDis and C, 
Welsh’s “purification” of the MMPI denial scale 
(K') (Dahlstrom & Welsh, 1960) was included. K’ 
consists of the original K scale minus those items also 
included in the MMPI pathology scales. Thus K’ 
can be taken as a measure of the denial of undesir- 
able but nonpathological statements. If Ss respond to 
K' items in the same way and to the same extent as 
they do to SDs and C items, it would suggest that 
self-presentation is largely achieved through a general 
denial of undesirable characteristics. If, however, Ss 
respond somewhat differently to K’ than to SDs 
and C, it would suggest that the pathology content 
of the items is a crucial variable. 

The second dimension thought to be relevant to 
impression management is adequacy-responsibility 
(AR). A 20-item, counterbalanced, true-false scale 
was devised which included items such as: “I can 
successfully accomplish most things I set out to do”; 
“I am someone who people can depend on”; and “I 
think that a person with an emotional problem is 
still accountable for his actions.” A second scale used 
to assess this dimension was the Internal-External 
Locus of Control Scale (I-E) developed by Rotter 
(1966). 

Finally the sum of the “true” responses (ZT) to 
the counterbalanced scales was calculated. This served 
as a measure of the extent to which Ss adopted an 
uncritical, uninvolved orientation to the items as 
opposed to a critical involvement in the content of 
the items. 

®The split-half reliabilities, corrected by the 
Spearman-Brown formula, for these measures are as 
follows: SD1s (.84), C (.80), K’ (.84), AR (80), 
LE (.77), and ZT (.53). 


RESULTS 


A 3 (testing conditions) X 2 (diagnoses) 
analysis of variance was performed for each 
scale and the sum of the “true” responses. 
The means and results of the analyses of vari- 
ance are presented in Table 1. High scores on 
SDıs, AR, and K’ indicate a healthy, ade- 
quate, denying orientation, respectively; 
whereas high scores on C and I-E indicate an 
orientation toward greater chronicity and 
external locus of control, respectively. 

Inspection of the means reveals that there 
is a consistent pattern of response to all scales 
across the testing conditions. As predicted, Ss 
responded in the least deviant manner in the 
transfer condition. However, this effect was 
significant only for measures of the pathology 
dimension, SDig and C. The Ss did not re- 
spond in the most deviant manner in the dis- 
charge condition as expected; rather the 
means typically fell between those for the re- 
search and transfer conditions. Psychotics 
consistently presented themselves in a less 
deviant light than nonpsychotics, significantly 
so on SD:s, K’, and AR. As hypothesized, the 
psychotics’ pattern of responses to the testing 
conditions was parallel to that of nonpsy- 
chotics in every case. XT was the only vari- 
able to yield a significant interaction between 
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groups and conditions; nonpsychotics en- 
dorsed more items as true than did psychotics 
in the research condition. 

An exploratory analysis was conducted in- 
volving patients’ service connection and dis- 
ability status. This variable was thought to be 
relevant to patients’ hospitalization goals 
since service-connected patients have priority 
in claiming treatment resources and are the 
only ones automatically eligible for psychi- 
atric disability payments. Three groups were 
established: nonservice-connected, service-con- 
nected with partial disability, and service- 
connected with full disability. To the extent 
that considerations regarding disability ratings 
and payments affect hospitalization goals, 
nonservice-connected patients could be ex- 
pected to be affected least by differences in 
the testing conditions and partial disability 
patients to be affected most, particularly in 
the comparison between research and dis- 
charge conditions. All service-connected pa- 
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tients receive disability payments at the rate 
of 100% while they are in the hospital, pro- 
viding they have been hospitalized more than 
21 days. Upon discharge, payments revert to 
the level of the prehospitalization rating. Dis- 
charge would mean no decline in payments 
to the fully disabled, but a decrease for the 
partially disabled. Thus, to the extent that a 
reduction in payments might influence hos- 
pitalization goals, the partially disabled could 
be expected to be most resistant to discharge 
and to present themselves with greatest de- 
viance in the discharge condition. Patients in 
the three groups were comparable in terms of 
diagnoses and background variables except 
for prior hospitalization history; men with 
a 100% disability rating (prior to current 
hospitalization) had been hospitalized for 
psychiatric problems significantly more often 
(F = 6.57, df = 2/72, p < .01) than patients 
in the other groups. The means and results 


TABLE 2 


MEANS AND ANALYSES OF VARIANCE OF THE RESPONSES TO TESTING CONDITIONS 
BY NONSERVICE-CoNNECTED, AND PARTIALLY DISABLED AND Futty 
DISABLED SERVICE-CONNECTED PATIENTS 


Conditions Analyses of variance 
Variable eres connection 
Research | Transfer | Discharge | Conditions Service Interaction 
connection 
Nonservice connected 11.40 11.78 11.50 
SDi, | Partially disabled 8.43 13.89 11.83 brad $ 
Fully disabled 8.60 12.17 6.50 
Nonservice connected 6.07 5.22 6.36 
Cc Partially disabled 8.71 2.78 3.67 se $ 
Fully disabled 8.20 5.33 8.25 
Nonservice connected 8.53 8.67 9.64 
K' Partially disabled 5.57 9.22 8.50 
Fully disabled 7.00 8.67 4.75 
Nonservice connected 9.53 10.44 11.00 
AR Partially disabled 7.29 11.33 11.00 ny 
Fully disabled 6.00 9.17 5.00 
Nonservice connected 10.53 9.78 10.43 
I-E Partially disabled 13.00 8.33 7.83 
Fully disabled 11.60 10.67 10.00 
Nonservice connected | 41.00 38.89 38.43 
DE Partially disabled 45.57 41.67 37.50 Er 
Fully disabled 44.20 39.17 37.00 


Note.—For the nonservice-connected group in the research condition, N 

lar 4 rti; service-connected patients 
condition, N = 9; in the discharge condition, N = 6. For the fully di: service- tients i 

= 6; in the discharge the fally disabled connected patients in ‘the research condition, 


discharge condition, M = 14. For the partially disabled 


N = 5; in the transfer condition, N 
*D < 10. 
dD < .05, 


= 15; in the transfer condition, N = 9, in the 
in the research condition, N = 7; in the transfer 
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of the analysis of variance can be found in 
Table 2. 

Inspection of the means indicates that non- 
service-connected patients were much less af- 
fected by the testing conditions than were 
service-connected patients. The data also re- 
veal that the service-connected groups pre- 
sented themselves similarly in the research 
condition and with least deviance in the trans- 
fer condition, but differed markedly in their 
responses to the discharge condition. Com- 
pared to the research condition, full disability 
patients attributed the same or greater devi- 
ance to themselves in the discharge condition. 
Contrary to expectations, partial disability 
veterans described themselves much less devi- 
antly in the discharge condition than in the 
research condition. 


DISCUSSION 


The results have strongly supported some 
of the conceptions about patients’ self- 
presentation and have indicated that modifica- 
tions are needed in others. The prediction that 
patients would not want to be transferred to 
a less desirable hospital was clearly borne out. 
Under classifications of both diagnosis and 
service connection, patients in the transfer 
condition presented themselves less pathologi- 
cally than those in the research condition. 
Patients’ orientations toward discharge were 
less clear. When Ss were classified by diag- 
nosis, the level of reported pathology in the 
discharge condition was greater than in the 
transfer condition and usually similar to the 
research condition. In addition, the variability 
of scores was consistently (though not signifi- 
cantly) found to be largest in the discharge 
condition and smallest in the transfer condi- 
tion. This suggested that there was a greater 
difference of opinion among patients concern- 
ing the desirability of discharge compared 
to transfer, and that patients were either in- 
different to discharge or divided, with one 
subgroup favoring and one opposing dis- 
charge. The classification according to service 
connection helped to clarify this ambiguity 
and indicated that all three orientations were 
represented. 

Nonservice-connected patients varied very 
little in their presentations, demonstrating an 
indifference to the added information in the 
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transfer and discharge conditions. Full and 
partial disability Ss differed markedly in their 
presentations, the fully disabled manifesting 
more deviance than the partially disabled. 
The difference was most pronounced in the 
discharge condition. The fully disabled mani- 
fested a pattern consistent with avoiding dis- 
charge whereas the partially disabled pro- 
duced a pattern consistent with obtaining dis- 
charge. The latter finding is particularly un- 
expected because there were plausible reasons 
for suspecting that the partially disabled 
would be most desirous of avoiding discharge. 
Certainly this finding runs counter to the 
widespread impression that potential. reduc- 
tion of disability payments encourages adop- 
tion of the sick role and the desire for hos- 
pitalization. These data suggest that analyses 
and explanations based solely in terms of 
monetary payments are oversimplifications 
and that the effects of disability payments 
result from a complex interaction of factors 
little understood at present. A similar conclu- 
sion was drawn by Fisher (1965), who found 
that disability payments did not compensate 
for low occupational adjustment in any simple 
and direct way. 

The fact that the fully disabled confirmed 
the initial hypothesis and the partially dis- 
abled did not may be related to the greater 
number of prior hospitalizations of the former 
group. It is not surprising that the orienta- 
tion of staying in the hospital was found for 
those patients who had a history of repeated 
hospitalization. A chronicity difference might 
also account for the authors’ general over- 
statement of patients’ desire to avoid dis- 
charge. Patients in the present sample were 
currently hospitalized an average of only 5.4 
months compared to 22.9 months for the 
Veterans Administration sample investigated 
by Fontana et al. (1968). Thus patients in 
the latter sample may have created the 
stronger and more pervasive impression of 
wanting to stay in the hospital concomitant 
with their greater chronicity. 

Psychotics consistently presented them- 
selves less deviantly than did nonpsychotics. 
The fact that the difference was most evident 
in the research condition indicates that the 
groups approached testing with different ori- 
entations. This conclusion is supported further 
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by the difference in the sum of the “true” 
responses between diagnostic groups. Non- 
psychotics in the research condition chose the 
“true” alternative much more often than did» 
the psychotics or the nonpsychotics in the 
other conditions. Thus the nonpsychotics re- 
sponded less to the content of the items. 
Psychotics maintained close attention to 
item content throughout the three conditions. 
A similar difference in the research condi- 
tion can be seen for the service connection 
classification. Nonservice-connected veterans 
presented themselves less deviantly than did 
service-connected patients. Although the diag- 
nostic and service connection classifications 
are independent (x? = 1.96, df = 2, p > .25), 
they both possess an internal classification of 
greater and lesser appropriateness for admis- 
sion and retention. 

The fact that this was a treatment hospital 
and that psychotics are generally considered 
to be the poorer candidates for individual 
and group therapy could be expected to exert 
a selective influence in favor of the healthier 
appearing psychotics. Similarly, a greater 
weight may have been placed on healthier 
self-presentations by nonservice-connected vet- 
erans because of their lesser priority in claim- 
ing hospital resources. Conversely, service- 
connected veterans because of their legal 
status, and nonpsychotic patients as a result 
of their symptomatology, could be expected 
to be a less selected sample in terms of their 
self-presentations by virtue of their greater 
suitability for patienthood. 

With the exception of the “true” responses, 
nonpsychotics and psychotics responded to the 
testing conditions in a parallel manner. Both 
groups revealed an awareness of the differen- 
tial meaning of the testing conditions and an 
interest in varying the level of deviance 
presented. 

Impression management mainly involved 
scales with pathology content. Scales measur- 
ing adequacy-responsibility were only weakly 
affected by the testing conditions, albeit in 
the same direction as measures of psycho- 
pathology. This is surprising since so much 
of therapeutic endeavor is conceptualized in 
terms of increasing a patient’s sense of per- 
sonal responsibility. However, patient respon- 
sibility is often incompatible with the tradi- 
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7 
tional patient role, and it is very possible that 
double messages are often given to hospital- 
ized patients (Wilensky & Herz, 1965), Exe 
plicit expectations of responsibility are gen. 
erally expressed through words, while covert- 
expectations against responsibility are ex- 
pressed through actions. The resulting ambi- 
guity could be expected to make patients 
unsure of how to present themselves on this 
dimension. 

The greater sensitivity to the testing condi- 
tions of the pathology scales as compared to 
the denial scale helps to resolve a question of 
interpretation of the former. Although K’ 
correlated .75 and —.52 with SDıs and C, 
respectively, the shift in K’ scores across con- 
ditions was not statistically significant. It 
seems reasonable to conclude that although 
a general defensiveness or social desirability 
orientation was operating, pathology content 
as focused avenue of self-presentation was 
important over and above such an orientation. 
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RELATIONSHIP OF SELECTED PSYCHOSOCIAL VARIABLES a 
TO PROGNOSTIC JUDGMENTS 4 


JAMES E. COWDEN? anp ASHER R. PACHT 
Wisconsin Department of Health and Social Services 


The way in which clinicians use selected cue variables in making prognostic 
judgments of the postrelease adjustment of institutionalized delinquents was 
assessed in this study. Cue variables (concepts commonly used in clinicians’ 
reports, typically with some prognostic significance) were correlated with 
prognostic judgments of postrelease adjustment and also with actual ratings 
of postrelease adjustment. The clinicians were reasonably accurate, as a group, 
in estimating the prognostic significance of the cue variables. Cue variables more 
readily inferred from behavior (institutional adjustment, hostility, home en- 
vironment, and maturity) were related most consistently and accurately to post- 
release prognosis, while those less easily inferred from behavior (anxiety, de- 
pendency, and guilt) showed a less clear-cut relationship. More communication 
among clinicians, particularly in their usage of concepts describing intrapsychic 


processes, was thus recommended. 


Isolating the factors involved in making 
clinical judgments has long been a problem 
for the clinician, This problem is further 
complicated when the clinician is asked to 
make diagnostic assessments in a correctional 
setting. Most clinical instruments have been 
standardized on noncorrectional populations. 
Inmates tend to be rather defensive and un- 
communicative, and there is typically con- 
siderable social class distance between the 
clinicians and the inmates. These factors in 
combination have served to heighten the dif- 
ficulty of making meaningful clinical predic- 
tions (Harari, 1964; Jacks, 1962; Prentice, 
1964). As a result, false inferences are drawn 
and inmates are frequently stereotyped as 
being sick and/or victimized by their environ- 
ment (Giedt, 1955; Lee & Tucker, 1962; 
Kostlan, 1954). More effective prediction can 
best be achieved through a systematic analy- 
sis of the relationships of the factors which 
underlie clinical judgment. 

This study examined the accuracy and con- 
sistency with which clinicians utilized selected 
cue variables in making prognostic judgments 
relating to the postrelease adjustment of de- 
linquent boys. 

METHOD 

Four experienced staff members (three psycholo- 

gists and a psychiatrist) at the Wisconsin School for 


1 Requests for reprints should be sent to James E. 
Cowden, 1 West Wilson Street, P, O. Box 669, 
Madison, Wisconsin 53701. 
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Boys, Wales, were surveyed to determine w 
factors they considered to be most predictive of 
postrelease adjustment of delinquent boys. From 
among those typically included in reports the nine 
factors selected for this study were: hostility, anx- 
iety, dependency, overall magnitude of personality 
disturbance, social delinquent features (degree to” 
which involvements with delinquent peers contributed 
to their delinquent behavior), institutional adjust- 
ment, level of maturity, and adequacy of their home 
environment. These factors were labeled “cue va 
ables” and a 7-point rating scale was constructed fo 
each. In addition, a 7-point “global prognosti 
rating scale was developed for clinicians to ma e 
predictions concerning the postrelease adjustment of 
each boy in the sample. Finally, a 7-point rating 
scale was constructed to measure the actual post- 
release adjustment of Ss for a period of up to 1 year 
following their release from the institution. f 

Ratings were made on each of the nine cue vari- 
ables, the global prognostic scale of postrelease ad- 
justment, and the rating scale of actual postrelease 
adjustment on 152 delinquent boys who had been 
institutionalized as delinquents at the Wisconsin 
School for Boys at Wales. P. 

Each clinician rated approximately the same num- 
ber of boys and no boy was rated by more than | 
one clinician. Interjudge reliabilities on these scales | 
had been assessed previously and are reported in the 
results section. To avoid bias, clinicians were not 
aware that the study involved correlating ratings on 
each of the cue variables with global prognostic 
ratings and actual ratings of postrelease adjustment. 

To assess the predictive value of each cue variable, 
the cue variable ratings of each of the four raters 
(separately and combined) were correlated both with 
their global prognostic ratings of postrelease adjust- 
ment and with ratings of actual postrelease adjust- 
ment of Ss made independently by other raters. The 
results of a previous study (Cowden & Pacht, 1967) 
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TABLE 1 
CORRELATIONS OF CUE VARIABLES WITE GLOBAL PROGNOSTIC RATINGS OF POSTRELEASE 
ADJUSTMENT AND WITH RATINGS OF THE ACTUAL POSTRELEASE 
ADJUSTMENT oF 152 DELNQUENT Boys 
Global prognostic ratings of postrelease adjustment 
= Ratings of 
Cue variables* Separate raters actual post: 
Combined Paces T 
t justmen 
Jc TB wP GM ats 
Institutional adjustment 65 .78 275) 71 65 39 
Hostility —40 —11 —.53 —.16 —.36 —.15 
Home environment Al .26 38 17 28 7 
Immaturity =.35 —.07 —.21 —.34 — 24 —.16 
Personality disturbance —.23 —.51 25 —.43 —.32 —.11 
Guilt 39 —.29 63 -06 .29 18 
Social delinquent features —.23 09 —AL —.10 —.24 —.14 
Dependency 35 —.19 37 .08 16 15 
Anxiety —.10 —.12 52 .00 —.02 02 


Note.—Global prognostic ratings ch popeease adjustment were made by psychologists from reports prepared by clinicians 


(psychologists or psychiatrists) shortly 


ter admission to the institution. Ratings of actual postrelease aaant were made 
from parole agents’ chronological reports of their adjustment up to 1 year following their release from the insti 


tution. 


^ Ranked in order of the consistency of ratings and overall magnitude (either positive or negative) of the correlation coefficients. 


indicated that clinicians’ global prognostic ratings are 
among the best predictors of the subsequent behavior 
of institutionalized delinquents. In that study, it was 
found that clinicians’ global prognostic ratings 
showed a moderately high correlation (r=.51) with 
rated postrelease adjustment. It was, therefore, as- 
sumed that the correlation between a clinician’s 
ratings of a boy on a given cue variable and his 
global prognostic ratings of the same boy would 
reflect the importance of this cue variable as a source 
of information for the clinicians in making judgments 
as to the boy’s probable postrelease adjustment. Cor- 
relations between cue variables and prognostic ratings 
for each clinician separately were then compared to 
determine degree of consistency among clinicians in 
utilizing these cue variables as sources of information 
in making judgments about the postrelease adjust- 
ment of delinquent boys. 

The foregoing procedures permitted the following 
comparisons to be made: First, the consistency with 
which clinicians utilized the cue variables in making 
Prognostic judgments was determined by assessing 
the degree of relationship between ratings of each 
cue variable and the global prognostic ratings for 
each clinician in turn. Second, the relative accuracy 
with which clinicians as a group used each of the 
cue variables as predictors of postrelease adjustment 
was determined by an assessment of the relationship 
between the cue variables, global prognostic ratings, 
and ratings of actual postrelease adjustment for all 
clinicians combined. 


RESULTS 


Interrater reliabilities of each of the uke 
Cue variables and the global prognostic rating 


variable were assessed through having two 
psychologists make independent ratings of 35 
delinquent boys on each of these variables. 
The reliability coefficients for these variables 
were as follows: institutional adjustment, .85; 
maturity, .83; hostility, .78; social delinquent 
features, .76; personality disturbance, .73; 
home environment, .71; guilt, .66; anxiety, 
-56; dependency, .52; global prognosis, .79. 
These variables can thus be rated by ex- 
perienced clinicians with at least moderate 
reliability. 

The cue variables were then correlated with 
ratings of the actual postrelease adjustment 
of the boys, and, second, with the global 
prognostic ratings of their postrelease adjust- 
ment, The results of this analysis are shown 
in Table 1. 

These results indicate that the cue variable, 
institutional adjustment, showed the highest 
correlation both with the global prognostic 
and actual ratings of postrelease adjustment. 
In addition, the raters showed a high degree 
of consistency in utilizing this cue variable 
in making their global prognostic ratings. In 
similar fashion, the variables of hostility, 
home environment, and maturity (in that 
order) were also, to a lesser extent, used 
consistently by clinicians in making their 
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global prognostic ratings of postrelease ad- 
justment. In particular, the results suggested 
that delinquents with a better than average 
institutional adjustment, who came from a 
relatively good home environment, and who 
showed relatively little hostility and above- 
average maturity, would be rated by most 
clinicians as having a positive prognosis. The 
variables of dependency and anxiety, on the 
other hand, were not consistently correlated 
with the clinicians’ prognostic judgments. 

A comparison of the correlations between 
the cue variables and global prognostic 
ratings, and between the cue variables and 
actual postrelease adjustment for the sample 
of delinquent boys, demonstrated reasonably 
good agreement in the direction and magni- 
tudes of the correlation coefficients. The 
magnitudes of the correlations between the 
cue variables and global prognostic ratings 
were, however, somewhat higher than those 
between cue variables and actual postrelease 
adjustment, which suggests that clinicians 
tended to give the cue variables more 
“weight” as predictors than was warranted. 
The relative weightings given to each of the 
cue variables appeared realistic, however, as 
indicated by the correlation (phi coefficient) 
of .74 between\the relative rankings of the 
magnitudes of the correlation coefficients be- 
tween the cue variables and global prognostic 
ratings, and between the cue variables and 
ratings of actual postrelease adjustment. It 
seems clear that experienced clinicians are 
moderately accurate not only in predicting the 
postrelease adjustment of institutionalized 
delinquents, but also in the degree to which 
they utilize various cue variables in making 
such predictions, 


Discussion 


The results of this study suggest that 
under certain conditions clinicians show con- 
siderable inconsistencies as to which variables 
are most clearly related to the postrelease ad- 
justment of delinquent boys. Their degree of 
consistency depended heavily upon the nature 
of the concepts being rated. It was found, for 
example, that the variables of institutional 
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adjustment, hostility, home environment, and 
maturity were most consistently related to 
postrelease prognosis. These results suggest 
that those concepts most readily inferred from 
behavior can be more clearly and consistently 
related to prognostic judgments by clinicians, 
By contrast, the variables of guilt, anxiety, 
and dependency were least consistently re- 
lated to prognostic judgments. These results 
are particularly meaningful since these latter 
concepts are so frequently mentioned in cli- 
nicians’ reports, typically with some prognostic 
implications. The lack of consistency among 
clinicians in their perceptions of the relation- 
ships between these concepts and predicted 
postrelease adjustment appears to be a conse- 
quence, at least in part, of the relative ambi- 
guity of these concepts. 

In general, the results of this study suggest 
that (a) clinicians should be cautious in as- 
suming a significant consensus as to the prog- 
nostic significance of many concepts fre- 
quently found in clinical reports; (b) more 
communication among clinicians should be 
encouraged with the goal of achieving greater 
mutual understanding and more consistent 
usage of certain key concepts, particularly 
those describing intrapsychic processes. 
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COMPARATIVE STUDY OF THE WECHSLER PRESCHOOL AND PRIMARY 
SCALE OF INTELLIGENCE AND THE STANFORD-BINET INTELLIGENCE 
SCALE, FORM L-M, AMONG CULTURALLY DEPRIVED CHILDREN + 


ALLAN BARCLAY ann ALLAN C. YATER 
St. Louis University 


This study investigated the relationship be- 
tween the Wechsler Preschool and Primary Scale 
of Inteiligence (WPPSI; Wechsler, 1967) and the 
Stanford-Binet Intelligence Scale, Form L-M 
(S-B; Terman & Merrill, 1960), among a sample 
of culturally disadvantaged children in order to 
afford some estimate of the relative similarities or 
differences that might exist between these two 
instruments. 

The Ss were 50 culturally disadvantaged chil- 
dren who had been participating in a Head Start 
program for an average period of 8 months. The 
sample was drawn from neighboring centers in an 
attempt to minimize differences in residential 
environments. The children ranged in age from 
60 through 71 months (M = 63.84, SD = 2.44). 
Two female examiners administered the tests, and 
each child was tested individually in his respec- 
tive Head Start center. Ten subtests of WPPSI 
were administered in the order suggested by 
Wechsler (1967). The omitted subtests were 
Sentences (Supplementary) and Animal House 
Retest. 

A counterbalanced order of presentation of the 
WPPSI and the S-B was employed, with a mean 
interval between test administrations of 21 days. 
Twenty-five Ss received the WPPSI first; the 
remaining Ss completed the S-B first. Each group 
contained 13 males, 12 females, 11 Caucasians, 
and 14 Negroes. 

The means and standard deviations for the 
WPPSI Verbal, Performance, and Full-Scale IQs 
were 91.72 and 10.97, 96.32 and 12.09, 93.38 and 


1An extended report of this study may be ob- 
tained without charge from Allan Barclay, Depart- 
ment of Psychology, St. Louis University, St. Louis, 
Missouri 63103, or for a fee from the American 
Science Information Service, Order NAPS Document 
NAPS-00272 from ASIS National Auxiliary Publi- 
cations Service, c/o CCM Information Sciences, Inc., 
22 West 34th Street, New York, New York 10001; 
remitting $1.00 for microfiche and $3.00 for photo- 
copies. Requests for reprints should be sent to Allan 
Barclay at the above address. 


11.57, respectively. The mean Performance IQ 
was significantly higher than the mean Verbal IQ 
(#= 3.34, df =49, p < .01) and the mean Full- 
Scale IQ (¢ = 4.09, df = 49, p < .001), while the 
mean Full-Scale IQ was significantly higher than 
the mean Verbal IQ (# = 2.39, df = 49, p <.05). 

The mean S-B Full-Scale IQ was 100.96 with a 
standard deviation of 12.46. The S-B IQ mean 
was significantly higher than the mean WPPSI 
Verbal IQ (t= 7.39, df = 49, p < .001) as well 
as the mean WPPSI Performance IQ (t= 3.69, 
df=49, p<.001) and the mean WPPSI Full- 
Scale IQ (ż = 7.13, df =49, p < .001). 

A 2 X 2 Latin-square design also compared the 
mean WPPSI Full-Scale IQ with the mean S-B 
IQ. It was again found that the mean S-B Full- 
Scale IQ was significantly higher than the mean 
WPPSI Full-Scale IQ (F = 51.73, df = 1/48, p 
<.001). However, order of administration (i.e., 
WPPSI-S-B vs. S-B-WPPSI) had no significant 
effect on IQ scores (F < 1.00, df = 1/48). 

Further comparisons were made between males 
and females, between Negroes and Caucasians, 
and between children 60-63 months and 64-71 
months of age. No significant differences due to 
sex, race, or age were found on any of the four 
IQ estimates. 

Correlation coefficients for the S-B IQ and the 
WPPSI Verbal, Performance, and Full-Scale IQs 
were .73, .74, and .81, respectively. 

Results were interpreted as suggesting that the 
WPPSI and the S-B were not comparable instru- 
ments for the culturally disadvantaged children 
in this study, and that the WPPSI seemed to be 
experienced as the more difficult test. 
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REVISED SUICIDE POTENTIAL SCALE * 


R. W. MISKIMINS anp LOWELL T. WILSON 
Colorado State University 


A problem of extreme interest for all psychi- 
atric institutions is that of suicide prevention. An 
investigation by Miskimins, DeCook, Wilson, and 
Maley (1967) identified readily obtained clues in 
the demography and personality of institutional- 
ized mental patients who had committed suicide. 
These empirically derived items were cast to- 
gether in the form of a checklist and entitled the 
Suicide Potential Scale (SPS). The SPS construc- 
tion was completed in 1965 and staff-selected po- 
tential suicides provided the only means for 
“cross-validation” of the instrument. With the 
passage of 3 years, there became available an- 
other sample of suicides, and hence, the present 
study was undertaken. 

For the original research, study groups were 
established by matching 16 psychiatric patients 
who had committed suicide with nonsuicidal pa- 
tients on the variables of sex, age, marital status, 
and diagnosis. The two groups were then com- 
pared on routine demographic and psychiatric 
information that was collected prior to the inci- 
dence of self-destructive behavior. The SPS was 
constructed from 23 items which were found to 
discriminate the two groups, plus an additional 
three based upon variables employed in the match- 
ing. 

The Ss for the present study were drawn from 
the patient population of the Fort Logan Mental 
Health Center, Denver, Colorado. The experi- 
mental groups consisted of 15 patients who had 
committed suicide during the period from June, 
1965 to June, 1968. A nonsuicidal control group, 
totaling 30, was selected randomly from the re- 


1 An extended report of this study may be obtained 
without charge from R. W. Miskimins, Fort Logan 
Mental Health Center, 3520 West Oxford Avenue, 
Denver, Colorado 80236, or for a fee from the 
American Science Information Service. Order NAPS 
Document NAPS-00271 from ASIS National Aux- 
iliary Publications Service, c/o CCM Information 
Sciences, Inc., 22 West 34th Street, New York, New 
York 10001; remitting $1.00 for microfiche or $3.00 
for photocopies. Requests for reprints should be sent 
to R. W. Miskimins at the above address. 
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mainder of the hospital population. These 
patient groups were scored on the original SP 
The suicide group mean score was 32.60, as com- 
pared to 22.37 for the control group (t =6. 
~<.001). An item analysis was undertaken 
cull out the least discriminative items, both 
increase the scale’s precision and to decrease 
length. This procedure suggested the elimina 
of 10 low-differentiation items; the remaining 
were used to construct the Revised Suicide 
tential Scale (RSPS). The 16 RSPS items we 
based on the following characteristics: sex and 
age, diagnosis, times admitted, marital statu: 
education, preoccupation, slowing of thought, la 
guage use, anger, depression, apathy, inapproprial 
behavior, social pattern against, impaired 
tiveness, external precipitating stress, and dange 
to self. 
All six study groups, from the original 
present research, were scored on the RSPS. 
mean scores for these samples (and the Ns on 
which they were based) were as follows: (a) 
Original Suicides, 30.06 (16); (b) Replicatio 
Suicides, 28.20 (15); (c) Potential Suici 
28.75 (24); (d) Matched Controls, 19.06 (16) 
(e) Original Random Controls, 23.09 (116); ( 
Replication Random Controls, 16.77 (30). It ca 
‘be seen that the mean scores for the two group 
of suicide Ss (and the unit selected potential 
suicides) are all quite distinct from those ob> 
tained for the three nonsuicidal groups. For all 
nine possible comparisons between suicidal and 
control groups (two suicide groups and one hig] 
potential group, each contrasted with the th 
nonsuicide samples), the differences obtaine 
were significant beyond the p < .001 level. Th 
results clearly support the validity of the revise 
SPS. ; 
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two significant re-evaluations 
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PSYCHOTHERAPEUTIC AGENTS 


New Roles for Nonprofessionals, Parents, 
and Teachers 


Edited with commentaries by Bernard G. Guereney, 
Jr., Rutgers, The State University 


Through articles and editorial commentary, Profes- 
sor Guerney explores remedial methods and pro- 
cedures based on the strategy of using nonprofes- 
sionals as trained and supervised agents of the 
professional psychotherapist. The book considers 
in detail problems and solutions in selecting and 
training nonprofessionals, and includes detailed 
descriptions of therapeutic projects involving non- 


professionals. 
April 1969 608 pages $9.95 


A new revision 


concept. 


Order from: 


of major significance . . . 


APA PUBLICATION MANUAL 


1967 Revision 


The APA PUBLICATION MANUAL has unde 
10 years. Although the revised edition prescribes e n 3 
previous edition, major changes and additions were made in the text in an effort to clarify and 


raone a complete revision for the first time in 


simplify the instructions on how manuscripts should be prepared. 


The APA PUBLICATION MANUAL is intended primarily as a guide for authors and editors 
in the preparation of material to be published in APA journals. However, in preparing the orig- 
inal edition it was recognized that the MANUAL could, and should, be written to accommodate 
a wider range of manuscripts. The revised PUBLICATION MANUAL has retained that original 


AMERICAN PSYCHOLOGICAL ASSOCIATION 
1200 Seventeenth Street, N. W. 
Washington, D. C. 20036 


HELPING AND HUMAN RELATIONS 


A Primer for Lay and Professional Helpers 
Volume |: Selections and Training 


By Robert Carkhuff, University of Massachusetts, 
Amherst 


The first of a two-volume series, this volume deals 
creatively and pragmatically with the effective 
selection and training of lay and professional 
helpers for social work, counseling, clinical work, 
and psychiatry, as well as other areas of human 
relations. 

$7.50 (tent.) 


August 1969 288 pages 


Holt, Rinehart and Winston, Inc. 


383 Madison Avenue, New York, New York 10017 


e same basic format for manuscripts as the 
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EUROPE 


To utilize the low group fares 
of the major airlines — 
especially in connection with 
the International Congress of 
Psychology in London July 
27, and the Annual American 
Psychological Association 
‘Convention in Washington + 
August 31, 1969— we have 
scheduled several groups to 
London, the Continent, the 
Iberian Peninsula, Scandinavia, 
Yugoslavia and Greece. Round 
trip jet fares start at $230 
(New York/London.) Write to 
Division “P” Travel Whole- 
salers International, ITT 
Building, Washington, D.C. 
20036, or call collect (202) 
296-9161. 


Consider this important 
new text... 


CASEBOOK ON ETHICAL 
STANDARDS FOR 
PSYCHOLOGISTS 


Ł 

The application of the 19 principles com- 
prising the Code of Ethics for all psychol- 
ogists is presented. Each principle is il- 
lustrated by actual, but disguised, cases 
involving alleged violations of the principle 
and action taken by the Committee on Scien- 
tific and Professional Ethics and Conduct. 


Four Appendixes state and amplify the 
additional principles applicable to specific 
areas of psychology: 


—Ethical Practices in Industrial Psychology 


—Aulomated Test Scoring and Interpretation 
Practices 


—Standards for APA Directory Listings of 
Private Practice 


—Guiding Principles for the Humane Care and 
Use of Animals 


The arrangement of the casebook into a 
general section of cases and four additional 
specific sections increases its versatility as a 
reference book for individuals and as a text 
for instructional purposes. 


Price: $1.00 


Order From: 


AMERICAN PSYCHOLOGICAL 
ASSOCIATION 

1200 Seventeenth Street, N. W. 

Washington, D. C. 20036 


The Difference 


between 
continuing 
delinquency 
and a good 
citizen 

is personal 
interest and 
individual care 
under 
professional 
direction. 


.. America’s most arrested 
person is only 15 

years of age. 

His criminal 

career 

began at age 9, 


It Can Be 
Dealt With 


delinquents since 1945, 
From every state, of 

every race and religion, 
some as young as 12, others 
as old as 18... more than 
80% have never been arrested 
after graduation from SFBH. 
The chain of delinquency 
broken, they are citizens, 
taxpayers, good husbands, 
good fathers, 


The Right Goal 


Mere custody is not enough. The goal must be a change inside 
the young man himself — enough change to keep him out of 
trouble, enough change to enable him to live successfully in 
our society. 


For information write: 
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Brief Reports 


The Journal of Consulting and Clinical Psy- 
chology will accept Brief Reports of research 
studies in clinical psychology for early publication 
without expense to the author. The procedure is 
intended to permit the publication of soundly de- 
signed studies of specialized interest or limited 
importance which cannot now be accepted because 
of lack of space. Several pages in each issue will 
be devoted to Brief Reports, published in the 
order of their receipt without respect to the dates 
of receipt of the regular articles. Most Brief Re- 
ports appear in the first or second issue to go to 
press following their final acceptance. 


An author who wishes to submit a Brief 
Report: 

1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the National 
Auxiliary Publications Service (NAPS) to insure 
indefinite availability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 

4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 
Brief Report. The Brief Report should give a 
clear, condensed summary of the procedure of 
the study and as full an account of the results 
as space permits. 


To insure that the Brief Report will be no 
longer than one printed page, its typescript, in- 


cluding all matter except the title and the 
thor’s lines, must not exceed 85 lines averag 
42 characters and spaces in length. Set the 
writer margins for short lines of 42 charac 
which are 3.5 inches long in elite typing, and 4.2 
inches long in pica. ` 


The manuscript of the Brief Report must 
double spaced throughout. Except for its short 
lines, it follows the standard style of the 1 
revision of the APA Publication Manual. 
ings, tables, and references are avoided or, 1 
essential, must be counted in the 85 lines. 
Brief Report must be accompanied by a footnote 
in the style below, which is typed on a separate’ 
sheet and not counted in the 85-line quota: 


1 An extended report of this study may be ob= 
tained without charge from John Doe (giving th 
author’s full name and address) or for a fee from 
the American Society for Information Science. Ord 
NAPS Document from ASIS National At 
iary Publications Service, c/o CCM Inform: 
Sciences, Inc., 22 West 34th Street, New York, New 
York 10001; remitting $—— for microfiche or $ 
for photocopies. 


Extended report. Because the extended repoti 
is intended for photoduplication, and is not cop 
to be sent to a printer, its style should differ i 
several ways from that of other manuscripts: 
(a) The extended report should be typed ¥ 
single spacing for economy in duplication. (b 
Tables and figures should be placed adjacent to 
the text which refers to them. A caption shoul 
be typed below each figure. (c) Footnotes shoul 
be typed at the bottom of the page on whic) 
reference is made to them. In other respects, tht 
full report is prepared in the style specified by 
the Publication Manual. 


Style of Manuscripts. Manuscripts must be red 
dere ei gt Publication 7 Monat oh the 
7 'sycholo, ssociation (1967 Revisi 
TeS for $1.50 from the CAE office < 2: 


Early Publication. Authors may secure thi 1; 
pablo of a acceptable article in the first Neue 
go to press after its acceptance by arranging fi 
the article to be printed fea extra pages increasing 
He es Ji pages A subscribers receive, Informa- 

ion about or earl; blicati 
tained from the Managing Editor. Ss Ua 


Noona aaun a E Service. To save 
rinting costs, materials supplemen to articles ma; 
B deposited with the NAPS. To obtain copies, dedet 
by Document Number, as cited in the reference foot- 
note, from ASIS National Auxiliary Publications 
Service, c/o CCM Information Sciences, Inc., 22 West 
34th Street, New York, New York 10001. The indi- 
cated fee must be remitted in advance. 


Abstracts. Manuscripts must be accompanied by & 
abstract of 100-120 words typed on a separate sheet @ 
The abstract should conform to the sty 


when the requester is the author of the article to, 
printed, or the article is to be reproduced in a 
number of copies solely for instructional purposes: 


Reprints. Authors receive 50 reprints 
Seas quantities may be ordered when 
proofs. 
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BEHAVIOR THERAPY: 


OBSERVATIONS AND REFLECTIONS 


MARJORIE H. KLEIN 


Wisconsin Psychiatric Institute, University of Wisconsin 
ALLEN T. DITTMANN, MORRIS B, PARLOFF 
National Institute of Mental Health 
anb MERTON M. GILL 


Downstate Medical Center, State University of New York 


This paper presents impressions based on 5 days’ observation of the clinical 
activities of Joseph Wolpe and Arnold Lazarus at the Eastern Pennsylvania 
Psychiatric Institute. Behavior therapists now treat a wider range of patients 
and use a greater variety of treatment methods than before. Treatment is 


becoming longer and more complicate 


d, with concomitant lowering of success 


rates, There are points where behavior therapy is not so simple and straight- 
forward as the popular stereotype of it: clinical inference is essential in diag- 


nosis, hierarchy selection, patient manage! 
of suggestion in the patient-therapist rel: 


ment, and ongoing evaluation. Elements 
lationship may play a powerful role in 


treatment, Implications of these impressions for research and clinical training 


are discussed. 


Like many clinicians and researchers today 
we have been impressed by claims made for 
behavior therapy: its superior effectiveness, 
straightforwardness, efficiency, flexibility, and 
objectivity. In order to learn more about how 
behavior therapists function we undertook to 
spend a period of time observing ongoing be- 
havioral treatment under the guidance of two 
of its most prominent proponents, Joseph 
Wolpe and Arnold Lazarus. In this paper we 
wish to present some questions, comments, 
and inferences based on observations of their 
Clinical activities which we were privileged to 
make during a typical week at the Eastern 
Pennsylvania Psychiatric Institute. Over 5 
working days we saw both private and demon- 
stration sessions for cases in various stages 
of treatment, and occasionally were able to 
interview patients. We had access to tape 
recordings of other treatment sessions, at- 
tended staff meetings and case conferences, 
and discussed many issues with both Wolpe 
and Lazarus. We are aware that 1 working 
week is in many ways an unusual length of 
time for outsiders to visit a clinical operation, 
but realize that a different week may have 
brought different patients and yielded other 
impressions. 


At the outset we wish to express our appre- 
ciation for the extraordinary cooperation ‘of 
our hosts and our deep respect for their un- 
usual openness. Wolpe and his associates were 
both patient and helpful in their efforts to 
answer our questions, many of which we must 
admit were less than gentle and friendly and 
reflected our own psychoanalytically oriented 
training, experience, and biases. In the inter- 
ests of accuracy in writing this paper, we sent 
drafts to both Wolpe and Lazarus, and they 
made a number of comments about it. Many 
of these concerned misstatements of fact on 
our part, and we have incorporated these by 
making the necessary corrections. Others were 
comments about interpretation, and we have, 
with their kind permission, included them in 
the article, separated from the body of our 
text by the use of smaller type size identifica- 
tions of the commenters. We are greatly in- 
debted to Wolpe and Lazarus for making this 
method of presentation possible. 

Therapy as practiced by Wolpe and his col- 
leagues is among the most widely used form 
of behaviorally oriented treatment. It was first 
described fully by Wolpe in 1958, with a more 
technique-oriented exposition including new 
developments recently available (Wolpe & 
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Lazarus, 1966). Complete unanimity of ap- 
proach to theory and technique is not to be 
found in any so-called school of psycho- 
therapy, and behavior therapists are no excep- 
tion. Even the two we visited are quite dif- 
ferent on some issues, Lazarus being a good 
deal more eclectic in his choice of techniques. 
Still, there are some basic general principles. 
The chief of these has to do with reciprocal 
inhibition, in which the fear response (or 
some other maladaptive response) to those 
situations which form the basis for the 
patient’s complaint is inhibited by an incom- 
patible response, usually a positive one, The 
positive responses most frequently used are 
relaxation and self-assertion. Relaxation is 
brought about by a modification of Jacobson’s 
technique, and for each symptom the patient 
is instructed to imagine situations like those 
he fears, arranged in hierarchical order from 
least to most fear-producing. When the pa- 
tient is able to imagine each situation and 
remain relaxed the therapist goes on to the 
next hierarchy scene. Where self-assertion is 
used as the new response, the patient is in- 
structed in the relevant procedures for per- 
formance between sessions. Treatment is con- 
sidered successful when all major symptomatic 
complaints are reorganized in this fashion. 

Although most of what we encountered has 
been spelled out explicitly in the writings of 
Wolpe and others, some of what we saw sur- 
prised us, and led us to understand the litera- 
ture more fully when we returned to it later. 
In addition, there were a few impressions 
which have not, as far as we know, been 
spelled out completely in any publication. We 
shall note these as we come to them and 
State them rather tentatively. We shall begin 
with a description of the range of patients 
who came in during our week of observation 
and of the techniques we saw employed and 
shall go on to discuss the implications of what 
we have seen for research in psychotherapy 
and for programs of clinical training. 


Patients 


Consistent with Wolpe’s emphasis that his 
method has wide applicability, cases we ob- 
served under treatment went well beyond 
simple phobias or social anxieties (cases for 
whom behavior therapy has been thought to be 
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a specific) to include a broad range of comple 
psychoneurotic problems, character neuroses 
or borderline psychotic problems (acute 
psychoses are excluded). Many had been re. 
sistant to prior therapeutic efforts. To the 
clinician of ordinary persuasion a number oí 
these patients would seem to have extremely 
poor prognosis and present a formidable 
challenge. They were not particularly gratify. 
ing patients for the behavior therapist either 
These patients were often unwilling or unable 
to meet what are considered the basic require. 
ments for behavior therapy or to perform cor. 
rectly the therapeutic tasks that were as 
signed. In some of the more complex cases we 
observed (and in others we were told about) 
the patients were not optimally relaxed o1 
would at times exaggerate or misreport thei 
levels of relaxation. On occasions inquiry 
would reveal that the hierarchy scene actually 
“imagined” was considerably different from 
that which the therapist had presented. As a 
consequence new cues or response dimension: 
were often introduced, reducing or heighten: 
ing anxiety levels spuriously. In addition 
compliance with the very specific instruction: 
for response performance outside the therapy 
setting seemed equally difficult at times to 
obtain or to verify. 

In general, it was our impression thal 
broadening the clientele for behavior therapy 
to include “less good” cases has introduced 
inevitable strain on the method and potential 
depression of its outcome rate. It should be 
noted that behavior therapists have not by 
plan increased the range of problems they 
treat. Rather, as they have become better 
known, they have been referred a broadet 
spectrum of patients, a number of whom, 4 
we have said, had experienced unsatisfactory 
results in the care of others. 


Techniques 


From our acquaintance with the literature 
we knew intellectually at least that behavi0! 
therapists do not work in a unitary fashion 
and indeed take pains to vary their approach 
from case to case. We were surprised to find, 
however, that within most cases, too, a num 
ber of manipulations were routinely employ¢ 
Even where desensitization was the primaly 
technique, others such as assertiveness oF & 
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pressiveness training, manipulation of behav- 
ior outside the treatment setting, and educa- 
tion in learning principles were also included. 
With the more complex cases the spectrum of 
techniques became even broader. In addition 
to desensitization involving a variety of hier- 
archies, many variants of assertion and ex- 
pression, often elaborate role-playing and 
behavioral programs of homework were de- 
vised to correct response deficiencies. Along 
with these specific procedures we found also 
that patients were given a good deal of in- 
doctrination, teaching, and exhortation, ap- 
parently intended to provide a rationale for 
the treatment and to enhance motivation. 

These remarks may well sound like com- 
plaints by outsiders that behavior therapists 
use a “shotgun” approach. But such an ac- 
cusation would be unfounded, since the many 
techniques do have a unity in the behavior 
therapist’s view in that they are thought 
either to produce reciprocal inhibition of anx- 
iety or to alter stimulus-response contingen- 
cies. It would make no more sense to insist 
that the behavior therapist be limited to one 
behavioral manipulation than it would to in- 
sist that another kind of therapist restrict 
himself to one content area or interview tech- 
nique. It is appropriate, however, to note the 
apparent contradiction between the prolifera- 
tion of methods in behavior therapy and the 
popular conception, based partly on hope, 
partly on the behavior therapists’ writings, 
that this is a simple and straightforward 
treatment of the neuroses. It is also appropri- 
ate for us to look into the research implica- 
tions of a many-method approach, as we shall 
do later on in this paper. 

The popular notion of simplicity in behav- 
ior therapy covers not only treatment tech- 
niques but diagnostic procedures as well. 
Many people suppose that the therapist begins 
by clearly and systematically defining the pa- 
tient’s problems in terms of manageable hier- 
archies and then selects appropriate responses 
to be strengthened or weakened. We found 
little support for this conception of behavior 
therapy diagnosis in our observations. Indeed 
the selection of problems to be worked on 
often seemed quite arbitrary and inferential. 
We were frankly surprised to find the pre- 
senting symptomatic complaint was often 


sidestepped for what the therapist intuitively 
considered to be more basic issues. Most sur- 
prising to us, the basis for this selection 
seemed often to be what others would call 
dynamic considerations. The distinction be- 
tween “secondary” (that is, the superficial) 
and “primary” (the more basic, underlying) 
was even made openly on occasion by behav- 
ior therapists: The words we have put in 
quotes or their terms. The literature, of 
course, gives no hint of this development. 


JW: This conveys the impression that the writers 
assume that “more basic” in a behavioral sense is 
the same thing as the psychoanalysts mean. This is 
really quite incorrect. For example, it may be found 
that a person’s fear of enclosed spaces is based on 
a fear of developing certain physical symptoms that 
may lead, as he believes, to his death, and he avoids 
enclosed spaces because once within them he cannot 
easily run out and seek help. It is the fear of his 
symptoms that would then be the primary focus of 
this person’s treatment. What is involved here is the 
pinpointing of the time stimuli to anxiety, but these 
are still extrinsic, and no reference is made to “intra- 
psychic mechanisms.” 

AAL: One must be careful when using a term like 
“dynamic trends.” A behavior therapist may trace a 
patient’s anxiety to faulty identification with an 
inadequate father coupled with excessive sibling 
rivalry, compounded by ambivalence engendered at 
the hands of an inconsistent mother. However, these 
basic and formative interpersonal encounters are then 
taken at face value and do not presuppose additional 
“dynamics” such as “castration anxiety,” “incestuous 
wishes” or “death wishes,” etc. As to the statement, 
“The behavior therapy literature, of course, gives no 
hint of this development,” may I refer you to 
Lazarus (1966). 


We also learned that behavior therapists 
prefer to structure hierarchies on dimensions 
of time and space (for example, “a month 
before an examination,” and leading up to the 
time of an examination; “standing in a hall 
a mile away from the hospital” and gradually 
approaching it; treading water in a pool 5 feet 
deep and 5 feet from the edge,” working up 
to greater depths and distances). The ex- 
amples of hierarchies reported in the behavior 
therapy literature are generally much more 
flexible and lifelike than the ones we en- 
countered during our visit. 

One point which the behavior therapists’ 
writing makes quite clear but which did not 
strike us until our visit has to do with the 
ongoing evaluation of the patient progress. 


262 


Because behavioral treatment is posited as 
highly specific, it follows that success depends 
on the patient’s exact and close cooperation 
with the therapist’s instructions. It is there- 
fore very important for the therapist to test 
this cooperation repeatedly during treatment. 
The therapist must also constantly assess the 
patient’s progress on hierarchy dimensions. 
With the possible exception of role-playing, 
the therapist is dependent in this evalution 
upon the patient’s report both of progress 
outside treatment and of events within the 
sessions themselves. Since there are no inde- 
pendent procedures for evaluating or verify- 
ing his report, the patient has considerable 
leeway to bias his report in order to please, 
frustrate, or otherwise manipulate the thera- 
pist, or to meet some personal expectation. 
And the form of much of the feedback from 
the patient (i.e., lifting his finger if he feels 
an increase in anxiety during desensitization 
or doing nothing if he does not) gives the 
therapist very few cues for distinguishing 
valid from invalid reports. Thus the therapist 
must use considerable intuition to assess 
progress and correct the treatment plan. This 
all serves to highlight a very basic discrepancy 
between the theoretical orientation of behavior 
therapy and its actual practice. While the 
theory clearly calls for the manipulation of 
overt behavior, the therapist typically deals 
primarily with the patient’s report of his 
image of that behavior. 


JW: We all have “images” of everything we talk 
about. If a patient says that he has a feeling of 
anxiety, this is presumed to be his image of some 
responses within him. Except insofar as this kind of 
image cannot be shared by another observer, there 
is no difference between such a report and the report 
of something perceived in the world outside the 
Subject. Either could be false: but what would 
motivate patients to mislead their helpers? And what 
insinct would make them all lie according to the 
same rules with regard to their emotional responses 
to repetitions of an image—that is, they report either 
that anxiety is weak and progressively diminishes, 
or that it is strong and does not? 

AAL: It is incorrect to state that, “the theory calls 
for manipulation of overt behavior.” All forms of 
behavior are covered by the theory, including 
“thinking behavior.” 


Perhaps the most striking impression we 
came away with was of how much use behav- 
ior therapists make of suggestion and of how 
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much the patient’s expectations and at 
are manipulated. Behavior therapists ; 
at all silent on this point in their des 
of technique, but the literature did ni 
pare us for the unabashed suggestions 
therapists directed toward their patients, 
major arena for suggestion is in the orie 
tion period of treatment. Here the 
tells the patient at length about the p 
of the treatment method, pointing out 
has been successful with comparable pat 
and all but promising similar results for] 
too. The patient is provided with a det; 
learning-theory formulation of the etiolog 
his problems and is given a straightforw 
rationale for the way in which the spe 
treatment procedures will “remove” his 
toms. The patient’s motives and values | 
also be considered so as to “correct mis 
ceptions” which block desirable course: 
action or restrict the effect of treatm 
Indeed it seemed to us that treatment 
and goals were laid out in such a detail t 
the patient was taught precisely how thi 
would proceed and what responses 
changes were expected of him all along 
way. A quite complete account of these 
cedures may be found in Wolpe and Lazi 
(1966, pp. 16-20). 


AAL: Both Wolpe and I have explicitly stai 
relationship variables are often extremely im 
in behavior therapy. Factors such as warmth, 
pathy, and authenticity are considered necessal 
often insufficient. 


Although Wolpe and other behavior 
pists are reluctant to ascribe therapeu 
fectiveness to these features of the r 
ship it is difficult for us to believe that 
do not constitute an important part 0 
treatment. Certainly the explicit, positive, 


designed to establish the therapist as a po 
ful figure and turn the patient’s hop 
success into concrete expectations. The 
ductory education in learning theory, in 
tion, must function to make the tre 
more plausible and provide a simp! 
coherent frame of reference for the 
understanding of his difficulty. Fu 
focus of the treatment philosophy on 
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of external psychonoxious environmental fac- 
tors in the formation of the patient’s problems 
must be quite reassuring for many patients, 
At the very least the therapist’s evident will- 
ingness to assume major responsibility for 
correcting the patient’s problems may be es- 
pecially important in helping him overcome 
inertia and start on the path to change. 


JW: Paul’s (1966) study in which psychoanalyti- 
cally oriented therapists obtained significantly better 
results with desensitization than with either their own 
insight methods or with another clearly structured 
method indicates a relatively small role for the 
factors suggested here. 


The Course of Treatment 


As behavior therapists have gained more ex- 
perience with their methods and behavior ther- 
apy has been more widely used, a number of 
complementary trends seem to have emerged; 
techniques have become more polished, tend- 
ing to shorten the treatment of specific cases, 
but at the same time patients with more com- 
plicated problems have been seeking behavior 
therapy, tending to lengthen treatment. In the 
first years of Wolpe’s experience with behav- 
ior therapy the average number of interviews 
per case rose from 25 (Wolpe, 1952, p. 827) 
and 26 (Wolpe, 1954, p. 217) to 45 (Wolpe, 
1958, p. 218), but declined sharply there- 
after: Lazarus reports an average of 14 inter- 
views per case in 1963 (Lazarus, 1963, p. 75). 
At the present time it is our impression that 
the length of treatment is on the increase 
again. Some of the patients we observed had 
been in treatment for over a year. In others 
the number of interviews per week had in- 
creased dramatically. One patient was being 
seen four times a week, and this had been 
going on for approximately 9 months. We 
felt that these trends reflected the shift in 
the case load to include more complex or seri- 
ously disturbed cases. Also related are changes 
over the years in reported outcome rate. From 
the over 90% improvement originally reported 
(see Wolpe, 1952, 1954, 1958) the claims 
have been modified to approximately 807% 
(Lazarus, 1963), with some of Wolpe’s asso- 
ciates now reporting 70% rate. These figures 
cannot be taken literally as they were not 
derived from data specifically designed to 
yield comparative outcome rates—indeed the 
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last figure is purely anecdotal. What we wish 
to convey by referring to them is that behav- 
ior therapy seems to be taking longer and 
showing less spectacular successes. Why this 
should be so could be the topic of extensive 
study. It should be noted, however, that 
these trends are familiar ones among psycho- 
therapies as they are introduced and find 
their way. 

One final impression about the course of 
treatment: It is interesting to note that the 
ongoing argument between behaviorist and 
other methods concerning symptom substitu- 
tion seems to be taking a new turn. When 
confronted with a patient who is successfully 
treated for his initial array of problems, but 
continues to report disabling complaints, the 
therapist talks not of symptom substitution 
but of faulty or inadequate treatment. It may 
be argued, for example, that (a) the original 
formulation of the case did not consider the 
correct stimulus contingencies; (b) “second- 
ary” anxieties may have masked the more 
“primary” or situational pervasive anxieties; 
or (c) the patient did not comply with the 
instructions or work hard enough to extend 
his learning outside the treatment setting. 
While it may be commendable for behavior 
therapists to assume primary responsibility 
for slow or uneven progress in their patients, 
the distinction between backsliding due to in- 
adequate treatment and symptom substitution 
is certainly not as obvious to outsiders (es- 
pecially to dynamically oriented ones) as it 
is to behavior therapists. In any case it is a 
distinction which is difficult to verify. 


JW: Symptom substitution means replacement of 
one symptom by another. It is not symptom substi- 
tution if a patient complains of symptoms that he 
had before treatment began, though other symptoms 
have been removed by therapy. “Backsliding” is in 
no sense commonly observed. 

AAL: “Symptom substitution” is an unfortunate 
term. It presupposes that unresolved unconscious 
complexes result in the terminal manifestation of 
new problems. Patients can always acquire new aber- 
rant responses (e.g., when faced with inimical life 
situations). Is this “symptom substitution?” 


Research Implications 


In this section we shall consider a number 
of implications our observations have for be- 
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havior therapy research. It should be clear, 
and let us stress it if it is not, that these 
problems are by no means unique to research 
in behavior therapy. Indeed, what struck us 
most forcefully was the fact that the issues 
and research problems which confront behav- 
ior therapy are the same familiar ones which 
have long troubled all psychotherapy research. 

The first and most obvious research 
problem behavior therapists must face is a 
consequence of the diversity of their tech- 
niques. While it is true, as we said earlier, 
that behavior therapists believe all of their 
techniques are united in their derivation from 
theories of learning, the question of the exact 
function and effectiveness of each technique is 
still an empirical one. Research questioning 
which of the many techniques is best suited 
to which patient, and which produces which 
result, would teach behavior therapists how 
to make their treatment more effective and 
efficient. 

In making these points let us stress again 
that we are not criticizing behavior therapy 
for its technical variety; the range of patients 
seen certainly calls for flexibility in treatment, 
and the underlying experimental and empirical 
philosophy demands constant technical ad- 
justments, Nonetheless, it remains that the 
question of the theoretical utility of a many- 
faceted method, whatever its theoretical per- 
suasion and whatever its clinical value, has 
never been adequately dealt with in psycho- 
therapy research. Thus the fact of variety in 
behavior therapy makes it as difficult to talk 
of behavior therapy as a single method of 
treatment to be evaluated and compared with 
others as it is to talk of a single method of 
insight therapy. The subtleties and variations 
within methods may be as important as the 
unifying theories. It may be possible to com- 
pare therapeutic approaches on some dimen- 
sions—behavioral symptoms versus uncon- 
scious conflicts as the “core” of neurosis, for 
example, in comparing behavior and insight 
therapies—but it must be remembered that 
the dimensions are theoretical ones. They may 
relate only indirectly to the specific therapist 
activities during the treatment session, and 
there may be equally important variables 
that may be overlooked because they do not 
happen to be in the theoretical spotlight. 
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In our earlier section on technique we em. 
phasized the role of suggestion in behavior 
therapy practice. We did this because we were 
surprised at the extent and consistency of its 
use. The research implications are obvious: 
Even though these factors are not called tech- 
niques, in that they do not follow from the 
theoretical framework, they nevertheless have 
an effect on the patient, and these effects must 
somehow be understood and differentiated 
from the effects of the theoretically important 
“techniques.” 


JW: Suggestion is nothing but the use of words to 
obtain the responses that are associated with these 
words, If deliberate evocation of responses in this 
way brings about habit change, it is just as much 
behavior therapy as habit change through responses 
otherwise contrived. Words are probably used with 
this intent more consciously by behavior therapists 
than most other therapists. 

AAL: The statement that suggestion as a technique 
does not follow from the theoretical framework is 
not correct. “Suggestion” can only have an effect by 
virtue of responses already in the person’s repertoire 
(i.e., previously conditioned cues) which the sug- 
gestions come to elicit. Furthermore, if suggestion 
enables the person to attempt new responses, these 
may have positive effects. One thus endeavors quite 
deliberately to maximize the “placebo effect.” 


Again, this problem is not unique to be- 
havior therapy, but again, it is the popular 
notion that behavior therapy is simple and 
straightforward, behaviorally oriented, and 
thus untrammeled by these side effects. In 
practice, all therapists make suggestions, by 
telling their patients about the methods they 
employ, and by explaining why they believe 
these methods will be helpful. The particular 
behavior therapists we observed were experi- 
enced clinicians (whose training included dy- 
namic methods), and they went about theif 
business very confidently and skillfully. Much 
of what they did was what any clinician 
does in dealing with patients, and they did it 
by second nature, so to speak. The point is 
that all of these activities, the second-nature 
ones, the overt suggestions, the specific 
“technique”—all must be examined for theif 
effects on the patient. 


AAL: Recent experimental results lead me to en- 
dorse the authors’ contentions here. Indeed, even the 
results of a specific technique like systematic desensi- 
tization cannot be accounted for solely in terms © 
graded hierarchies and muscle relaxation. 
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The more complex cases highlight some 
other difficult questions for assessing the out- 
come of behavior therapy. It is generally 
assumed that the behavior therapist, basing 
his judgment on change in manifest symp- 
toms, enjoys a simpler criterion of outcome 
than does the “dynamic” therapist whose 
criteria are cast in such inferential terms as 
conflict resolution or structural personality 
change. But the apparent objectivity of 
symptomatic change is lost when the behavior 
therapist treats patients who present complex 
symptom pictures. Unless all symptoms are 
successfully removed, one must decide in 
assessing outcome whether all symptoms are 
to be equally weighted, whether only those 
symptoms mentioned at intake should be 
counted, or whether complaints emerging later 
should be given secondary status. Questions 
like these are certainly not news to behavior 
therapists, any more than they are to thera- 
pists of other persuasions. In fact, it looks 
as if they may have already begun to 
work toward some answers although per- 
haps unwittingly: The differentiation between 
“primary” and “secondary” problems which 
we mentioned earlier may be a first try at 
ordering symptoms for therapeutic attack. 
That this differentiation was made on the 
dynamic basis was our inference from our 
observations. Whatever the basis, it must be 
defined more clearly before it can be used in 
designing research studies. 

One more research implication of the com- 
plex cases: Behavior therapists have relied 
almost exclusively on patient reports, both for 
evaluating the success of manipulations within 
a given session and for assessing the outcome 
of the entire case. In general it was our im- 
pression that the more disturbed patients were 
especially unlikely to give reliable reports. 
The problem seemed to make it difficult for 
therapists to handle these patients clinically. 
It brings about insuperable difficulties in com- 
paring patients with each other in a research 
study, or for that matter in comparing two 
reports collected at different times from the 
same patient. The obvious way out of this 
problem is to rely more heavily on exter- 
nal measures, preferably behavioral ones, as 
criteria for the success of treatment. 
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Implications for Clinical Practice and 
Training 

At present there is a clearly discernible 
trend in a number of training institutions to 
emphasize behavior therapy considerably, 
both in theory and technique. We are con- 
cerned that such training practices not pro- 
voke an overreaction against “dynamic” 
theory and practice. It may be well to recall 
that the innovators of behavior therapy have 
each had considerable training and experience 
with classical psychodynamic orientations. 


JW: It seems to be suggested here that previously 
acquired psychoanalytic knowledge plays an integral 
part in the practical repertoire of behavior therapists. 
Earlier passages have implied that because the behav- 
ior therapist is not always content with “a simple 
mechanical acceptance of the patient’s statements re- 
garding his fears and anxieties,” and may go into 
probing activities for the sake of clarification to 
establish the antecedents, this is ipso facto psycho- 
analytic, It is in a sense analytic, but in every im- 
portant respect repugnant to whatever is specific to 
the psychoanalytic theory of neurosis, and different 
from the practices of psychoanalytic therapy. 


Despite their present enthusiasm for behav- 
ior therapy the fact remains that previously 
acquired skills still form an integral part of 
their repertoire. The effect is perhaps most 
clearly evident in the selection of patient- 
relevant hierarchies, which we have stressed 
before. Recall also our observations that 
Wolpe and his associates make very effective 
use of the patient-therapist relationship to 
establish a context in which the specific be- 
havioral techniques can be utilized most ef- 
fectively, Although the behavior therapist 
would not wish to ascribe his results to any 
“dynamic” theories he may have learned in 
the past, it is clear that many clinical deci- 
sions in the treatment are based on an under- 
standing of some functional organization of 
behavior, and an appreciation of the power of 
the relationship. 

The danger for training programs as we see 
it is that the trainee will be exposed princi- 
pally to learning theory and behavioral tech- 
niques without being adequately trained in 
other areas of psychology. It is especially 
those clinical skills of interviewing and diag- 
nosis, and those theoretical skills that grow 
with knowledge of a wide range of therapeutic 
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theories that prevent training from becoming 
sterile and one-sided. Without these skills the 
training will severely limit the competence of 
the therapist and restrict the quality of re- 
search and theoretical development possible 
within the field. Training programs which 
do not specifically attempt to develop a 
broad range of clinical skills are in danger 
of reducing the effectiveness of the entire 
profession. 


JW: With regard to the statement that behavioral 
training should avoid being one-sided, the answer 
is that a very good knowledge of all kinds of reac- 
tions and interactions is highly desirable. Our position 
is that it is not important to know a great deal about 
theories, but enough to obtain a historical perspective 
for present-day practices. From the point of view 
of the problems presented by patients, only proposi- 
tions that have some scientifically meaningful, factual 
support are worth anything at all. If behavior thera- 
pists have tended to ignore propositions from the 
psychoanalytic field, it has been because of a dearth 
of them that qualify for respect on this criterion. 

AAL: Do we need only a knowledge of general 
psychology and therapeutic theories to be effective 
clinicians or do we need in addition a course in 
sensitivity training and rational thinking combined 
with a mélange of specific techniques? 


Little benefit to the field of psychotherapy 
is to be gained from developing yet another 
closed system of treatment. We believe that 
the stimulus provided by the behavior thera- 
pist’s apparent willingness to examine the con- 
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sequences of his procedure may be a 
one for the field. Perhaps ultimai 
greatest contribution the behavior 
can make to psychotherapy theory 
tice is his dedication to making both tl 
ess and the outcome of treatment as ol 
and as efficient as possible. If the disc 
between behavior therapy and other 
can be subjected to empirical tests, 
ultimate aim of achieving a more 
theory of personality change may 
served. 
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ABSTRACTION ON PROVERBS IN PROCESS- 


REACTIVE SC 


ROBERT 


HIZOPHRENIA* 


CANCRO? 


The Menninger Foundation 


Fifty-one consecutive, drug-free, male 


admissions, with a senior-staff diagnosis 


of schizophrenia, were given the Phillips Prognostic Rating Scale (PRS) and 
the Benjamin Proverb Test shortly after entering the hospital. There was no 


relationship between classification by 
and duration of current hospital stay 


PRS and abstraction when drug effects 
were controlled. Using only those cases 


who had no previous history of psychiatric hospitalization, thereby controlling 
for previous and current hospitalization, still failed to produce a significant 
relationship. The suggestion of a racial effect emerged. This is discussed along 
with certain theoretical implications of the finding. 


The work of Goldstein (Bolles & Goldstein, 
1938; Goldstein, 1944; Goldstein & Scheerer, 
1941) suggested the hypothesis that schizo- 
phrenics showed a particular impairment of 
the abstract attitude, which could be differen- 
tiated from the characteristic impairment of 
an organic. More recently there have been a 
number of studies which have reported that 
process schizophrenics show a greater impair- 
ment of the abstract attitude than reactives 
(Becker, 1956; Herron, 1962; Johnson, 1966; 
Judson & Katahn, 1963; Meadow, Greenblatt, 
Funkenstein, & Solomon, 1953; Meadow, 
Greenblatt, & Solomon, 1953). 

Johnson (1966) comments on a number of 
methodological deficiencies in the earlier 
studies, highlighting particularly the impor- 
tance of using a reliable means of classifica- 
tion and controlling for the effect of vocabu- 
lary differences. There are even more serious 
methodological problems which have to be 
considered. ‘The use of chronically institu- 
tionalized and drugged patients introduces an 
indeterminate amount of variability. Reported 
differences in abstract attitudes between proc- 
ess and reactive cases can be a function of 
group differences in type, dosage, and respon- 
siveness to drugs. The effects of long-term 
institutional care are such that reactive cases 
who usually have been in the hospital for a 
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shorter period of time cannot be compared 
to process cases who have been in the hospital 
for longer periods of time. Chronic institu- 
tionalization is very similar to a subclinical 
sensory deprivation experience, and it would 
not come as a surprise if abstracting ability 
on the proverbs were to be affected by 
duration of hospital stay. The present study 
was undertaken to compare abstracting abil- 
ity on proverbs in process and reactive 
schizophrenics on whom these variables were 
controlled. 


METHOD 
Subjects 


Fifty-one consecutive male admissions to the psychi- 
atric observation wards of a large city hospital, who 
had a senior-staff diagnosis of schizophrenia, ranged 
in age from 18 to 44 and were fluent in English, 
were admitted to this study. The age range was 
selected in order to screen out early adolescent re- 
actions, involutional states, and premature senility. 
Patients whose clinical picture was complicated by 
organic, neurological, or extraneous factors, such as 
alcoholism, were rejected. Patients were not given 
tranquilizers or long-acting sedatives throughout the 
period of testing and examination so that perform- 
ance on the test of abstraction would not be 
contaminated by drug effect. 


Procedure 


Toward the end of the first week of hospitaliza- 
tion, the patient was given the 11 proverbs from the 
Benjamin Proverb Test (Benjamin, 1944). He was 
informed that the results of the study would not 
play a role in determining his length of hospital stay. 
The proverbs were orally administered with instruc- 
tions for the patient to give the most general answer 
that he could. If he responded with a concrete or 
only partially abstract answer it was accepted, and 
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he was encouraged to try to give a more general 
meaning. This was done in order to maximize the 
patient’s abstract response and to compensate for 
individual or group differences in initial motivational 
level. The proverbs were scored on the Meadow 
Scale (Meadow, Greenblatt, Funkenstein, & Solomon, 
1953) by an independent judge who had no knowl- 
edge of the patient’s process-reactive classification. 
The total score for the 11 proverbs was used as the 
measure of abstraction. 

Patients were interviewed, and the material ob- 
tained was used to fill out the Prognostic Rating 
Scale (PRS; Phillips, 1953), which served to rate 
the patients on the process-reactive continuum. Pa- 
tients were not divided into nominal categories on 
the basis of PRS, but rather were assigned their 
actual score for the calculation of Pearson product- 
moment correlations, The use of consecutive schizo- 
phrenic admissions raised concerns about the un- 
controlled effects of variables such as age, education, 
race, Wechsler Adult Intelligence (WAIS) vocabu- 
lary, and history of previous psychiatric hospitaliza- 
tion. Correlation coefficients between these variables 
and the PRS score were determined. As an additional 
precaution the population was divided so that pa- 
tients with PRS scores in the range of 2-14 were 
grouped as reactive (V = 20), and those with a PRS 
score between 15 and 32 (N=31) were grouped as 
process (Phillips, 1953). The two groups were then 
compared to see if they were adequately matched. 
Race was included as a variable because differences 
in the quality of education resulting from school 
segregation might affect the abstraction score. 


RESULTS 


There are no significant correlations be- 
tween the PRS score and age (r= —.13), 
years of education (r = .16), race (r = .16), 
WAIS scaled vocabulary score (r = .19), and 
history of previous psychiatric hospitalization 
(r = .10), The matching of the process and 
reactive groups on the cited demographic vari- 
ables is presented in Table 1. There are no 
significant group differences. Of the total 
sample, 54.9% of the patients had no his- 
tory of prior psychiatric hospitalization. The 
slightly higher incidence of previous hospitali- 
zation in the reactive group is not significant 
(x? = 1.071, df = 1), yet it raises some con- 
cern about the representativeness of the 
sample. The total group was followed for a 
period of 3 years after admission and the 
PRS did successfully predict future length of 
hospital stay for both process and reactive 
cases (Cancro & Sugerman, 1968). The mean 
duration of previous hospitalization in the 
reactive group was 4.25 months as compared 
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TABLE 1 


DEMOGRAPHIC MATCHING OF PROCESS AND 
REACTIVE GROUPS 


Item Process 
Mean age 28.3 
Mean years of education 11.3 


Mean scaled vocabulary (WAIS) 
score 9.4 


Proportion of nonwhite patients 29.0% 
Previous mental hospitalization 40.0% 
First mental hospitalization 60.0% 
Mean duration in months of 

previous hospitalization 9.06 


Note.—For the process group, N = 31; for the reactive 
group, N = 20. 


to the process group mean of 9.06 months, 
While this difference is statistically significant 
(t = 2.86, p < .005, one-tailed), it is not a 
very large one. f 

The correlation between the actual score on 
the PRS and the abstraction score is .005 for 
the total sample. The mean abstraction score 


active group is 7.2; these scores do not differ 
significantly. The use of recent admissions 
controlled for length of current hospital sti 
but did not control for duration of previous 
hospitalization. Taking those 28 patients who 
were admitted to a psychiatric facility for 
the first time and dividing them on the basis 
of their PRS scores into process and reactive 
categories revealed no mean differences in ab- 
straction. The mean abstraction score for the 
reactive group was 6.3, while the mean for 
the process group was 6.1, The correlation 
between the actual PRS score and the 
abstraction score on the 28 first admissions 
was not significant. 

There is a relationship in this sample be- 
tween race and history of prior psychiatri 
hospitalization. The white patients are mo 
likely to have been hospitalized before 
(phi = .35, p< .02, two-tailed). There aré 
weak relationships between race and age 
(r= .26, p < .10, two-tailed), and between 
race and abstraction (7 = .25, p< .10, two- 
tailed). The white patients tend, although 
not significantly, to be older and. to have 
higher scores on abstraction. Restricting the 
sample to nonwhite patients (WV = 15) brings 
out a weak relationship between scores on the” 


ABSTRACTION IN Process-REACTIVE SCHIZOPHRENIA 


PRS and abstraction in the expected direction 
(r= = 305 BO; one-tailed). The group 
means between nonwhite process and reactive 
cases differ weakly in the expected direction 
(t= 1.63, p < 10, one-tailed). 


DISCUSSION 


The failure to demonstrate a difference in 
abstract ability between process and reactive 
cases, when drugs and duration of current 
hospital stay are controlled, raises many ques- 
tions. Virtually all state hospital patients 
diagnosed as schizophrenics are on tranquil- 
izers. Those patients whose clinical pictures 
are more malignant, that is, the process cases, 
are most likely to receive large doses of the 
major tranquilizers. If having a slender body 
build is related to poor prognosis in schizo- 
phrenia, then we would expect process cases, 
as a group, to be getting even higher effective 
levels of drugs in terms of milligrams of tran- 
quilizer per kilogram of body weight per day. 
Reported linear relationships between abstract 
ability and normal, reactive, and process 
groups may be a function of differences in 
group levels of drug ingestion. 

While the current hospital stay was con- 
trolled, there was a difference between groups 
in mean duration of total previous hospitali- 
zation. It would be difficult to attribute the 
failure in demonstrating a difference in level 
of abstraction to the effect of a mean of 5 
months’ additional lifetime hospitalization. 
There is even less reason to assume that the 
slightly longer prior hospitalization of the 
process group would raise their abstract score. 
Nevertheless, the correlation between duration 
of previous hospitalization and abstraction 
was obtained and it was not significant. 

The results suggest the usefulness of con- 
trolling for possible racial effects on proverb 
tests of abstraction in schizophrenics. The 
quality of education, as well as its quantity, 
must be controlled if we are to use tests 
which are sensitive to educational differences. 
The greater disorganization of the process 
schizophrenic may manifest itself in the prov- 
erbs when the poorer quality of education 
prevents the “reflex” correct response, thereby 
producing apparent group differences in 
abstract ability. 

The value of proverbs tests in general as 
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measures of the abstract attitude in schizo- 
phrenia is open to substantial doubt. The 
abstraction score is determined in part by 
the correctness of the response and therefore 
indirectly by the absence of intrusions of 
schizophrenic thought. The proverbs were re- 
scored by a different independent judge, who 
had no knowledge of the abstraction or 
Phillips scores, for frequency of signs of 
schizophrenic thought disorder, using a scale 
developed by Cancro (1962). The correlation 
between abstraction and frequency of signs 
of thought disorder was —.38, p< 005. The 
greater the number of intrusions, the lower 
the abstract score. This finding supports the 
suggestion that proverb measures of abstrac- 
tion are weighted by the relative “cleanness” 
of the response. It is strongly recommended 
that new methods of measuring the abstract 
attitude that are not so vulnerable to educa- 
tional and psychopathological differences be 
developed. 
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ERRATA 


In the article “Size Constancy in Retarded Versus Normal Children: A Developmental 
pothesis” by Bruce J. Richman, Harold N. Kellner, and Dean A. Allen (Journal of Co ing 
and Clinical Psychology, 1968, 32, No. 5, pp. 579-582), a statement appearing below Table 1 is 
erroneous. Reanalysis has indicated that none of the between-groups comparisons of the means 
presented in that table are significant at p < .05. The results reported, therefore, must be 
garded as inconclusive and subject to caution in interpretation. : 


In the article “Hypnotist and Manner of Presentation Effects on a Standardized Hypnotic 
Susceptibility Test” by D. Eugene Thorne and Ernst G. Beier (Journal of Consulting and Clinica 
Psychology, 1968, 32, No. 5, pp. 610-612), there is a misspelling of J. V. Cabibi’s name on 
611 and again in the reference list. The reference should read: Cabibi, J. V., Hughes, H. H., 


Butler, J. R. Behavioral effects on the hypnotic process by a change in hypnotist. Journal 
Clinical Psychology, 1966, 21, 334. 


In the article “Relationship of Personality Characteristics to Persistence in Psychotherapy” 
by Ralph E. Van Atta (Journal of Consulting and Clinical Psychology, 1968, 32, No. 6, pp. 731- 
733), on page 731, column 2, line 23, delete the phrase “and by Sullivan, Miller, and Sme 
(1958) . . . .” After line 26 add the following: “Sullivan, Miller, and Smelser (1958) confirm: 


the association of educational and occupational level with both length of stay and improvem' 
in therapy.” 


Journal of Consulting and Clinical Psychology 
1969, Vol. 33, No. 3, 271-274 


CONSERVATISM AS A PREDICTOR OF HUMOR PREFERENCES 


GLENN D. WILSON ' 
Institute of Psychiatry, London 


AND 


JOHN R. PATTERSON 
University of Canterbury, New Zealand 


Of a sample of 139 high school students, humor preference profiles were com- 
pared for 30 Ss scoring highest on the Conservatism Scale, and 30 Ss scoring 
lowest. As predicted, high-conservative Ss tended to prefer “safe,” formal types 
of humor (e.g, puns), while low conservatives (liberals) expressed greater 
appreciation of transparently “libidinal” types of humor (sick and sexual). 
Results were interpreted as indicating the importance of individual differences 
in the extent to which it is necessary for appetitive content to be disguised 
by the structural aspects of the joke medium before humorous affect is evoked. 


Freud (1905) emphasized the distinction 
between tendentious humor, which has the 
function of giving expression to sexual and 
aggressive impulses (broadly defined), and 
innocent or nontendentious humor, which de- 
rives its effect solely from the techniques and 
conventions of the joke medium itself. In 
psychoanalytic theory, tendentious humor is 
successful only so long as the “joke facade” 
is effective in disguising the underlying libidi- 
nal intent from the consciousness of the 
audience, otherwise it evokes discomfort or 
disgust instead of humorous pleasure. 

Experimental studies (Gollob & Levine, 
1967; Singer, Gollob, & Levine, 1967) appear 
to support the psychoanalytic notion that an 
important function of humor is that of pro- 
viding a socially acceptable setting for the 
partial expression of normally prohibited im- 
pulses, and that intensification of the inhibi- 
tions relating to these impulses may interfere 
with the appreciation of humor. These have 
so far dealt only with situationally induced 
inhibitions relating to aggression, although the 
investigators do contemplate the possibility 
of “personality variables which determine 
differential susceptibility to this type of inter- 
ference [Singer et al., 1967, p. 568].” 

The Conservatism Scale (C Scale; Wilson 
& Patterson, 1968) seems promising in this 
respect since it may be regarded as measuring 
the extent to which “parental” attitudes and 
prohibitions have been internalized. High 
scorers on the C Scale may be assumed, on 
construct grounds at least, to be intolerant, 
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resistant to change, authoritarian, dogmatic, 
and antihedonistic in outlook. The present 
experiment tests the hypothesis that persons 
scoring high on conservatism, having chroni- 
cally mobilized inhibitions, will be less appre- 
ciative than persons scoring low on conserva- 
tism of humor involving the transparent 
expression of sex and aggression, but not of 
humor which is either purely formal or in 
which libidinal content is comparatively 
veiled. In other words, we are postulating a 
difference between liberals and conservatives 
in the proportion of the nontendentious com- 
ponent (amount of facade) which is required 
before a joke is capable of evoking humorous 
affect. 


METHOD 


The six categories of humor selected for study are 
listed, with definition and examples, in Table 1. It is 
not claimed that this classification of humor is in 
any sense comprehensive, merely that it provides a 
good sample of humor types differing along the 
“formal-libidinal” continuum. 

As the next step in the construction of a humor 
preference test, a search was made of magazines for 
cartoons which appeared to correspond fairly un- 
ambiguously to the six categories. The category defi- 
nitions were then given to four graduate students 
who were asked to sort the cartoon pool in terms 
of them, and those cartoons which were reliably 
sorted into the same category were retained. Finally, 
five cartoons corresponding to each category were 
selected to be included in the test booklet. 

The Ss were 139 high school students (75 female 
and 64 male), aged 15-19. High school students are 
well suited to a study of this kind because they 
comprise a fairly homogeneous group with respect 
to age, marital status, education, and occupation. 
The cartoon booklet was given to Ss with instruc- 
tions to rate each cartoon according to the scale: 
(1) Not at all funny, (2) Slightly funny, (3) 
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„ Example 


A carrier staggering under the weight of a grandfather 
clock is asked: “Have you got the time on you?” 

A butcher with an animal-plan erected on a music stand 
is applying his hacksaw to a side of meat in the man- 
ner of a double-bass player. 

As they pass by a hardware store, a beatnik is shielding 
his girl-friend’s eyes from the sight of a bath and 
washbasin. 

A truck-driver waiting at a stop-light is casually stub- 
bing out his cigarette on the helmet of a motorcycle | 
cop also waiting beneath him. 

A little girl is swinging joyfully on the legs of a man who 
has just hanged himself. 

A playboy attired in silk dressing gown is welcoming an 


TABLE 1 
CATEGORIES OF Humor EMPLOYED 
Category Definition 
Puns Play on the sound and meaning of 
words 
Incongruity Familiar elements appearing out of 
usual context 
Antiradical Ridicule of radical or Bohemian ele- 
ments in society 
Antiauthority | Ridicule of conventionally respected 
persons and institutions 
Sick Delight in the morbid, gruesome, and 
$ sadistic 
Sexual Participation in, or anticipation of, 
sexual pleasure 


attractive blonde. “Come in, Miss Faversham. Looks 
like you’re the only one to come to the party.” 


Moderately funny, (4) Very funny, (5) Extremely 
funny. Personality information was gathered by 
subsequent administration of the C Scale, and the 
EPI (Eysenck & Eysenck, 1964). 


RESULTS AND DISCUSSION 


Puns and incongruity jokes are fairly exclu- 
sively formal, cognitive, and respectable in 


content, while the sick and sexual categories 
constitute the outright expression and indul- 
gence of aggressive and sexual impulses, re- 
spectively. The antiradical and antiauthority 
categories can both be described as forms of 
social satire involving the ridicule of certain 
classes of persons and institutions, a type of 
indirect, interpersonal aggression. 
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Fic. 1. Comparison of humor Preference profiles for the 30 most conserva- 


tive and 30 most liberal Ss. 


(The six categories of humor are ordered along a 


dimension of “tendentiousness” on an a Priori construct basis.) 


PREDICTOR OF HuMOR PREFERENCES 
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TABLE 2 


PRODUCT-MOMENT CORRELATIONS BETWEEN CONSERVATISM SCORES AND “FUNNINESS” 
Ratincs FOR EACH OF THE Sıx Humor CATEGORIES 


Ss Puns Incongruity | Antiradical | Antiauthority Sick Sexual 
Males hog 14 —.29* —.08 —.10 —.28* 
Females .26* .08 —.20 —.03 =15 — .23* 
Total group wid 10 —.22** —.10 —.18* —.24** 


Note.—For Males, N = 64; for Females, N = 75; Total N = 139. 


Figure 1 shows a comparison of humor 
preference profiles for the 30 Ss scoring 
highest on the C scale (conservatives) and 
the 30 Ss scoring lowest (liberals). The gen- 
eral pattern of the profiles is very much as 
predicted, with conservatives tending to prefer 
the very “safe” or nontendentious types of 
humor and liberals expressing a greater liking 
for the more openly “libidinal” or tendentious 
types of humor. Using a ¢ test, these differ- 
ences are significant beyond the .01 level ex- 
cept for the incongruity and antiauthority 
categories, which are not significant. Correla- 
tions between conservatism scores and humor 
ratings on each of the six cartoon categories 
are given for males and females separately, 
and for the total group, in Table 2. 

It might be argued that a self-report tech- 
nique for measuring humor preferences is 
susceptible to the influence of social desir- 
ability, and that such a factor may have 
mediated the relationship found between con- 
servatism and “funniness” ratings. The evi- 
dence which is available, however, does not 
support this argument. The EPI “lie scale” is 
essentially a measure of social desirability or 
“faking good,” and these scores bear little 
relationship to the humor scores, the only 
significant correlation being that between the 
lie score and the pun category (r= .18, 
p < .05). In any case, there is suggestive evi- 
dence that persons scoring high on the C Scale 
tend to give more socially desirable responses 
(r= .20, p< .05), consistent with their 
general tendency to “play it safe” (Patterson 
& Wilson, in press). 

Of the other EPI personality variables, 
extraversion was significantly related only to 
sexual jokes (r= .31, p< .01), while neu- 


roticism did not relate to any of the humor 
scales. 

Although there were certain complex differ- 
ences between males and females in the humor 
ratings, these were independent of the effects 
of conservatism since neither age nor sex was 
significantly related to conservatism. 

The one deviation from the expected order- 
ing of differences between liberals and con- 
servatives, the partial reversal on the anti- 
radical and antiauthority categories, is inter- 
esting. Satire directed at radical targets (e.g. 
beatniks, artists, protest demonstrations) was 
rated higher by liberals than conservatives, 
though it is the latter who are supposedly 
antagonistic toward such institutions. On the 
other hand, there was no significant difference 
between the two groups in the expressed en- 
joyment of humor based on the ridicule of 
authoritarian institutions (e.g., Church, law) 
which ostensibly gain respect from conserva- 
tive elements. Psychoanalytically, this finding 
might be interpreted as suggesting that con- 
servatives, although superficially respectful of 
authority, are actually harboring a certain 
amount of resentment at some “unconscious” 
level, which is expressed and revealed in their 
unexpectedly high appreciation of antiauthor- 
ity humor. This expression of antiauthority 
feelings through humor preference is presum- 
ably only possible because the joke mecha- 
nism provides an effective distraction from 
the aggressive and “antisocial” content of the 
cartoons. 

Taken generally, the results of this experi- 
ment suggest that there are considerable indi- 
vidual differences in the extent to which it is 
necessary for sexual and aggressive content 
of a joke to be disguised before humorous 
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affect is evoked. In fact, with very liberal 
persons there may be little or no necessity for 
any such veiling or mitigation of “impulse 
expression.” The tendency for high-conserva- 
tive Ss to rate the purely formal humor types 
(particularly puns) higher than liberals 
might then have arisen through a process of 
compensation for low ratings on the more 
threatening categories, based on a need to 
exhibit some “sense of humor.” 
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COLLEGE STUDENTS’ MEMORIES OF THEIR PARENTS: 
A FACTOR ANALYSIS OF THE CRPBI* 


HERBERT J. CROSS? 


University of Connecticut 


The revised 18-scale form of the Children’s Reports of Parental Behavior Inven- 
tory was administered to 119 college females and 99 males, Four factor analyses 
—females’ reports on fathers, females’ reports on mothers, males’ reports on 
fathers, and males’ reports on mothers—were performed. Three factors, corre- 


sponding to Schafer’s three dimensions 


of Acceptance vs. Rejection, Psychologi- 


cal Autonomy vs. Psychological Control, and Firm Control vs. Lax Control, 
emerged. Factor structures were similar in all analyses except males’ reports 
of mothers. These differences are discussed in terms of the relative importance 
of dimensions used by students to evaluate their parents’ behavior, 


Schaefer (1965a, 1965b) has done extensive 
work on conceptualizing and measuring chil- 
dren’s reports of parental behavior. He has 
modified his two-dimensional circumplex 
model of maternal behavior (Schaefer, 1959) 
to now (Schaefer, 1965b) include three dimen- 
sions: Acceptance versus Rejection, Psycho- 
logical Autonomy versus Psychological Con- 
trol, and Firm Control versus Lax Control. 
This model of parent behavior is roughly 
similar to those independently developed by 
Roe (1957), Slater (1962), and Becker 
(1964) and seems to summarize a great deal 
of thinking and data about parents’ behavior 
toward their children. 


Much of the data on parental behavior 


has been gathered from children’s reports 
(Schaefer, 1965b; Slater, 1962). The parent- 
child relationships and the influence of the 
parent can in this way be studied through 
the offspring’s description, which enormously 
simplifies data collection. Several studies 
(e.g, Lyle & Levitt, 1955; Serot & Teevan, 
1961) have pointed up the usefulness of 
the offspring-perception technique and one 
(Schaefer & Bayley, 1967) has directly af- 
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firmed the validity of certain Children’s 
Reports of Parental Behavior Inventory 
(CRPBI) scales. 

The present study is an attempt to extend 
Schaefer’s findings (1965b) to another norm 
group and to explore the usefulness of the 
revised 18-scale form of the CRPBI. 

In order to support Schaefer’s model, these 
data should factor in a way that can be 
reasonably explained by the three dimen- 
sions of Acceptance-Rejection, Psychological 
Autonomy-Psychological Control, and Firm 
Control-Lax Control. 


METHOD 
Subjects 


The Ss were 119 females and 99 males, mostly 
freshmen and sophomores who were volunteers (for 
credit) from general psychology classes at the Uni- 
versity of Connecticut. 


CRPBI 


This form of the CRPBI consists of 18 scales, 6 
of which are based on 16 items and 12 of which are 
based on 8 items. Items consist of behaviors, for 
example, “Isn’t very patient with me,” “Says I’m a 
big problem,” “Tells me I’m good looking,” which 
S indicates are like, somewhat like, or not like the 
parent being described. These responses are scored 
3, 2, and 1, respectively, and summed to yield scores 
on individual scales. 


Procedure 


The Ss were tested in classroom-size groups. Sepa- 
rate, but virtually identical, mother and father forms 
were administered in a single session, in counter- 
balanced order. The Ss were asked to describe their 
parents’ behavior as they remembered it when they 
were 16 years of age. 
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Analysis 


Four 18 X 18 (scales) product-moment correlation 
matrices were formed; female Ss’ scores on the 
mother form, females on the father form, males on 
the mother form, and males on the father form. Each 
matrix was separately factor analyzed by the prin- 
cipal components method, after which an orthogonal 
Varimax rotation was performed, using communality 
estimates from the squared multiple-correlation co- 
efficients in the diagonals. The program (Dixon, 
1965) allows one option and that is the value which 
limits the number of eigenvalues of correlation co- 
efficients used in the analysis. With the optional 
value set at 1.00, three factors resulted in three of 
the analyses; in the fourth analysis a fourth factor 
was extracted. However, its eigenvalue was only 
1.006, it accounted for only 6% of the variance, and 
it appeared to be a specific factor, having a substan- 
tial loading only on the “Inconsistent Discipline” 
scale. Therefore, a three-factor solution was com- 
puted. The first three factors are essentially the same 
in both cases. 


RESULTS 


Table 1 presents the rotated factor load- 
ings for both factor analyses done on 
females’ reports of their mothers and fathers, 
respectively. 

With regard to the analysis of reports of 


TABLE 1 


Roratep MATRICES ror FEMALES’ REPORTS OF 
MATERNAL AND PATERNAL BEHAVIOR 


HERBERT J. Cross 


mothers’ behavior, the three factors extract 
74% of the total matrix variance. Factor I 
which accounts for 50% of the common 
variance, loads high on Acceptance, Child 
Centeredness, Positive Involvement, and A¢. 
ceptance of Individuation. This factor shows 
high negative loadings on Rejection, Hostile 
Detachment, Hostile Control, Tnstilling Per- 
sistent Anxiety, and Withdrawal of Relations 
Factor I is clearly a love versus hostility fac 
tor and it is nearly identical to Schaefer’s Ac 
ceptance versus Rejection dimension (1965b, 
p. 554). Factor II accounts for only 26% 
of the common variance and shows high posi 
tive loadings on Control and Enforcement 
with high negative loadings on Non-Enforce 
ment, Lax Discipline, and Extreme Autonomy 
and is quite similar to Schaefer’s third dimen 
sion, Firm Control versus Lax Control. Factor 
III accounts for 23% of the common variance 
and shows highest loadings on Possessivenes 
Intrusiveness, Control through Guilt, Hos 
Control, and Instilling Persistent Anxiety, aní 
is similar to Schaefer’s second dimensio 
Psychological Autonomy versus Psychological 
Control. 


Mother factors Father factors 
Scale = 
I II I h? I II II 
Acceptance 93 | —16 00 | 90 89 | —03 | —28 
Child Centeredness 82 | —18 29 | 79 89 | —06 | —06 
Possessiveness 30 | —07 82 76 64 04 53 
Rejection —87 19 ZES —71 13 59 
Control —21 69 45 73 —18 79 37 
Enforcement —45 70 27 76 —33 72 36 
pontive Involvement 8&5 | —11 15 | 76 85 01 06 
ntrusiveness 07 34 70 61 29 36 61 
Control through Guilt —51 06 70 | 74 —16 04 85 
Hostile Control —69 32 55 | 88 —43 38 71 
Inconsistent Discipline —41 | —42 40 | 50 —24 | —54 48 
Non-Enforcement 04 | —85 00 | 73 —08 | —89 | —05 
Acceptance of Individuation 88 | —27 | —06 | 85 79 | —26 | —36 
Lax Discipline k 18 | —87 TST 83. 25 | —79 04 
Instilling Persistent Anxiety —64 21 57 | 79 —41 28 71 
Hostile Detachment —88 07 09 | 79 —78 | —02 49 
Withdrawal of Relations —63 03 40 56 —39 02 64 
Extreme Autonomy 19 | —63 | -23 | 50 —07 | —65 | -12 
Percentage of common variance 50 26 23 40 28 31 


Note.—Decimals omitted. 
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TABLE 2 
ROTATED MATRICES FOR Mates’ Reports OF 
MATERNAL AND PATERNAL BEHAVIOR 
| Mother factors Father factors 
Scale 

I Ir m h I I m k 
Acceptance 03 89 | —02 80 90 | —09 | —07 83 
Child Centeredness 29 84 05 79 86 12 14 77 
Possessiveness 73 26 | —01 60 28 05 78 69 
Rejection 43 | —68 | —06 | 66 —72 03 43 70 
Control 60 12 | —60 | 73 —08 75 40 74 
Enforcement 47 | —08 | —64 64 —19 76 23 68 
Positive Involvement 15 85 01 75 88 00 12 79 
Intrusiveness 70 09 | —21 54 01 26 69 54 
Control through Guilt 80 | —22 05 70 —23 04 78 66 
Hostile Control 83 | —35 | —19 | 84 —49 26 68 78 
Inconsistent Discipline 27 | —28 61 52 —44 | —50 46 65 
Non-Enforcement —08 | —02 88 78 —07 | —85 03 73 
Acceptance of Individuation —25 79 20 72 87 | —17 | —15 82 
Lax Discipline 07 16 85 75 15, | —87 11 80 
Instilling Persistent Anxiety 81 | —12 | —06 67 —43 20 74 17 
Hostile Detachment 31 | —81 10 76 —85 04 30 83 
Withdrawal of Relations 49 | —45 23 50 —60 06 45 58 
Extreme Autonomy —25 17 77 68 07 61 27 46 
Percentage of common variance 36 37 28 43 27 29 


Note.—Decimals omitted. 


With regard to the factor analysis of 
females’ reports of father’s behavior, each 
factor is remarkably similar to those just 
described. Each scale loads at approximately 
the same magnitude except for minor dif- 
ferences which seem to indicate that, while 
Factor III is substantially the same as in the 
first analysis, the factor axis is positioned at 
a slightly different location. A configurational 
analysis of the scales in three-dimensional 
space (cf. Schaefer, 1965b) shows essentially 
the same pattern as found in the first analysis. 

Thus, an overall comparison between the 
two factor analyses indicates that they are 
quite similar in loading patterns and amounts 
of common variance accounted for by each 
factor. Further, in both analyses, the factor 
patterns are clearly analogous to Schaefer’s 
proposed dimensions, with one minor differ- 
ence. Schaefer’s Psychological Autonomy 
versus Psychological Control factor is stronger 
than his Lax Control versus Firm Control 
factor while the findings here are reversed. 

The factor structure of the males’ reports 
of father data is strikingly similar to both 
the analyses presented in Table 1. The magni- 


tude and direction of each loading is approxi- 
mately the same as in the analyses of the data 
from females. Again the overall results clearly 
fit Schaefer’s model. 

The factor structure of males’ reports of 
mothers’ behavior data is clearly different 
from the other three analyses. The first factor 
accounts for slightly less common variance 
than the first factor in the other analyses 
(36% as compared to 437%, 40%, and 50%) 
and shows high loadings on Possessiveness, 
Intrusiveness, Control by Guilt, Hostile Con- 
trol, and Instilling Persistent Anxiety. Fac- 
tor I in this case is not an Acceptance versus 
Rejection factor but rather clearly emerges as 
a Psychological Autonomy versus Psychologi- 
cal Control factor. Factor II, which accounts 
for 26% of the common variance, factors out 
as the Acceptance versus Rejection factor and, 
in fact, the loading pattern is strikingly simi- 
lar to that of Factor I in each of the other 
three analyses. Factor III, which accounts for 
22% of the common variance, shows high 
negative loadings on Control and Enforce- 
ment, and high positive loadings on Incon- 
sistent Discipline, Non-Enforcement, Lax Dis- 
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cipline, and Extreme Autonomy. Thus, it 
emerges as the Lax Control versus Firm Con- 
trol factor and is similar to Factor II in the 
other three analyses. 


Discussion 


Three of the four factor analyses presented 
here (females’ reports of mothers and fathers, 
and males’ reports of fathers) are highly simi- 
lar. In each case Factor I, the Acceptance 
versus Rejection dimension, accounts for con- 
siderably more common variance than either 
Factor II or Factor III. It is, therefore, the 
most relevant dimension for females reporting 
on their mothers, females reporting on their 
fathers, and males reporting on their fathers. 
The dimensions of Psychological Autonomy- 
Psychological Control and Firm Control-Lax 
Control are clearly secondary to Acceptance- 
Rejection. 

Mothers of males, in strong contrast, are 
evaluated primarily on the Psychological 
Autonomy versus Psychological Control Di- 
mension, and secondarily on the Acceptance 
versus Rejection dimension. These two factors 
are nearly equal in amount of common vari- 
ance they account for, however. 

An obvious interpretation of the discrep- 
ancy between males’ reports of their mothers 
and other reports is that the mother-son rela- 
tionship is more psychologically intense, that 
is, mothers are more neurotically involved 
with sons than fathers are, or than fathers or 
mothers are with their daughters. 

Even so, probably the most important over- 
all dimension is Acceptance versus Rejection 
since it accounts for substantial amounts of 
variance in each analysis. 

Some kind of dimensionalizing is probably 
useful for investigating the effects of parental 
behavior on personality, and Schaefer’s three 


dimensions seem to be a step in the right 
direction. 


HERBERT J. Cross 


This study leads to the following condu 
sions; (a) The 18-scale form of the CRP3] 
is a useful instrument for assessing yom 
adults’ memories of their parents? behavior 
(b) These reports can be conceptualized ef. 
fectively with Schaefer’s three dimensi 
Acceptance versus Rejection being the my 
relevant of the three. (c) Males’ reports o 
their mothers are based more on the dimension 
of Psychological Autonomy versus Contro 
than are males’ reports of fathers, or female 
reports of either mothers or fathers. l 
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DIFFERENTIAL THERAPEUTIC RESPONDING OF A 
AND B QUASI-THERAPISTS TO SCHIZOID AND 
NEUROTIC COMMUNICATIONS* 


JURIS I. BERZINS? ano EDWARD SEIDMAN 
University of Kentucky 


This experiment investigated the proposition that the differential effectiveness 
of “A” and “B” therapists with schizoid and neurotic patients is based on 
behavioral differences which emerge even with untrained Ss in the appropriate 
interpersonal contexts. In a 3X2 factorial design, 72 male undergraduates 


(As, ABs or middles, and Bs) responded to tape-recorded schizoid and neurotic 


communications by writing self-chosen 


“helpful” responses. Analyses of these 


responses confirmed the predicted Therapist X Patient interaction effects. When 
As were paired with the schizoid and Bs with the neurotic patient (relative to 
the opposite pairings), they gave longer responses (p < .005), emitted more 


responses of a declarative 


rather than questioning form (p < .025), and 


showed more positive (p < .025) and fewer negative (p < .05) Bales social- 
emotional reactions. The Ss intermediate on the A-B scale (ABs) did not react 


differentially to the two patient types. 


Following clinical studies which demon- 
strated that so-called “A” therapists achieved 
greater success in psychotherapy with schizo- 
phrenic patients than “B” therapists (see the 
reviews by Betz, 1967; Carson, 1967) while 
Bs obtained better outcomes with neurotic pa- 
tients than As (McNair, Callahan, & Lorr, 
1962), several therapy-analogue studies have 
been reported which yielded results consonant 
with those of clinical studies, even though the 
“therapists” were totally untrained under- 
graduate males (Berzins & Seidman, 1968; 
Carson, Harden, & Shows, 1964; Sandler, 
1965). The separation of therapists into A 
and B “types” has usually been accomplished 
by selecting extreme groups of responders to 
a small number of items taken from the 
Strong Vocational Interest Blank, the A-B 
scale, whose content ostensibly entails varying 
degrees of interest in manual-technical activi- 
ties (Bs endorse these interests; As do not). 


1This research was partially supported by a Uni- 
versity of Kentucky summer research fellowship to 
the first author, and a National Institute of Mental 
Health predoctoral research fellowship (1-F1-MH- 
36960-01) to the second author. The authors grate- 
fully acknowledge the use of the University of Ken- 
tucky Computer Center, and also express appreciation 
to John V. Haley for statistical consultation. 

2 Requests for reprints and copies of the A-B scale 
as well as of all stimulus materials should be sent 
to Juris I. Berzins, Department of Psychology, Uni- 
versity of Kentucky, Lexington, Kentucky 40506. 


While the relationship of such interests to 
performance in dyadic interactions of a 
“helpful” nature is only beginning to be 
understood, the finding that trained and un- 
trained “therapists” perform similarly with 
the schizoid and neurotic classes of patients 
suggests that the A-B variable may have 
some “basic” bearing upon personality theory 
and measurement. 

The present experiment was designed to 
investigate some features of the “natural” 
treatmentlike behaviors and reactions of 
college students varying in A-B status when 
they are exposed to communications repre- 
senting two classes of patients, schizoid and 
neurotic. A portion of the present study, 
which has been reported previously (Berzins 
& Seidman, 1968), showed that, when As had 
responded “helpfully” (in writing) to a 
schizoid patient, and when Bs had responded 
to a neurotic patient, they reported greater 
satisfaction with their responses and found it 
easier to choose helpful responses than when 
As and Bs had responded under the opposite 
conditions of pairing. These differences in 
subjective reactions, which were in line with 
expectations based on clinical studies, left 
unanswered the question: On what behavioral 
differences are these interaction effects based? 

This analogue experiment thus attempts to 
pinpoint, under controlled circumstances, dif- 
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ferences in the “helpful” behaviors of un- 
trained As and Bs. At the same time, the 
freedom given to Ss (i.e., to emit responses 
they felt would be most helpful at each point 
that the taped communications were inter- 
rupted) makes the analytic task formidable. 
Whatever technique of analysis the investi- 
gator adopts, however, should yield results 
consistent with the prediction that instead 
of main effects for therapist or patient “type,” 
Therapist xX Patient interaction effects should 
be obtained. In other words, it is expected 
that “better” responding takes place when As 
are paired with the schizoid patient, and Bs 
with the neurotic patient, than under the 
opposite Therapist X Patient pairings. 


MeEtTHop 
Design and Procedure 


The essential features of this study have been 
reported previously (Berzins & Seidman, 1968). 
Briefly, the basic design was a 3 2X2 factorial, 
using three independent groups of “quasi-therapists” 
(As, ABs or middles, and Bs), two patient types 
(AVOS, or avoidance of others, and TAS, or turning 
against the self) as representative of schizoid versus 
neurotic psychopathology (Phillips & Rabinovitch, 
1958), and two orders of presentation (TAS-AVOS 
and AVOS-TAS). This report focuses upon the 
initial combinations of patient and “therapist”; 
second-position data are explored only in a limited 
fashion. Analyses were conducted on the resulting 
3X2 factorial arrangement with cell frequencies 
randomly balanced at 12 (total N = 72). 

The data were collected from undergraduate males 
attending their usual laboratory sections in intro- 
ductory psychology at the University of Kentucky; 
Es were their usual laboratory instructors (N=8). 
After listening to each patient communication seg- 
ment, Ss were given 1 minute to give a “helpful” 
response (in writing) as though speaking directly to 
the patient. The Ss were classified into As, ABs 
(middles), and Bs on the basis of a trichotomization 
of the distribution of scores on a 15-item version of 
the A-B scale obtained in a laboratory meeting 1 
week later. 


Patient-Type Stimulus Materials 


The tape-recorded, enacted patient communications 
and their validation have been Presented in the pre- 
vious report.. Here, the first (of five) patient- 


. communication segments are Presented for illustrative 
purposes: 


3 Again, the authors thank James E. Dublin and 
Robert D. Welch for their skillful enactment of the 
TAS and AVOS scripts. 
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TAS patient, Segment 1: The thing I keep think 
ing over and over . . . I should get going but ] 
can’t . .. it’s just useless for me to try anythi 
. . . I haven’t been sleeping at all for a week.. 
well, you know, I get some sleep but not enough to 
keep me going, not what I should . . . even with 
Sally lately I’ve been acting more like a jerk, a 
phony ...and the more I think about it th 


more I get all tensed up... how long can | 
keep this up . . . I mean it just keeps going o 
and on... 


AVOS patient, Segment 1: Well I know thi 
sounds a little crazy ... and I know Im no 
crazy but. . . it seems like things happen to m 
in a different way or . . . more than other peopl 

+ + or they just don’t understand . . . I guess] 
mean wouldn't understand because I can’t tell then 
what it is... like... the other day I Wa 
shaving and I looked in the mirror and the though 

+. Sort of came to me like it was from th 
mirror, that the person in the mirror wasn't m 
at all ... was somebody who shaves every a 
like everybody ... and it was a funny feelin 
like I was him... 


The TAS communications revolved around nel 
rotic-depressive complaints, including suicidal ides 
tion, while the AVOS tape, on the whole, showed! 
mixture of schizoid and paranoid features. . 


Dependent Measures and Analyses 


It had been initially hoped that the self-chosti 
“helpful” responses of these Ss might be analyzabl 
through the application of scales developed by vat 
ous investigators for rating the content of therapi 
interactions (e.g, Rogers, Gendlin, Kiesler, & Truas 
1967; Strupp, 1957). It became apparent, howeve 
that such scales were inappropriate for the analysi 
of the written responses of untrained Ss. Accordingly 
it was decided to follow a strategy which wa 
content-free to varying degrees, in the hope that | 
would be the largely structural and formal characte 
istics of the “helpful” communications which woul 
clarify the behavioral differences, if any, among th 
therapist-types. Data were analyzed according to! 
scheme which arranged the analyses in increasil 
order of reliance upon the content of resp 
and concomitantly required increasing degrees 
inference from judges. The dependent measures welt 

1. Length of response—a simple word count wa 
obtained for each response.* f 

2. Form of response—responses were dichotomous) 
coded as “declarative formulations” or “others 
(largely questions). A declarative formulation W% 
grammatically defined as any statement which í 
or could “naturally” end in a period or exclamation 
mark, and which in whole or in part had the a 
tient as its direct referent (explicitly, e.g, using 


* Special thanks are extended to Wynona J. Moore 
Lois M. Gridley, and Peggy A. Mull for their pail 
taking transcription of the data on individual in 
cards. 
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second-person pronoun, or implicitly). This dichot- 
omy of course did not discriminate between positively 
and negatively toned communications, but coding, 
performed without knowledge of Ss’ A-B status, in- 
volved minimal inference and reliability was perfect. 

3, Interaction characteristics of response 5—the 
Bales Interaction Process Analysis (Bales, 1950) was 
employed according to standard instructions but with 
one modification—only one of the 12 Bales catego- 
ries was assigned to an S’s response during one re- 
sponse period. This procedure yielded five category 
scores per S. The Bales system is exhaustive and 
one-half of the categories permits classification on 
the basis of the affective characteristics of response. 
Scoring was performed by the writers, working 
independently and without knowledge of Ss’ A-B 
status, While only one score was assigned per re- 
sponse, note was kept of additional (overlapping) 
categories. Initial agreement between scorers was 
82%, Responses on which disagreement had occurred 
were resolved by consensus and one final score 
assigned to each response. Subsequently, the 12- 
category scores were collapsed into the four major 
classes of Positive (social-emotional) Reactions, At- 
tempted Answers, Questions, and Negative (social- 
emotional) Reactions. 

4. Content scales—three 5-point semantic-differen- 
tiallike scales were used. The scales were defined by 
three adjectives at the endpoints and comprised 
ratings of Relevance (relevant, accurate, to the point- 
irrelevant, inaccurate, beside the point), Accept- 
ance (accepting, warm, friendly-rejecting, cold, un- 
friendly), and Assertiveness (assertive, active, strong- 
tentative, passive, weak). These ratings were inde- 
pendently performed by four advanced graduate 
students in clinical psychology ê who were given 
transcripts of the patient communications preceding 
Ss’ responses, To control for possible biases associ- 
ated with judges’ own A-B status, two judges used 
were As and two were Bs. The reliabilities of the 
content ratings, calculated according to an analysis 
of variance technique (Winer, 1962, p. 124), were 
satisfactory for Relevance (rk = .70) and Acceptance 
(re=.79), while agreement on Assertiveness was 
more modest (rk =.54). The mean of the judges’ 
ratings was used as the score for each S. 


RESULTS 
A-B Status and Length of Response 


Three by two analyses of variance were 
performed on the number of words emitted 
in the “helpful” responses at each of the 
five points of interruption (hereafter termed 
trials), as well as on overall means. In the 


5 The authors are indebted to Jesse D. Geller and 
Gordon F. N. Fearn for recommending the Bales 
analyses for these data. 

6The authors wish to thank Anthony Burry, 
Michael L. Campbell, Andrew Sappington III, and 
David J. Sheskin for serving as judges in this study. 
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latter analysis, no main effects for therapist 
or patient type were found, but a highly 
significant Therapist X Patient interaction 
effect was observed (F = 6.29, df = 2/66, 
p < 005). The mean number of words per 
trial, presented in Table 1, reveal that “com- 
patible” pairings (As with AVOS and Bs with 
TAS) elicit considerably longer responses than 
incompatible pairings (As with TAS and Bs 
with AVOS). For each patient, curves plotting 
mean words across trials showed no overlap 
between As and Bs, and the interaction F 
ratios were significant for four of the five 
trials. ABs (middles) were indeed intermedi- 
ate in length of response and, more impor- 
tantly, showed no differential responsiveness 
to the patient types. 

It appears that, despite the simplicity of 
the dependent measure, the results attest to 
genuine behavioral differences in As and Bs 
as a function of the patient with whom they 
are paired. These differences were maximal 
on the very first trial, that is, after less than 
1 minute of listening to the patient (inter- 
action F = 8.55, df = 2/66, p < 001). The 
magnitude of the differences is noteworthy 
also: On the first trial with the AVOS patient, 
for example, As responded with twice as 
many words as did Bs (As: M = 21.25; Bs: 
M = 10.00). 


A-B Status and Form of Response 


For this analysis, all “helpful” responses 
were dichotomized into declarative formula- 
tions and “others.” An analysis of variance 
for the number of declarative formulations 
emitted in five trials showed no main effects 
but, as in the preceding analysis, the Thera- 
pist X Patient interaction effect was signifi- 


TABLE 1 


Mean Number or Worps EMITTED 
PER TRIAL 


Patient type 


Therapist type 
TAS AVOS 
A 17.18 21.90 
AB 20.02 20.10 
B 23.73 15.45 
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ficant (F = 4.81, df = 2/66, p < .025). The 
cell means are presented in Table 2. 

Trial-by-trial curves for each patient type 
again showed no overlap between As and Bs, 
although ABs tended to match (or occasion- 
ally exceed) the number of declarative formu- 
lations given by the “compatibly” paired 
groups. ABs thus again did not react differen- 
tially to the two patients. 

The grammatical form of the “helpful” 
response is apparently another potent struc- 
tural index of performance when As and Bs 
are exposed to the two patient types. Maxi- 
mum effects were again discerned on the first 
trial, where the TAS patient elicited declara- 
tive formulations from only 3 of 12 As but 
from 10 of 12 Bs. For the AVOS patient, 
the respective numbers for As and Bs were 
eight and two. A 3 X 2 contingency table for 
the first trial gave a chi-square value of 7.32 
(df = 2, p < .05). Had one predicted declara- 
tive formulations to be emitted in this differ- 
ential manner, 77% of As and Bs would have 
been correctly classified on the first trial; 
66% of these Ss would still be correctly 
placed across the five trials. 


A-B Status and Interactive Characteristics of 
Response 


Separate 3 X 2 analyses of variance were 
performed on the mean number of responses 
placing into each of the four major Bales 
categories in five trials. The cell means, shown 
in Table 3, indicate that regardless of thera- 
pist or patient type, relatively more responses 
placed within the Answers and Questions 
categories than in the Positive or Negative 
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TABLE 2 


Mean NUMBER OF DECLARATIVE FORMULATIONS 
Em{ıtteD IN Five TRIALS 


Patient type 


Therapist type 
TAS AVOS 
A 2.33 3.50 
AB 3.83 3.75 
B 4.08 2.08 


(emotional-social) Reaction categories, sug- 
gesting that the experimental situation elicited 
predominantly task-oriented, emotionally neu- 
tral responses from Ss. 

When responses coded as Positive or 
Negative Reactions occurred, however, sig- 
nificant Therapist X Patient interaction ef- 
fects emerged (Positive Reactions: F = 4.51, 
df = 2/66, p<.025; Negative Reactions: 
F = 3.30, df = 2/66, p < .05). These effects, 
which indicate that the judged emotional 
quality of “helpful” responses was more posi- 
tive and less negative under “compatible” 
pairing conditions, significantly extend the 
findings relative to length and form of re- 
sponse. Since the increased emission of posi- 
tively toned responses under compatible pair- 
ing conditions appears to have face-valid 
implications for the “effectiveness” of re- 
sponses (or at least for rapport-enhancing 
reactions to patients), the Bales categories 
add important information to the prior struc- 
tural indexes of differential performance. 

It can be seen that ABs here place between 
As and Bs in the “emotional” categories and 


TABLE 3 
Mean Number or Responses Copen INTO Each BALES CATEGORY IN Five TRIALS 


Bales categories 


Therapist 
type Positive Reactions Answers Questions Negative Reactions 
TAS AVOS TAS AVOS TAS AVOS TAS AVOS 
A 0.50 1.33 1.42 1.33 2.08 1.75 
i : ; $ 1.00 0.58 
AB 0,92 1.00 2.75 1.92 1.00 1.00 0.33 1,08 
B 1.50 0.50 2.25 1.33 1.00 1.67 0.25 1,50 


Note,—For each therapist-by-patient co 
and only one Bales category. z gre 


the row means sum to 5,0, because each response was coded into one 


to react similarly to the patients. 
, ABs show a borderline (p < .10) 
cy to emit more Answers, regardless of 
t type, than the other groups. Mild 
t-type main effects were observed in 
instances: the AVOS patient tended to 
more Negative Reactions but fewer 
rs than the TAS patient (both effects, 
10). 


B Status and Content of Response 


he three content scales selected, relevance, 
eptance, and assertiveness, were applied to 
‘responses given on the first trial only, 
it was here that maximal differences 
en As and Bs were obtained in the 
ses for length and form of response. It 
s hoped that these ratings would clarify 
content dimensions involved in the inter- 
ction effects. 
Table 4 presents the breakdown of the 
nt scale means by therapist and patient 
e Analyses of variance showed no main 
teraction effects on the acceptance scale; 
action effects on the other two scales, 
er, were higly significant (assertiveness: 
9.24, df= 2/66, p< .001; relevance: 
= 6.20, df = 2/66, p< 005). Thus, when 
responded to the AVOS and Bs responded 
| the TAS patient, they were rated as more 
"assertive but less relevant than As and Bs 
paired oppositely. ABs comprised the most 
tive and least relevant group of the 
ee, in apparent agreement with their 
reliance on Answers reported above. 
fain effects for therapist type on assertive- 
S, F = 3.57, df = 2/66, p< 05; on rele- 
, F = 3.44, df = 2/66, p < .05.) 


TABLE 4 


Content scales 


Relevance Acceptance Assertiveness 


Tas | Avos| TAS | AVOS| TAS AVOS 


“A | 3.25 | 248 | 3.10 | 3.29 | 3.02 | 3.58 
TAB | 233 | 242 | 308 | 3.48 3,56 | 3.60 
5 314 | 271 | 3.10 | 3.63 | 2.40 


2.48 


D "Notes = relevant, accepting, assertive. 
' 
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Correlational analyses, in which these con- 
tent ratings were correlated with five-trial 
means on the other dependent variables used 
in this study, clarified the “meaning” of the 
two content scales on which significant inter- 
action effects were observed. Ratings of asser- 
tiveness and relevance appeared most closely 
related to the dichotomous coding for declara- 
tive formulations, such that the emission of 
declarative formulations was regarded as as- 
sertive but not particularly relevant, while 
(Bales) questions were seen by judges as rele- 
vant but not particularly assertive. The third 
content scale, Acceptance, related positively to 
Bales Positive Reactions and negatively to 
Bales Negative Reactions, as one would expect 
if there were convergence between indepen- 
dent judges’ ratings of the “accepting” quali- 
ties of these responses. Apart from the accept- 
ance ratings, then, the ratings of first-trial con- 
tent appear strongly related to the “structural” 
dependent variables used in the preceding 
analyses. Ratings of assertiveness, in particu- 
lar, related to length of response (r= 35), 
declarative formulations (r= .53), Bales 
Positive Reactions (r = .32), Bales Answers 
(r = 30), Bales Questions (7 = —.40), and 
Bales Negative Reactions (r = —.11). All co- 
efficients, excepting the last, were significant 
at p < 01. 


Relationships among Measures 


To clarify the relationships among those 
dependent measures which were applied to all 
five trials, that is, response length, declarative 
formulations, and the Bales categories, an 
intercorrelation matrix (Table 5) was com- 
puted. The matrix also includes these Ss’ 
subjective reactions (Berzins & Seidman, 
1968), to see whether any structural variables 
analyzed in this study might have acted as 
“determinants” of postexperimental subjective 
reactions (satisfaction with responses and ease 
of selecting helpful responses). Inspection 
reveals that Ss’ subjective reactions are but 
modestly predictable from the dependent 
measures used here. Response length is the 
only variable significantly associated with 
both subjective reactions. 

Response length is also significantly corre- 
lated with every other variable in the matrix, 
suggesting that a general “productivity” or 
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TABLE 5 
INTERCORRELATIONS AMONG DEPENDENT MEASURES AND SUBJECTIVE REACTIONS 

Variable 2 | 3 | 4 | 5 | 6 7 | 8 
1. Response length Ai hs 27* 39** me —34* 27* sa 
2. Declarative formulations 47** 12 Te —21 p 2r 
3. Bales Positive Reactions =12 mA i ee p3 ni 
4. Bales Answers —60* mats, H e 
5. Bales Questions = hc, a 
6. Bales Negative Reactions w 
7. Satisfaction with responses 
8. Ease of responding helpfully* 


.—All coefficients are Pearson r's ; decimals have been omitted. N = 72. 
AUE Taa item, ‘which was scored in the ‘‘difficult” direction, have been reflected. 
*p <05. 


> < 0l. 


“responsivity” variable may hold the key to 
the observed relationships. Substantially posi- 
tive relationships exist between the “positive” 
indexes of performance (response length, de- 
clarative formulations, Bales Positive Reac- 
tions) but their magnitude hardly suggests 
equivalence. The dichotomization of responses 
into declarative formulations versus others, 
however, appears to be almost equivalent to 
the Bales distinction between Answers and 
Questions. It is also apparent that Questions 
and Negative Reactions are uniformly nega- 
tively related to other dependent measures. 


Further Results 


Since, in the original design, each S in this 
experiment later responded to the “other” 
patient (subsequent to the responses investi- 
gated thus far), the question of the effect of 
the initial pairings upon later responses may 
be raised. Briefly, analyses conducted for 
response length and for declarative formula- 
tions indicated that the differential behaviors 
characterizing Ss’ performance with the first 
patient continued unchanged—even though 
the “other” patient now comprised the stimu- 
lus. Trial-by-trial curves continued to show 
no overlap between As and Bs. While such 
consistencies in response are intriguing, they 
must be regarded with caution in the absence 
of information regarding more precise deter- 
minants of the behavioral differences which 
emerged initially, for example, on the first 
trial with the first patient. 


Discussion 


In interpreting the results of this study, 
it is well to note at the outset that the cate- 
gories of analysis were not selected before- 
hand, but, in the absence of appropriate 
precedents, resulted from a concern with dis- 
covering the “natural lines of cleavage,” if 
any, in the “helpful” responses of these 
untrained Ss. 

The major findings of this study were that 
when As responded “helpfully” to the schizoid 
patient, and Bs responded to the neurotic 
patient—relative to the opposite pairings— 
they emitted responses characterized by 
greater length, by declarative rather than 
interrogatory form, and by positive (social- 
emotional) overtones, It has already been 
established that these Ss, subsequent to per- 
forming in this manner, felt relatively sat- 
isfied with their performance and reported 
greater ease in selecting helpful responses 
(Berzins & Seidman, 1968). Persons inter- 
mediate in A-B status (ABs) did not react 
differentially to the two patients. The differ- 
ences in the therapeutic responses of As and 
Bs, which were marked after as little as 
1 minute of exposure to the “patients,” 
Moreover tended to persist across trials; at 
no point was overlap between As and Bs 
observed. 

For these untrained Ss, the somewhat cot 
related measures of length and form of re- 
sponse appeared to account for the major 
variations in emitted “therapeutic” responses: 
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A relatively parsimonious integration of the 
results can be evolved around an “informa- 
tion-processing” model. That is, when As and 
Bs were compatibly paired with patients (i.e., 
As with AVOS and Bs with TAS), they be- 
haved as though they had assimilated and 
processed enough information to be able to 
produce a declarative, positively toned re- 
sponse of appreciable length (which Ss felt 
to be satisfying and helpful). When As and 
Bs were incompatibly paired (As with TAS 
and Bs with AVOS), they behaved as though 
they had not obtained enough information to 
be able to formulate a helpful response; they, 
in effect, “requested more information” from 
the patient (engaged in shorter, negatively 
toned inquiries which they felt to be less 
satisfying and less helpful). Subsequent pa- 
tient communications, which in fact provided 
a great deal of relevant “information,” failed 
to modify the responses of compatibly or 
incompatibly paired Ss; declarers continued 
to declare and questioners to question. The 
initial pairing conditions thus appear to deter- 
mine how much information Ss are able to 
“process” in the interest of formulating 
helpful responses. 

What features of the patient communica- 
tions can be held responsible for the observed 
differences in response? The tape-recorded 
communications were designed to expose Ss 
to the TAS and AVOS symptom clusters in 
a naturalistic manner; there was no attempt 
to equate the tapes for dramatic content or 
“pull” at any one point. In scrutinizing the 
first segments for decisive contrasts, one clear 
difference emerges in the dysphoric affect of 
the TAS patient (which is not present in 
the blander, faster speaking AVOS patient) 
and the cognitive disorganization (disjointed 
speech, vagueness) of the AVOS patient. 
Perhaps the disjointed utterances of the 
AVOS patient are particularly troublesome to 
Bs in light of their reportedly greater need 
for cognitive clarity and field-independent 
Perceptual-cognitive style (Pollack & Kiev, 
1963: Shows & Carson, 1965) or perhaps 
Bs themselves would tend to rely upon 
AVOS-like adjustive techniques under stress 
(Berzins, Friedman, & Seidman, 1969; 
Sandler, 1965), As in turn may be dis- 
rupted by the TAS patient’s dysphoria and 
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intropunitiveness, since several studies sug- 
gest that As themselves may display depres- 
sive, intropunitive behaviors under stress 
(Berzins et al., 1969; Sandler, 1965). This 
line of reasoning suggests that the emission 
of helpful responses is impeded by Ss’ “blind 
spots” if these happen to resemble those of 
the patient. If this notion has merit, then 
the present results would argue for the efficacy 
of pairing therapists with patients on the 
basis of dissimilar reactions to stress, which 
may be partially indexed by a complementary 
(not similar) A-B status. In any case, one 
important heuristic implication of the present 
research is that since marked effects were 
observed on the very first trial, a systematic 
search for the precise cues involved in trigger- 
ing the differential “helpful” behaviors can 
be undertaken. 

In considering the generality of the effects 
observed in this study, considerable caution 
is in order, however. While it may seem that 
the structural indexes of differential perform- 
ance (especially length of response) coupled 
with Ss’ own subjective reactions to the ex- 
perience, can be interpreted as personality- 
based antecedents to differential performance 
in psychotherapy, it must be stressed that, 
unlike in actual therapy, Ss in this study re- 
ceived no feedback from the patients which 
in any way could modify their subsequent 
performance. The absence of trial-to-trial 
overlap between As and Bs (which continued 
unchanged when the “other” patient was 
introduced) must probably be interpreted as 
stereotyped responding in untrained Ss. 
The effects of first-trial pairings were sub- 
stantial, but whether such effects apply to the 
performances of practicing therapists is a 
matter for further research. It would be de- 
sirable to obtain evidence on the treatment 
performances of AB therapists to facilitate 
comparisons with As and Bs, since ABs did 
not consistently occupy an intermediate po- 
sition on the dependent measures used in this 
study. In this way, important information 
regarding the possible utilization of mental 
health volunteers and the feasibility of 
therapist-patient matching could be obtained. 

On the whole, the results of the present 
experiment suggest that the A-B variable is 
intrinsically related to variations in “helpful” 
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performance in male dyads. While the reasons 
for the predictive power of the A-B variable 
are only beginning to be understood, the 
differential responsiveness of As and Bs to 
varying forms of disordered behavior provides 
an important variance-reducing ingredient for 
the design of psychotherapy studies. 
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EXPERIMENTAL APPROACH 
SOCIAL DESIRABILITY IN 


TO DEFINING THE ROLE OF 
PERSONALITY ASSESSMENT: 


IS THERE ONE RESPONSE PROCESS OR TWO?* 


STEPHEN LEFF? anp NANCY LAMB 


Washington University 


This study was designed to 
response processes when they endorse 
their personal ideals, 


determine whether Ss engage in one or two different 


items as descriptive of their selves and 


and when they rate the items’ characteristic social desir- 


ability. The experiment used three sets of 2 X 2 factorial designs, one for each 


rating category (self, personal ideal, 


social approval and the instructional 
analyses of variance showed 
responding when they 


are asked to describe 
and when they are asked to describe the socially 


and social ideal). The Ss’ need for 
set were the variables controlled. The 


that Ss do engage in two very different kinds of 


their selves and their personal ideals, 
desirable. The first two rating 


categories were differentially influenced by the independent variables. In con- 
trast, the finding that neither of these variables was instrumental in determining 


the nature of the response in the 


third category lends support to the hypothesis 


that items do have a characteristic social desirability which is derived from 


their perceived cultural value. 


Edwards (1957), Kenny (1956), and 
others have demonstrated that in structured 
psychometric situations, the probability that 
an individual will endorse a particular item 
as descriptive of his self or his ideal can be 
predicted accurately by the location of that 
item on a continuum derived from group 
judgments of item social desirability. Edwards 
(1957) has hypothesized that this phenomena 
reflects the tendency of Ss to implicitly judge 
and respond to inventory items in terms of 
their characteristic cultural value even when 
overtly engaged in self-descriptive or ideal- 
descriptive tasks. 

Messick (1960), 
(1961), and Scott (1963), 
have hypothesized that these 
are determined by multiple 
which mirror not merely the characteristic 
cultural value of items, but also the need of 
Ss to conform or to respond in a socially 


Crowne and Stephens 
on the other hand, 
high correlations 
response sets 


1 This paper was presented as part of a symposium 
at the meeting of the Southwestern Psychological 

sociation, Houston, April 1967. The authors wish 
A express their appreciation to Malcolm D. Gynther 
or his critical comments in the preparation of this 
Manuscript. 
pe Now at Harvard Medical School. Requests for 
ha S should be sent to Stephen Leff, Massachu- 
Hct Mental Health Center, 74 Fenwood Road, 
oston, Massachusetts 02115. 


approved fashion, and other situational and 
interpersonal variables. As evidence for this 
hypothesis Messick (1960) cites his findings 
that the social desirable dimension is com- 
posed of nine identifiable factors, only one of 
which appeared to reflect the social desirabil- 
ity variable as Edwards defined it. 

The hypotheses of Messick and the others 
who share his theoretical position are based 
on the assumption that Ss engage in at least 
two very different processes when they en- 
dorse items as descriptive and when they 
rate the characteristic social desirability of 
the items, processes that may be related but 
which are by no means identical. In contrast, 
Edwards’ hypothesis is based on the assump- 
tion that when Ss endorse inventory items 
as descriptive of their selves or ideals they 
are engaging in essentially the same process 
that they follow when they make judgments 
about the social desirability of those items. 

This study attempted to further delineate 
the positions of Edwards and Messick with 
the possibility of furnishing support for either 
of them. The Ss with differing personality 
characteristics were asked to describe their 
selves, their personal ideals, and the socially 
desirable, while at the same time their under- 
standing of the Es’ hypothesis about the ex- 
periment was manipulated. It was reasoned 
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that if differing response processes were, in 
fact, involved in the separate tasks, the re- 
sponses in each descriptive category would 
be differentially affected by the personality 
and/or situational variables manipulated. 
Conversely, it was predicted that if a com- 
mon response process underlay the modes of 
responding, the responses in each category 
would not be differentiated from each other 
with respect to the effects of the independent 
variables. 


METHOD 
Subjects 


Sixty white male undergraduates ranging in age 
from 18-20 were drawn from a sample of 71 original 
volunteers. (The explanation of this reduction in 
sample size is presented in procedural considerations.) 


Instruments 


The instruments used included the Crowne- 
Marlowe test of the need for social approval (Crowne 
& Marlowe, 1964) and the Interpersonal Check List 
for descriptions of personality traits (Leary, 1957). 


Procedure 


In order to ascertain their attitudes toward the 
instructions which they were to be given, Ss were 
asked whether they thought good adjustment was 
characterized by having high ideals and living up 
to them (idealistic attitude) or by having moderate 
ideals and recognizing the areas in which one fails 
to live up to them (realistic attitude). The Es 
originally contemplated having equal numbers of 
“idealistic” and “realistic” Ss; however, the original 
S population was such that the “realistic” attitude 
was chosen by 69 of the original 71 Ss. Therefore, 
it was not possible to contrast what might be con- 
sidered personal set or orientation, and the two Ss 
who chose the “idealistic” alternative were dropped 
from the study, The sample size was then reduced 
from 69 to 60 by randomly eliminating Ss from each 
of the experimental groups with more than 15 Ss. 

Each S completed the Crowne-Marlowe test and 
was instructed as to the Es’ hypothesis: one-half of 
the Ss were informed that good adjustment was 
hypothesized to be characterized by having high 
ideals and living up to them, and the other half 
was told that good adjustment was hypothesized 
to be a function of having moderate ideals and of 
recognizing the ways in which one failed to live 
up to them. 

Following these instructions, the Interpersonal 
Check List was administered in booklet form and 
Ss were asked to complete the booklet for the 
following categories: For the first category (self) 
Ss were asked to describe the kind of person they 
were at that time; for the second (personal ideal) 
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they were asked to describe the kind of person they 
would like to be; for the third (social ideal) Ss were 
asked to describe the person they thought society 
found most desirable. 

For the purpose of data analysis this experiment 
used three 2 X 2 factorial designs, one for each of 
the descriptions Ss were asked to make (self, per- 
sonal ideal, social ideal). Instructional set (either 
“idealistic” or “realistic”) and Ss’ need for social 
approval (either “high” or “low” as determined by 
dividing the Crowne-Marlowe scores at the median) 
were the independent variables. ~ 

Item clusters (Leary’s octant scores) rather tha 
single items were used as dependent variables in 
this study to increase reliability and to facilitate 
the analysis of the processes under investiga- 
tion, Dominance-submission (Dom) and love-hostility 
(Lov) summary scores were also computed in the 
manner described by Leary, transformed into stan- 
dard scores, and converted to single summary points 
on Leary’s two-dimensional grid. Difference scores 
among the three summary points of the descriptive 
categories (self, personal ideal, social ideal) were 
then calculated for each S. These difference scores, 
along with the summary scores, were analyzed in 
a supplementary 2 X 2 analysis of variance. 


RESULTS 


Tables 1 and 2 summarize the analyses of 
variance of S’s self, personal ideal, and social 
ideal descriptions as a function of need for 
social approval and instructional set. These 
tables reflect the fact that Ss high in the need 
for social approval checked significantly more 
adjectives as descriptive of themselves in 
octants measuring affiliation and responsibil- 
ity (7 and 8) and obtained higher Lov scores 
than did those low in that need. These high 
need for social approval Ss similarly described 
their personal ideals by checking more adjec- 
tives than low Ss in Octants 7 and 8, and 
in Octant 6 (Respectful). Table 2 indicates 
that they also achieved higher Lov scores for 
the personal ideal category. 

A further inspection of Tables 1 and 2 
reveals that although the instructional set had 
a limited influence on Ss’ self-descriptions, 
there was no marked or’ specific pattern. In- 
stead, it effected primarily the way Ss de 
scribed their personal ideals. On this category 
the instructions influenced those octants mea- 
suring power or lack of power (1 and 5), 
independent strivings (2), and rebelliousness 
(4). In all cases Ss who had been told that 
good adjustment is a function of high ideals 
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TABLE 2 


SUMMARY OF THE ANALYSES OF VARIANCE OF SUBJECT’S 
SELF, PERSONAL IpEaL, AND SoctaL Inga DOM 
AND LOV Scores As A FUNCTION or NEED 
FOR APPROVAL AND INSTRUCTIONAL SET 


DOM LOV 
Source 
af) MS F df | MS F 
Self-description 
Approval (A) 1 | 244.02 | 2.82*| 1| 340.82 | 7.19%* 
Instruction (B) | 1 | 138,02 | 1.59 1 8.82 | 0.19 
AXB 1 2.81 | 0.03 1 | 268.81 | 5.68** 
Error 56| 86.61 56| 47.35 
Personal ideal 
Approval (A) 1 6.02 | 0.74 1 | 209.07 | 6.81** 
Instruction (B) | 1 0.15 | 0.02 1 4.27 | 0.14 
AXB 1 18.15 | 2.22 1 32,26 | 1.05 
Error 56 8.18 56 30.72 
Social ideal 
Approval (A) 1 0.60 | 0.05 1| 86.40 | 1.99 
Instruction (B) | 1 6.67 | 0.53 1| 26.67 | 0.61 
AXB 1 0.00 | 0,00 1 1,07 | 0.02 
Error 56| 12.48 56| 43.42 
* p <.10, 
** p <.025. 
k p < 001. 


checked significantly more of the items in 
these categories than did those who had been 
told that the moderate ideals reflected good 
adjustment. 

Finally, as Table 1 indicates, the category 
of social ideal manifested no definitive trends 
as a function of either the need for social 
approval or instructional set. In no instance 
did either of the independent variables yield 
significant differences in more than one 
octant. Thus, although significant differences 
at the .10 level were obtained as a function 
of the need for approval in Octant 4 and 
as a function of instructional set in Octant OF 
as single statistics they do not appear to be 
parts of any meaningful pattern. This inter- 
pretation is supported by the fact that, as 
Table 2 indicates, the independent variables 
had no significant effects on either the Dom 
or Lov summary scores, 

A supplementary analysis of variance of 
the difference scores among Leary’s summary 
points indicated trends approaching signifi- 
cance ($ < .10) for Ss high in the need for 
social approval to perceive their self-ratings 
as being closer to their personal and their 
social ideals than did Ss low in the need for 
social approval. The Ss who had received 
idealistic instructions described their personal 
ideals as being significantly more different 
from their social ideals (p < 05) than did 
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Ss who received the realistically oriented 
instructions. 


CONCLUSIONS AND IMPLICATIONS 


The results of this study support Messick 
(1960) and those others who hypothesize that 
when Ss judge the social desirability of in- 
ventory items (social ideal) they engage in 
a very different kind of response process than 
they do when asked to describe their selves 
or their personal ideals. When Ss gave self- 
descriptions, the need for social approval 
(which is a discriminable personality charac- 
teristic) affected their scores, but instructional 
set had little specific influence. When Ss were 
asked to describe their personal ideals they 
responded partly on the basis of this person- 
ality variable and partly on the basis of the 
instructional set they were given. When asked 
to describe the social ideal, however, neither 
of these factors, instructional set or need for 
approval, was instrumental in determining the 
nature of the responses. 

It is also true, however, that this study 
also lends support to Edwards’ (1957) hy- 
pothesis that items do have a characteristic 
social desirability which is derived from their 
perceived cultural value and which is not 
significantly affected by some situational and 
personality variables. In spite of differing 
needs for social approval and differing instruc- 
tional sets, Ss describing their social ideal 
agreed for almost all octants on the degree 
to which each was socially desirable. 

The finding that Ss’ personal ideals were 
most susceptible to the influence of situational 
variables than either self-ratings or social ideal 
ratings suggests that instructions to rate one’s 
ideal also elicit a distinctive response set. 
One suggestive explanation of this fact might 
be that when Ss describe their personal ideals 
they are, by the nature of the task, neither 
bound by the actuality of their own behavior 
nor by their perceptions of the cultural norms, 
and in the absence of these guidelines they 
become more susceptible to the influence of £. 
One implication of this finding would appeat 
to be that the influence of an Æ on the re- 
sponses of his Ss is mediated by the nature 
of the dependent variables he chooses t0 
include in his study. 


Rote oF SOCIAL DESIRABILITY 


The results of the difference scores analyses 
are consistent with the previous studies of 
Kenny (1956), which have shown that Ss 
high in the need for social approval saw their 
actual behavior as less discrepant from either 
their personal ideals or the socially desirable 
than did those low in this need. A finding 
not so predictable, which would seem to 
warrant further research, was that Ss who 
were informed that good adjustment is a 
function of having high ideals and living up 
to them tended to describe their personal 
ideals and social ideals as being more dis- 
crepant than those who received the other set 
of instructions. Perhaps these Ss took their 
instructions as stressing an “individualistic” 
orientation and counterindicating social con- 
formity. 
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ATTITUDES TOWARD DEATH: 
A PSYCHOLOGICAL PERSPECTIVE * 


HERMAN FEIFEL ? 


Veterans Administration Outpatient Clinic, Los Angeles 


As a consequence of psychology’s enhanced interest in the existential richness 
of human behavior, research in the area of death is becoming a productive 
enterprise. The influencing aspect of consciousness of death is active over the 
entire age continuum and is not restricted to the sick, aged, suicidal, or combat 
soldier. The meaning of death is multidimensional and varies not only between 
individuals but within the same person. Additionally, dying and death possess 


S ~ ir, 


psychological and social features and are not merely biological events. Our 
overall socially repressive orientation toward death promotes neurotic concerns 
about it. More intimate familiarity with death is required in our developmental 


upbringing and culture. 


Two major developments influenced psy- 
chology’s halting approach in probing atti- 
tudes toward death as a relevant variable in 
human conduct. One was psychology’s natal 
need to raise its flag independently of phi- 
losophy and ethics. Scientific respectability 
meant occupying oneself with measurable 
stimuli and responses which were repeatable 
and public. A consequence was restraint in 
scrutinizing consciousness as a datum and 
neglect of personality. The other was a waning 
of faith in the providential and sacred accom- 
panied by a lack of transcending significance 
for death, This secular turn was further asso- 
ciated with a decline in the kinship group and 
increasing social fragmentation of the family, 
a growing impersonality emerging from a 
technologically dominated society, and deritu- 
alization of grief and mourning practices—all 
fusing to exacerbate our hostility toward and 
repudiation of death. 

The events of World War II, however, and 
the ensuing press of urgent social problems 
forced psychology to look beyond its tradi- 
tional positivism and enter into the bailiwick 
of moral issues and life choices. The view 


1A version of this paper was presented at the 
University of Minnesota symposium “Death, Grief 
and Bereavement: An Interdisciplinary Evaluation,” 
Minneapolis, May 18, 1967. Preparation of the paper 
was facilitated by National Institute of Mental 
Health Grant M-2920 and the Veterans Administra- 
tion Medical Research Program. 

2 Requests for reprints should be sent to Herman 
Feifel, Veterans Administration, 1031 S, Broadway, 
Los Angeles, California 90015. 
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that a vital psychology must be rooted in 
man, not in a mathematical physics model, 
became more insistent. At the same time, 4 
legacy of the same war thrust all of us int) 
a baleful situation in which the world annihi 
lation fantasies of the schizophrenic patient! 
no longer appeared unrealistic. Physical sci 
ence has presently made it possible for us al 
to share a common epitaph. Not only the indi: 
viduality of death but posterity and social 
immortality are now menaced. 

In this junction of psychology’s waxing 
interest in the existential richness of humai 
life, and sharpened awareness of life’s tempor 
ality and recognition that all truth is germani 
to the exigencies of survival, research in the 
area of death is emerging as an authentic and 
fertile enterprise. 

Consideration of death is undoubtedly ont 
of the foremost sources of anxiety for man: 
Indeed, a major task of both religion an 
philosophy has been to grapple with its inte 
sity and complexity. Psychology, in coming 
to grips with the meaning of death for tht 
individual, asserts its responsibility to ree™ 
amine its philosophic and humanistic heritag? 
in the context of its generic ideal of science 

Investigation of attitudes toward death ani 
bereavement makes available a supplementaty 
entryway in understanding adaptational strat 
egies used in coping with pain, crisis, 2 
stress. It further affords the possibility ° 
improving treatment of the seriously sick a” 
others facing the prospect of oncoming death: 
Additionally, it extends our comprehension 0 
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how death affects the social fabric of society. 
Overriding these, however, may be our due 
recognition of the importance of the future 
in steering present behavior. It is man’s ex- 
celling capacity to conceptualize a future— 
and inevitable death—which distinguishes him 
from other species and makes myopic imita- 
tive application of a physical sciences method- 
ology to him. Anticipation guides a majority 
of our actions and expectation serves as a 
principal mediator of goal-directed and pur- 
poseful behavior. As human beings, the pres- 
ent contains not only the psychological past 
but psychological future as well. Death is an 
absent presence even before its actual arrival. 
Only with consciousness of mortality does 
concern with human fulfillment, salvation, and 
the notion of the uniqueness and individuality 
of each one of us receive complete meaning. 

The piloting force of awareness of death is 
active at all age levels. Its domain is not 
confined to the sick, aged, suicidal, or combat 
soldier. In this regard, it may be pertinent 
to emphasize that we not shut out children 
from the realities of death. The efforts of 
investigators as Anthony (1940), Maurer 
(1966), Nagy (1948), and Wahl (1958) dis- 
close that we do not protect the child by 
attempting to shield him from death. Rather, 
we hinder his emotional growth. In actuality, 
this proscription displays more of the adult’s 
or parent’s own anxieties and strong emotions 
concerning death than the child’s genuine ca- 
pacity to manage knowledge of the existence 
of death. Naturally, level of explanation, 
timing, and individual differences have to be 
appreciated. 

Until recently, the psychoanalytic con- 
Ceptualization of attitudes toward and fear of 
death as derivative events was dominant. 
Sentiments about death were essentially mani- 
festations of a more ultimate reality, that is, 
Separation anxiety or conflicts about castra- 
tion. Doubtless, such clinical displacement 
does occur. Death fears can be secondary 
Phenomena. Nevertheless, incoming data in- 
Cteasingly suggest that the reverse may be 
More to the point. Apprehensiveness over 
ely annihilation and concerns about fini- 
ude themselves assume dissembling guises. 

depressed mood, fears of loss, sundry 
Psychosomatic symptoms, and varying psy- 


293 


chological disturbances all have evidenced 
affinity to anxieties concerning death. Gillespie 
(1963), the English psychiatrist, for example, 
goes so far as to declare that fear of death 
is the most traumatic factor in fashioning 
senile psychosis. Feifel (1959) and Searles 
(1961) have both advanced the hypothesis 
that one of the intentions of schizophrenic 
denial of reality, in certain persons, may 
function as a magical holding back, if not 
undoing, of possible death. If living leads 
inevitably to death, the way not to die is 
not to live. We need to incorporate thoughts 
and emotions about death into our personality 
studies and therapeutic sights. Too frequently 
do we number their impetus by engaging 
ourselves with the symbolic as divorced from 
the real aspects of death. 

We are currently more knowing about the 
multidimensional meanings which death holds 
for people. Death can be the “gentle night” 
or “great destroyer.” It can reflect surcease 
from pain and tribulation, reunion with one’s 
family, loss of control, punishment, or loneli- 
ness. Developmental changes, cultural condi- 
tioning, religious orientation, personality 
characteristics, and level of threat, among 
others, contribute their shaping power. 

There is also the implication that attitudes 
toward death can oscillate in the same indi- 
vidual, from strong avoidance through anxious 
hope and uneasy resignation to calm accept- 
ance. The point in time one centers on can 
be crucial. In this context, Feifel (1965) has 
indicated the coexistence of contradictory 
attitudes toward death, for example, realistic 
acceptance of death and its simultaneous 
rejection in a subtle equilibristic balance in 
many seriously ill and terminally ill patients. 
He has suggested that this counterpoise seems 
to serve adaptational needs of the patient, 
allowing him to maintain communal associa- 
tions and yet organize his resources to contend 
with oncoming extinction. 

Additionally, we are discovering that treat- 
ment of the seriously ill and dying may be 
too conditioned to considerations and wants 
of the healthy rather than to the needs of the 
sick. Research findings underscore the desire 
of most patients for honest and sharing talk 
from physicans and family about the serious- 
ness of their illness (Feifel, 1959). This is in 
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strong contrast to the prominent orientation 
of physician, nurse, and family to avoid such 
confronting discussion (Feifel, 1965; Glaser 
& Strauss, 1965). The threat to professional 
narcissism and existing social scruples about 
death are obviously contributory. Reap- 
praisal of what and how we instruct our 
medical, nursing, and seminary students con- 
cerning dying, death, grief, and bereavement 
seems in order, 

Truth and what sustains one’s sense of 
reality tend to be more supporting and bene- 
ficial to the patient than deception and denial. 
We must resist disinheriting the patient psy- 
chologically and socially at the very time we 
enhance attention to his physiological require- 
ments, Approaching death is better integrated 
when the patient feels understood and can 
voice his feelings and thoughts about dying 
and death. Clinical experience demonstrates 
that when this happens, less depression, de- 
viant behavior, and blame of others show 
themselves in patients. Further opening chan- 
nels of communication tends to attenuate 
feelings of inadequacy and guilt not only in 
the patient but in the professional people 
involved in treatment and family as well. 

Some terminally ill patients, when apprised 
of their prognosis, manifest an adjustment 
equal to or better than that seen in similar 
patients who have not been so informed 
(Zinker, 1966; Zinker & Fink, 1966). Reali- 
zation of possible death appears to give per- 
mission, even freedom, to some to be them- 
selves rather than extensions or mirrors of 
other people’s values. One can learn, appar- 
ently, not only to live with enormous stress 
but to grow and change under its goad. 
Indeed, history furnishes us with numerous 
examples where recognition of impending 
death has fostered creativity rather than 
paralysis, 

Magnified cognition exists that death repre- 
sents a fact of social as well as psychological 
import. The funeral ceremony, although per- 
sonal in design, is certainly societal in conse- 
quence. A person’s death affects the emo- 
tional life of the bereaved but it also entails 
changes in the status and social position of 
the survivors, 

What are some deductions from these per- 
ceptions? Man is a creature in time and space 
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whose consciousness permits him to nullify 
their strictures. This brings into his ken the 
awareness of inexorable death as a compelling 
stimulus. As a time-ridden being, man is faced 
with the task of identifying himself with 
eternity. His most viable response will issue 
from basic philosophical, religious, or psycho- 
logical deliberations about death already jn/ 
his possession. The agony of selfhood is not 
endurable for most of us in pondering death 
without resources, be they transcendental, 
inspirational, or existential. We pay expen: 
sively for the taboo we affix to the subject 
of death. Closer psychological familiarity with 
death is called for in our developmental up 
bringing and culture. Our socially repressive 
outlook encourages neurotic anxieties about 
death. The evolution of an ars moriendi priot 
to the advent of death is the charge. 

Cutting across all the populations the a 
thor has studied (Feifel, 1959, 1965) is the 
view of death as the extreme abomination it 
man’s experience. Despite invocation of denial 
and a host of other strategems, fear of fink 
tude is never quite hushed. Withstanding all 
jeremiads and alienation, in the face of death 
the vast majority in all the author’s groups 
that is, terminally, seriously, chronically, ant 
mentally ill, as well as normal subjects, assess 
the world as good and not to be ceded if 
possible. 

Man has a legitimate need to face away 
from death. In truth, who is to say that 
under certain conditions this may not be 
salutary? Unfortunately, excessive camouflage 
and expulsion of the notion of death prevail 
in the United States leading to a falsification 
of the essence of man. If we accept death 
as a necessity rather than strive to demote it 
to the level of accident, there may be less 
need to project fear of death outside our- 
selves. This might possibly mute some of the 
violence of our times. Energies now bound up 
in continuing attempts to shelve and repress 
the concept of death would be available to 
us for more constructive aspects of living- 
perhaps even fortify man’s gift for creative 
splendor against his genius for destruction. 
Forster (1962, p. 394) has commented pene- 
tratingly that “death destroys a man but the 
idea of death saves him.” 

Finally, what of psychology’s more specific 
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assignment in the area of death? Psychothera- 
peutic functioning and models of personality 
and psychopathology need amplified represen- 
tation of the future and death in their hori- 
zons. Findings suggest that one’s philosophy 
of life and death lies at the nexus of meaning, 
value, and personality. 

The thrust of empirical attack requires 
extension. Theoretical expressions must be 
more rooted in the reasonably precise bases 
of systematic observation and experimenta- 
tion. It may be that psychology is particu- 
larly favored to be enlightening here since 
some feel that it is a sector where humanistic 
and physicist-engineer cultures, too often 
insulated and at cross-purposes with one an- 
other, intersect. Clearly, psychology’s own 
imperative for interdisciplinary perspective 
should not be minimized. 

Methodology will have to capture, in more 
astute form, the manifold meanings of death 
not only between individuals but within the 
same person. We must tighten our grasp of 
the bonds existing among verbally expressed 
attitudes toward death, fantasy notions, and 
below-the-level-of-awareness ideas. As in other 
ateas, we have the bedeviling challenge of 
making constructs, such as fear of death, 
operationally definable yet integrating the 
cogency of such explanatory concepts as 
purpose, anxiety, redemption. 

The pivotal undertaking for psychology has 
been alluded to by Murphy (1959, p. 320): 
“Instead of looking at the grim skeleton and 
crossbones and being told this is reality, let 
us study the observer himself, regard him as 
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the reality and ask why skulls and crossbones 
are the chief realities for im.” 

Only by encompassing the concept of death 
into his life will man fully understand him- 
self—and only the culture which countenances 
death will truly savor life. 
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SENSING HUMOR: 


LATENCY AND AMPLITUDE OF RESPONSE 
RELATED TO MMPI PROFILES* 


DOUGLAS S. HOLMES 2 


Stockton State Hospital 


Forty-eight male Ss responded to cartoons by (a) sorting one set into four 
equal piles ranging from Most Funny to Least Funny, and (b) reacting im- 
mediately to each cartoon of another set by manipulating a knob. The Ss were 
divided, according to their MMPI profiles, into PPd (highest score on the 
Psychopathic Deviate scale), and NPPd (highest score on some other scale). 
Consistent with expectations, PPd Ss preferred sexual (and hostile) humor 
more than NPPd Ss, preferred sexual and hostile humor more than nonsense 
humor, and responded to all cartoons more quickly than NPPd Ss. The im- 
mediate reaction measures of latency and amplitude appear to hold promise 


for future research on humor. 


Lack of systematic knowledge concerning 
humor, rather than lack of relevance, prob- 
ably accounts for the infrequency with which 
humor is discussed in contemporary psycho- 
logical literature. Only once in the 18 years 
of publication of the Annual Review of Psy- 
chology has humor been mentioned. On that 
occasion, Jenness (1962) commented: “Sense 
of humor, long respected as a personality 
variable, has proved elusive as an object of 
research [p. 506].” Perhaps its elusiveness in 
the laboratory is a function of our devices of 
detection rather than a function of the esoteric 
nature of humor, Allport (1954) implied as 
much: “There is... one... trait that, 
up to now, has defied successful psycho- 
logical study, viz. sense of humor... its 
accurate measurement is beyond the present 
competence of psychology [p. 437].” 

Throughout the years there have been re- 
markably few variations in the techniques of 
measuring response to humor. In general, the 
statement of Flugel (1954) remains valid 
today: “Experimentation has . . . taken the 


1 This study was supported in part by a grant 
from the California State Department of Mental 
Hygiene. The author wishes to express his apprecia- 
tion to Dianna Minnehan for her assistance in data 
collection and analysis. 

2 Requests for reprints should be sent to Douglas 
S. Holmes, who is now at Smith Richardson Founda- 
tion, P. O. Box 3265, Greensboro, North Carolina 
27402. 


form of the presentation of humorous stimuli 

. the subject being asked to rank the 
items in the order of funniness, or to give 
them marks in accordance with a predeter- 
mined scale [p. 709].” 

It seems clear that more sensitive measures 
of response to humor would be useful to those 
studying humor. Accordingly, the primary 
purpose of this study was to try out an in- 
novation in measuring an S’s response to 
humor. Instead of relying totally on tradi- 
tional, judgmental rating responses of humor 
preference, measures were obtained which 
more closely approximate the perceptual 
process of humor-being-sensed. 

In order to provide a context for the in- 
novation in measuring sense of humor, cat- 
toons were categorized into sex, hostility, and 
nonsense; Ss were divided into two groups, 
one with psychopathic tendencies and one 
without such tendencies; and mean cartoon 
scores were compared between groups. Pre- 
vious research and theory suggested that 
personality correlates would be found using 
this procedure (Dahlstrom & Welsh, 1960; 
Kambouropovlov, 1926; Luborsky & Cattell, 
1947; Tollefson & Cattell, 1963-1964). The 
present report describes some differences i? 
Tesponse to humor between the two groups, 
discusses implications of the obtained differ- 
ences, and discusses the potential value of the 
new measures that were used, 
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METHOD 
Subjects 


Male psychiatric technicians and technician trainees 
who had worked at Stockton State Hospital for 7 
years or less were asked to participate during work- 
ing hours. Virtually all were willing. Scheduling 
was feasible for 48 men, all of whom were high 
school graduates. 


Conditions 


The Ss were divided according to the peak scores 
they obtained on the MMPI. Fifteen Ss with a peak 
score on Pd and five Ss with a peak score on Ma 
followed by a secondary peak on Pd formed the 
PPd group (Peak Psychopathic Deviate). The 28 
remaining Ss formed the NPPd group (Nonpeak 
Psychopathic Deviate). The major peak scores of Ss 
comprising the NPPd group were Hy (N=8), Sc 
(N=7), and Ma (N=6). Nine of the PPd Ss 
and 12 of the NPPd Ss had at least one T score 
of 70 or above. Table 1 presents means and standard 
deviations for the two groups on three matching 
variables and indicates that the groups did not differ 
in age, vocabulary level (measured by the 40-item 
Shipley-Hartford Scale), or level of manifest anxiety 
(measured by the 50-item Taylor MA scale, Scale 16 
in Appendix I, Dahlstrom & Welsh, 1960).* 


Cartoon Stimuli 


Three clinical psychologists rated independently the 
content of hundreds of cartoons. The variety of 
cartoon sources ranged from the New Yorker 
Twenty-Fifth Anniversary Album (1951) to Best 
Cartoons of the Year 1965 (Lariar, 1965) to College 
Laughs (1964). From the cartoons which were cate- 
gorized with perfect agreement, E selected 40 cartoons 
With hostile content, 40 cartoons with sexual content, 


Following customary procedure, Scales Mf and 
Si were excluded from consideration as peak scores. 
The PPd group contained Ss with two different pro- 
files because, in a normal population, an MaPd 
Sequence is commonly thought to be essentially the 
same as a PdMa sequence (Dahlstrom & Welsh, 
1960). Examination of PPd profiles indicates that 
the group as a whole is most accurately described. 
as consisting of normal individuals with psycho- 
pathic tendencies. A majority (11) of the Ss in PPd 
had a PdMa or MaPd profile. Normal persons with 
these profile patterns are expected to “show clear 
Manifestations of psychopathic behavior [Dahlstrom 
Welsh, 1960, p. 192].” Considering only the first 
digit of the profile, “peak scores on Scale four (Pd) 
almost without regard to elevation of the profile 
[provide] evidence of lack of social conformity or 
self-control and a persistent tendency to get into 
scrapes [Dahlstrom & Welsh, 1960, p. 188].” 

‘Tt was important for anxiety level (Doris & 
Ferman, 1956) and vocabulary level (Holmes, 
1968) to be controlled because of previously demon- 
Strated relationships with humor of the type used in 
this study. 
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TABLE 1 
Mean AGE, VOCABULARY, AND MANIFEST 
Anxiety or PPd anp NPPd Ss 
PPd NPPd 
Item 
M SD M SD 
Age 32.50 | 11.40 | 31.11 | 8.97 
Vocabulary 31.55 3.36 | 31.07 | 3.97 
Manifest anxiety | 10.10 | 4.83 9.46 | 6.33 


Note.—For PPd, N = 20; for NPPd, N = 28. 


and 30 cartoons with nonsense content. In addition, 
E inserted 10 cartoonlike stimuli which were out of 
context and not intended to be funny when in isola- 
tion. These stimuli were introduced in order to keep 
S honest in the Immediate Reaction situation, The 
cartoons were split randomly into two equivalent 
forms, A and B, with some attempt made to sepa- 
rate cartoons with highly similar specific content. 
Form A and Form B each had 20 cartoons with 
hostile and 20 with sexual content, 15 cartoons with 
nonsense content, and 5 cartoonlike stimuli that were 
not funny. 


Categorization Measures 


Four by six photographic reproductions of the 
stimuli were printed and each was encased in trans- 
parent plastic. Following precedent (Byrne, 1956), 
E requested Ss to sort the cartoons into four piles 
of 15 each, ranging from Most Funny, to Next, to 
Next, to Least Funny. Order of the cartoons was 
randomized for each S. Initial measures were ob- 
tained by summing the scores for each content area, 
with Most Funny=4, Next=3, Next=2, and 
Least Funny = 1. 


Immediate Reaction Measures 


A standard 35-mm. slide of each stimulus was ob- 
tained for use in a Kodak Carousel projector. The 


-S was seated in a dark room in front of a projection 


screen and was asked to rest his right arm on a 
table of appropriate height. A small box containing 
one large protruding butterfly-shaped knob and one 
push button was placed on this table. After S was 
told to push the button in order to have a slide 
projected through a small window, E left the room. 
A series of 12 slides, including some practice cartoons 
similar to those contained in Forms A and B, intro- 
duced S to the mechanics of turning the knob on a 
continuum of 180 degrees in order to register his 
appreciation of the cartoons. The S had to return 
the butterfly knob to the zero level before pushing 
the button for the next slide. Following this, $ was 
instructed to turn the knob an appropriate distance 
as soon as he got the point of a cartoon, then switch 
to the next cartooon as soon as he was finished with 
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the previous one. Since some stimuli were not funny 
at all, he was told simply to switch to the next 
cartoon without turning the knob whenever he 
thought a stimulus did not contain any humor. After 
completion of the 12 training slides, either Form A 
or Form B was presented, with S totally controlling 
the rate of cartoon presentation. Order of the car- 
toons was random, with the restriction that three 
hostility, three sex, and three nonsense or not-funny 
stimuli occur in each consecutive set of nine. Once 
order had been determined, it remained constant 
throughout the study. 

A continuous recording of S’s responses was ob- 
tained in the slide projector room by means of a 
Heath Model EU-W 20 strip chart recorder moving 
at 12 inches per minute. Movement of the push 
button, which activated the projector only when the 
butterfly knob was at zero level, was indicated on 
the chart paper by movement of the recorder pen in 
a negative direction. 

Two measures were obtained in the Immediate 
Reaction situation, Latency of response refers to the 
time between presentation of a cartoon and reaction 
to it. Amplitude of response refers to the distance 
the knob was turned by S, with the two poles of the 
continuum being zero for not funny and 80 for 
very funny. 

Responses to the not-funny stimuli are not in- 
cluded in the Results section. For the most part, 
these stimuli were seen as not funny and thereby 
served the purpose of compelling S to respond to 
each cartoon as a separate stimulus. 


Procedure 


The Ss were scheduled in groups of two to six 
each, tested individually, and then scheduled for a 
second session to be conducted about 1 week later. 
Each of the techniques, Immediate Reaction and 
Categorization, plus each of the forms, A and B, 
were exposed to Ss in an alternate manner: one 
technique and one form on the first day, the other 
technique and form on the second day. Several tests 
besides the MMPI and Shipley-Hartford Vocabu- 
lary List were administered. Each session lasted 
approximately 14 hours. 


Transformations of Data 


For each S, mean responses to sex, hostility, and 
nonsense cartoons were summed and the Proportion 
of that sum contributed by sex, hostility, and non- 
Sense means was calculated. The proportions, which 
underlie the analyses of amplitude and categoriza- 
tion responses reported in Table 3, represent 
“ipsative” rather than “normative” Measures (Cattell, 
1944), Although the distinction between “ipsative” 
and “normative” measures is important to person- 
ality theorists and diagnosticians interested in humor 
(Tollefson & Cattell, 1963-1964), it is largely ir- 
relevant in the present study because overall liking 
for the cartoons used appears to be the same for 
PPd and NPPd Ss (Table 2, normative data), 
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RESULTS 


Table 2 presents a summary of latency te. 
sponses to the 55 cartoons containing humor, 
Nonsense cartoons were responded to more 
quickly than other cartoons, perhaps because 
there were relatively fewer cartoons that 
required reading in the nonsense category. All 
cartoons were responded to more quickly by 
PPd than by NPPd Ss. On the average, PPd 
Ss responded more than 14 seconds faster 
than WPPd Ss. This relatively quick response 
time is not attributable to hypomania on the 
part of the peak-Ma Ss included in PPd be- 
cause the mean of the five peak-Ma Ss in this 
group was higher than the mean of the 
remaining 15 PPd Ss. 

Although measures of cartoon comprehen- 
sion were not obtained, it seems likely for two 
reasons that PPd Ss actually saw a humorous 
point—not necessarily the only point—to the 
cartoons more rapidly than VPPd Ss, First, 
all Ss reported turning the knob as soon as 
the point to a cartoon was seen, and second, 
the obtained difference of more than 1} 
seconds is far greater than would be expected 
on the basis of reaction-time differences in 
initiating a psychomotor act (turning the 
knob) in response to a signal. It has not been 
demonstrated, however, that the obtained 
differences in latency of response reflect com- 
prehension of specific points in the cartoons 
or that the obtained differences are specific 
to humorous stimuli. They may relate instead 
to more general response tendencies, to stimu- 
lus characteristics not uniquely humorous, 
or to aspects of the measurement situation 
per se that differentially influence PPd and 
NPPd Ss. 

Table 2 also summarizes amplitude re- 
sponses obtained in the Immediate Reaction 
situation. The nonsignificant main effect indi- 
cates that there is no evidence to suggest that 
PPd and NPPd Ss differ in the overall extent 
to which they appreciate humor. The signifi- 
cant interaction effect, however, indicates 
that PPd and NPPd Ss differ in their appreci- 
ation of particular types of humor. Examina- 
tion of the mean scores reveals that PPd 5s 
appreciated sex and hostility cartoons more 
than nonsense cartoons, with the difference 
between sex and nonsense cartoons being stë- 
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TABLE 2 
IMMEDIATE REACTION RESPONSES TO HUMOR 
Amplitude Latency 
Source df 
MS F |MS F 
Between Ss 
PPd, NPPd 1| 193.69 | .43 | 95.19] 8.05** 
Ss within groups | 46 | 446.90 11.82 
Within Ss 
Sex, hostility, non- 
sense cartoons 2| 13.97) .20 | 9,83] 15.60%" 
PPd, NPPd X Sex, 
Hostility, Non- 
sense Cartoons 2 | 250.61 | 3.64** | 1.54| 2.44* 
Residual 92 | 68.89 63 
Mean amplitude 
Sex Hostility | Nonsense 
34.85 33.95nb 28,900 
28.938 29.93,» 31.794 
Mean latency 
PPd (N 6.42ab 6.63» 6.070 
NPPd (N 8.404 8.324 7.340 


Note.—For amplitude and latency considered separately, 
with Sm = VResidual/20, cells containing the same subscripts 
are not significantly different from each other at the .05 level by 
pan multiple-range test (Duncan, 1955). 

<.10, 

** p <.05, 

wap <01. 
wtp < .001. 


tistically significant by range test. NPPd Ss 
appear to have a more balanced appreciation 
of the three cartoon types. 

Mean scores in Table 3 highlight the pat- 
tern of differences revealed by the significant 
interaction effect in Table 2.5 (Sex and hostil- 
ity scores were combined in Table 3 because 
inspection of means and standard deviations 
tevealed only trivial differences between them. 
For sex and hostility, respectively, mean 
amplitude proportions were .354 and .350 for 
PPd, and 320 and .334 for NPPd; mean 
categorization proportions were .349 and .346 
for PPd, and 333 and .342 for NPPd.) In 
both Immediate Reaction and Categorization 
situations, PPd Ss preferred sex plus hostility 
cartoons significantly more than nonsense car- 
toons. No particular preference was exhibited 
by NPPd Ss, with the direction of statistically 
insignificant differences being reversed in the 


5 Note that main effect differences between PPd and 
NPPd in Table 3 are artifacts caused by dividing 
Sex plus hostility by two in the presence of differ- 
ences in PPd, NPPd preferences for sex, hostility, 
and nonsense cartoons. Interpretations of Table 3 in 
the text have taken this into account. 
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two measurement situations. The pattern in 
the data, then, is that the preference for sex 
plus hostility cartoons shown by PPd Ss is 
accompanied, in the absence of a preference 
by NPPd Ss, by differences between PPd and 
NPPd Ss in cartoon preference. Statistically 
significant differences are that PPd Ss enjoyed 
nonsense cartoons less than NPPd Ss; as indi- 
cated in the amplitude proportion data 
(Table 3), and enjoyed sex cartoons more, 
as indicated in Table 2. 

Although, as a group, PPd Ss appreciated 
sex and hostility cartoons to an equal extent, 
individual preferences in the PPd group are 
related to variables measured by the Mf 
(masculinity-femininity) scale of the MMPI. 
For the PPd group, the Pearson product- 
moment correlations between the Mf scale of 
the MMPI and preferences for sex cartoons 
are —.47 for Immediate Reaction (ipsative 
data) and —.40 for Categorization. A cor- 
relation of —.44 is significantly different from 
a zero correlation at the .05 level of confi- 


TABLE 3 


CATEGORIZATION AND AMPLITUDE (PROPORTIONS) 
RESPONSES TO HUMOR 


Categorization Petar 
Source df 
MS P MS F 
Between Ss 
PPd, NPPd 1 679.50 | 2.44 3,565.04 | 6.43* 
Ss within groups | 46 279,04 554,59 
Within Ss 


Sex plus hostil- 
ity, nonsense 
cartoons 

PPd, NPPd 
X Sex plus 
Hostility, 
‘Nonsense 
Cartoons 

Residual 


4,017.10 | .96 


15,478.76 | 6.53** 


5,743,58 | 2.42 
2,370.56 


32,681.38 | 7.80%" 
4,191.35 


LTS 


Mean categorization proportions 


Sex plus Nonsense 
hostility 
PPd (N = 20) 348b 304a 
NPPå (N = 28) 338b 32 5ab 
Mean amplitude proportions 
PPd (N = 20) 352 2960 
NPPd (N = 28) +327ab «346b 


Note.—For categorization and amplitude considered separ- 


ately, with Sm = VResidual/20, cells containing the same sub- 
script are not significantly different from each other at the .05 
level by Duna multiple-range test (Duncan, 1955). 

< 05. 


wp <01. 
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dence. The interpretation is that PPd Ss who 
have more masculine interests prefer sex 
cartoons more than hostility and nonsense 
cartoons. With NPPd Ss this does not appear 
to be true. The comparable correlations for 
the VN Pd Ss are —.12 for Immediate Reac- 
tion and —.16 for Categorization. These small, 
nonsignificant correlations suggest that the 
relationship between masculine interests and 
appreciation of sexual cartoons is largely or 
totally confined to the PPd Ss. 


DISCUSSION 


Since it is commonly assumed that hostile 
and sexual impulses are expressed in hostile 
and sexual humor, the cartoon stimuli in this 
study were initially categorized into sex, hos- 
tility, and nonsense. The five major substan- 
tive findings are the following: (a) PPd and 
NPPd Ss, overall, appreciated humor to an 
equal extent; (b) PPd Ss preferred sexual 
and hostile humor more than nonsense humor; 
(c) PPd Ss preferred sexual (and hostile) 
humor more and nonsense humor less than 
NPPd Ss; (d) PPd Ss with strong masculine 
interests preferred sexual cartoons more than 
PPd Ss with less strong masculine interests; 
(e) PPd Ss appeared to see the point to all 
cartoons more quickly than did NPPd Ss. 
These findings with PPd Ss fit well with 
straightforward expectations of behavior de- 
rived from knowledge of psychopathic persons 
who are not experiencing situational anxiety. 
Psychopathic persons are generally described 
as shallow, glib individuals who lack inhibi- 
tion and seek immediate gratification of im- 
pulses (Coleman, 1956). It is not surprising, 
therefore, to discover that Ss with a psycho- 
pathic tendency appear to respond quickly 
or glibly to humor, prefer humor which in- 
volves impulse gratification, and uninhibitedly 
prefer heterosexually relevant humor more if 
they are actually more interested in the op- 
posite sex. The results should be generalized 
cautiously, however, because psychiatric tech- 
nicians have been infrequently studied and it 
may be that their population characteristics 
interact somehow with the variables measured 
in the present study. 

Other investigators who have used similar 
stimulus materials have also discovered posi- 
tive correlations between strength or expres- 
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sion of impulses in a humorous context and 
strength or expression of such impulses in a 
nonhumorous context. The variety of situa- 
tions studied ranges from observation of 
actual aggressive behavior (Byrne, 1956) to 
aggressive sentiments expressed on paper-and- 
pencil tests (Hetherington & Wray, 1964; 
Murray, 1934) to content of TAT stories 
(Grziwok & Scodel, 1956) to experimentally 
aroused hostility (Dworkin & Efran, 1967; 
Strickland, 1959). It is also becoming evident 
that high inhibition concerning aggression 
interferes with apparent enjoyment of hostile 
humor. Gollob and Levine (1967) and Singer, 
Gollob, and Levine (1966) demonstrated that 
experimental arousal of aggressive inhibition 
decreases preference for aggressive humor. 
Hetherington and Wray (1964) demon- 
strated that dissolving inhibition in liquor 
increases preferences for hostile humor in 
high-aggression, high-inhibition Ss. 

The results of the present study, in which 
high-impulse, low-inhibition (PPd) Ss re- 
sponded more appreciatively to impulse- 
related humor, are consistent with what would 
be expected from the studies mentioned above. 
Furthermore, obtained differences between 
PPd and NPPd Ss suggest that it may be 
necessary to distinguish between these two 
kinds of Ss when attempting to understand 
the dynamics of humor. Perhaps, for PPd Ss, 
responding to humor is simply one of many 
alternative ways to act out impulses. In the 
absence of deep-rooted conflicts concerning 
acting out, there would be little reason for 
humor to serve as an outlet for bottled-up 
emotions. With WPPd Ss, for whom acting out 
in general is more difficult, responding to 
humor may be one of the few available ave- 
nues of expression. If so, complex and subtle 
patterns of psychodynamic determination may 
pertain to WPPd Ss, but not, under ordinary 
circumstances, to PPd Ss. 

The attempt to measure humor-being- 
perceived worked successfully in the present 
study. The Ss readily accepted the Immediate 
Reaction situation. With only one exception, 
in response to an open-ended question, Ss first 
commented on the cartoons or on their own 
sense of humor before commenting on the 
response task. They appeared to pay atten- 
tion to the stimuli rather than to the appa- 
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ratus or the measuring technique. Hence, it 
would seem that neither elaborate warm-up 
procedures nor “special” laboratory environ- 
ments are necessary in order to obtain reason- 
able measures of humor-being-perceived. The 
rapport usually established to obtain an S’s 
cooperation apparently is sufficient. 

Measures of humor-being-perceived appear 
to possess a substantial potential for use in 
laboratory research. The type of measurement 
situation used in Immediate Reaction lends 
itself well to systematic laboratory study by 
means of tightly controlled manipulation of 
stimulus, environmental, and S variables. For 
example, stimuli might be varied in their 
complexity, content, and subtlety; social en- 
vironment might be varied by introducing the 
illusory presence of other Ss through tape- 
recordings; emotional states might be manipu- 
lated by visual or auditory presentation of 
nonhumorous material. 

Measures of humor-being-perceived also 
appear to possess potential value for applica- 
tion. As a measure of emotional functioning, 
latency might reveal perceptual sensitization 
or blocking in areas of conflict or vulnerabil- 
ity. As a measure of intellectual functioning, 
latency might reveal speed of cognitive proc- 
essing as a function of stimulus complexity. 
In both applications, with Ss kept honest by 
presentation of not-funny cartoonlike stim- 
uli, amplitude measures would provide an 
indication of whether or not S actually saw 
the point. Amplitude measures also, of course, 
would indicate degree of preference for dif- 
ferent stimuli. If knowledge of the contribu- 
tion of emotional and intellectual factors were 
available, then instruments might be devel- 
oped which would be relatively free from the 
effects of test anxiety, test-taking styles, and 
test-wise Ss. 
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PSYCHIATRIC HOSPITAL EXPERIENCE AND ATTITUDES 
TOWARD “MENTAL ILLNESS” 


JEAN J. SMITH: 
Kings View Hospital, Reedley, California 


Prior to and following psychiatric hospital experience, an experimental group 
composed of patients, student nurses, and hospital employees was tested with 
a five-factor attitude scale, Opinions about Mental Illness. The group showed 
significant decreases in attitudes of authoritarianism and social restrictiveness 
toward mental patients. Attitude changes appeared not to be related to differ- 
ences in indoctrination or time spent in the hospital. All significant attitude 
changes in the experimental group were in the direction of greater conformity 
to the attitudes of professional treatment staff or greater favorableness. A con- 
trol group showed a significant decrease in belief in interpersonal etiology, but 


otherwise showed no significant changes. 


Earlier studies have shown that during 
psychiatric hospital experience, attitudes of 
student nurses toward the “mentally ill” 
change in a favorable direction (Hicks & 
Spaner, 1962) and those of patients change 
toward conformity to the attitudes of the 
treatment staff (Manis, Houts, & Blake, 
1963). Since Struening and Cohen (1963) 
have analyzed attitudes toward “mental ill- 
ness” into five factors, it is of interest to 
determine whether changes in attitudes are 
uniform on all five factors or are selective. 
The first factors are described as follows by 
Cohen and Struening (1962): 


Factor A—Authoritarianism. ...a gestalt made 
up of authoritarian submission and anti-intraception 
with a view of the mentally ill as a class inferior 
to normals and requiring coercive handling. 

Factor B—Benevolence. . . . a kindly, paternalistic 
view towards patients whose origin is in religion and 
humanism rather than a scientific or professional 
dogma. It is encouraging and nurturant, but still 
acknowledges some fear of patients, 

Factor C—Mental Hygiene Ideology. . . . the idea 
that mental patients are much like normal people, 
differing from them perhaps in degree, but not in 
kind, in sharp contrast with the Factor A orientation. 

Factor D—Social Restrictiveness. . . . the belief 
that mental illness is a threat to society which must 
be met by some restriction in social functioning 
both during and following hospitalization. 

Factor E—Interpersonal Etiology. ...a belief 
that mental illness arises from interpersonal experi- 
ence, particularly deprivation of parental love and at- 
tention during childhood . . . or more generally the 
mental health of those in loco parentis... . Some- 


1 Requests for reprints should be sent to Tif: 
Smith, Kings View Hospital, P. O. Box 631, Reedley, 
California 93654. 
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what less central is a belief that abnormal behavior 
is motivated; e.g., mental illness is an avoidance of 
problems . . . [pp. 352-355]. 


Lewis and Cleveland (1966) indicate that 
student nurses, following an 8-week psychi- 
atric experience, exhibited attitudinal changes 
on Cohen and Struening’s scales. They found 
a significant decrease in Authoritarianism, and 
increases in Mental Hygiene Ideology and 
Interpersonal Etiology. No consistent changes 
were observed for Benevolence or Social 
Restrictiveness. 

The purpose of the present study was to 
investigate further the specific nature of 
change in the attitudes of several groups after 
varying lengths of exposure to psychiatric 
hospital experience, 


METHOD 
Subjects 


Three experimental subgroups and two control 
subgroups were included, totaling 129 experimental 
and 103 control Ss. One experimental subgroup con- 
sisted of 65 patients consecutively admitted, for the 
first time, to a small, church-sponsored, private 
psychiatric hospital whose therapeutic community 
treatment approach maximizes interpersonal contact 
at all levels, This patient subgroup was heterogeneous 
with respect to diagnosis, education, cultural back- 
ground, and religious orientation. The median age 
of the group was 38 years, and the range was 16-67 
years. Length of hospitalization varied from 3 to 
34 weeks, with a median of 10 weeks. Excluded as 
Ss were patients who had previous admissions, who 
were too disturbed at the time of admission to CO- 
operate in giving data, and those from whom it was 
impossible to obtain posttest data, usually because of 


abrupt departure. These constituted 30% of the 
total sample. 


ATTITUDES TOWARD MENTAL ILLNESS 


A second experimental subgroup included 33 stu- 
dent nurses who underwent a psychiatric affiliation 
in the same hospital. All student nurses spent the 
same number of days in the hospital, although for 
some the experience was spread over a 16-week 
period, while for others it was 8 weeks, Their ages 
ranged from 20 to 40 years, with a median of 21. 

The third experimental subgroup consisted of 31 
conscientious objectors and Mennonite Voluntary 
Service workers (VS) employed as ward personnel 
by the hospital. These employees spent 1-2 years at 
the hospital, depending upon the nature of their 
assignments under Selective Service or Voluntary 
Service. Their age range was 19-35 years; the median 
was 22. The median age of the total experimental 
group was 23 years with a range from 16 to 67. 

The control group included 81 students enrolled 
in an introductory psychology course at the same 
college as that attended by the student nurses and 
22 Mennonite Voluntary Service personnel (VS con- 
trols) who were assigned to nonpsychiatric service 
in other locations. Ages of the students ranged from 
19 to 58 years with a median of 20, while the range 
of the VS controls was 20-48 years, with a median 
of 23. The median age of the total control group 
was 21 years with a range from 19 to 58. 


Hypotheses and Procedure 


A questionnaire consisting of the items from Cohen 
and Struening’s (1962) Opinion about Mental Il- 
ness (OMI) was administered on a test-retest basis 
to all members of the experimental and control 
groups. The first hypothesis to be tested was that 
the pretest-posttest scores of the experimental group 
would reflect the findings of Manis, Houts, and 
Blake (1963), exhibiting shifts in the direction of 
conformity to the attitudes of the hospital treatment 
staff, The second hypothesis was that such con- 
vergence, should it occur, would also support the 
findings of Hicks and Spaner (1962) that the atti- 
tudes of student nurses toward psychiatric patients 
shifted in a favorable direction. 

In order to provide a base line for the evaluation 
of attitudinal shifts, as well as to provide a criterion 
of “favorableness,” the OMI was administered to the 
Clinical treatment staff of the hospital (three psy- 
chologists, five psychiatrists, and four social workers). 
The mean scores of this group on the five OMI 
factors were then used as a base line against which 
to evaluate pretest and posttest scores of both the 
experimental and control groups for degree of shift 
and for relative favorableness or unfavorableness. 

Pretest scores were obtained from all the members 
of the experimental group upon their arrival at the 
hospital, Patients were tested immediately after 
admission, student nurses during the first day of 
affiliation, and employees when they started work. 
Posttest scores were obtained from the patient and 
pon nurse subgroups just prior to their departure 
tom the hospital, either at the time of discharge 
or upon termination of their hospital affiliation. 

The members of the employee subgroup, by reason 
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of their status as volunteers or conscientious objec-" 
tors, were committed to work at the hospital for 
predetermined periods of from 1 to 2 years. Conse- 
quently, it was decided to retest them at varying 
time intervals in order to test the second hypothesis 
that degree of change in attitudes would be positively 
related to time spent in the hospital. Four retest 
intervals were employed. One-fourth were retested 
after they had been in the hospital for 1 month; 
one-fourth were retested after 6 months; one-fourth 
after 12 months; and the remaining fourth after 24 
months. Assignment of individuals to each of the 
four retest intervals was made on a rotational basis, 
that is, Sı after one month, Sə after 6 months, Ss 
after 12 months, etc. Obviously, however, only Ss 
who were known to be assigned to the hospital 
for 24-month tours could be assigned to the 
fourth interval, and adjustments in the distribution 
of Ss among the four retest intervals were made 
accordingly. 

The control Ss were tested in groups. The student 
control Ss received the OMI pretest at the beginning 
of their introductory course in psychology, and the 
posttest 14 weeks later before beginning the mental 
health unit of their course. The VS control Ss com- 
pleted the pretest as part of their orientation for 
nonpsychiatric work assignments. Posttest scores for 
these Ss were obtained by canvassing them by mail 
1 year later. Since at this time they were widely 
scattered throughout the world, posttest scores were 
obtainable for only approximately 60% of the origi- 
nal VS control subgroup. This selectivity of response 
may have biased the subgroup’s results, possibly 
accounting for its change on one attitude dimension, 
as seen below. 

A third hypothesis predicted that the varying 
degrees and kinds of indoctrination to which the 
experimental subgroups were subjected would be 
reflected by varying degrees of convergence of post- 
test means toward the mean of the treatment staff 
on each of the five factors, Since the student nurses 
were subjected both to hospital exposure and to 
systematic instruction, they were expected to show 
the most change. Employees were subjected to in- 
service training efforts which, while not as systematic 
or as intensive as those of the student nurses, never- 
theless suggested that employees might change more 
than patients. Hence, the predicted order for degree 
of change was student nurses > employees > pa- 
tients. Since the control subgroups were subjected 
neither to indoctrination nor to exposure to psychi- 
atric patients, their posttest means were not expected 
to converge toward the treatment staff means or to 
change in degree of favorableness to patients. 

All questionnaires were scored separately for each 
of Cohen and Struening’s five factors. Scoring was 
based on the published factor loadings of 53 items 
(Cohen & Struening, 1962); items were scored in 
the direction of their positive or negative factor 
loadings for the respective scales. One item, “The 
patients of a mental hospital should have something 
to say about the way the hospital is run,” received 
a high-negative loading for Factor A and a high- 
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positive loading for Factor C. This item was scored 
only for Factor C. A 6-point Likert forced-choice 
response continuum was used in the questionnaire. 
Response alternatives were: Agree very much, Agree 
on the whole, Agree a little, Disagree a little, Dis- 
agree on the whole, and Disagree very much. Re- 
sponses were scored from O (extreme disagreement) 
to 7 (extreme agreement), omissions being scored 4. 
Item scores were summed algebraically with nega- 
tive values assigned to items having negative factor 
loadings. 


RESULTS AND DISCUSSION 
Pretest Group Differences 


Inspection of Table 1 reveals that the mean 
pretest OMI score of the experimental group 
was significantly higher (p < .01)* than that 


? Significance of all pretest intragroup differences 
was evaluated by means of two-tailed ¢ tests, for 
independent means, since no predictions had been 
made about their direction. Pretest-posttest differ- 
ences were evaluated for significance by means of 
one-tailed tests for correlated means since their 
directions had been predicted. 
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of the control group on Mental Hygiene 
Ideology and significantly lower (p < .01) on 
Social Restrictiveness. The control group was 
higher on Authoritarianism, but the difference 
was of borderline significance (.05 > p > .01), 
The mean pretest scores of all subgroups 
were higher than those of the treatment staff 
for Authoritarianism, Benevolence, and Social 
Restrictiveness, and below on Mental Hygiene 
Ideology. These differences were all significant 
at or beyond the .01 level. The student nurses 
were significantly less authoritarian on pretest 
(p < .001) than were the other experimental 
or control subgroups, but there were no 
consistent pretest differences between sub- 
groups on the remaining four factors. 


Pretest-Posttest Group Changes 


Comparison of pretest and posttest OMI 
means and standard deviations is also seen in 


TABLE 1 
Pretest AND Postrest OMI MEANS AND STANDARD DEVIATIONS OF ALL Groups 
Authoritari- Mental hygiene} Social restric- | Interpersonal 
anism Benevolence ideology tiveness etiology 
Group N 
Pre- | Post- | Pre- | Post- | Pre- | Post- | Pre- | Post- | Pre- | Post- 
test test test test test test test test test test 
Treatment staff 3 Te 
21.2 = 40.1 = 51.0 E 7.8 — 24.9 = 
SD 8.5 3.2 4.7 3.6 6.2 
Experimental group 129 
M 47.8** | 40.1 44.3 | 43.5 44.7 45.2 |14.2**| 11.8 | 23.2 24.1 
SD 21.6 20.2 6.3 6.4 7.8 7.9 8.7 7.6 7A 14 
Nurses 33 
m an na 44.4 43.4 | 45.0 46.5 | 12.1* 10.1 | 22.0 24.6 
x f 5.6 5.4 ki 
pitts 5.6 5.4 5.7 5.8 5.3 5.4 E 
M 54.1** | 44.5 | 43.5 43.3 45.0 45.4 |15. 
ja a f al i "hd 12.9 | 22.7 23.0 
SD 227 | 24] 68 | 76 ; 
Employes 8.7 8.8 | 10.6 8.3 iD) 8.1 31 
M 52.4** | 45.8 | 45.9% | 44.0 43.9 | 43.4 
E . A A i 14.6" | 11.1 | 25.6 25.8 
SD 15.5 | 179 | 5.2 j 
Control group ae ae REENE Khat 5g 103 
a a tee a 44.0 | 40.5 | 40.2 |20.8* | 19.4 | 24.1**| 22.2 
coe i í . 5.1 7.5 7.9 | 80 7.4 | 6.6 6.5 af 
we el sa By Lae 39.8 39.1 | 20.9 19.7 | 23.0* 21.5 
Voluntary service A ; ; is Ne CAR ot 22 
M 56.1 55.4 | 44.4 42.8 | 43.2 
. ; : 2 | 44.0 | 20.3 | 18. 1 | 24.8 
SD 19.5 20.6 4.5 5.0 5.8 7.2 | 8.7 a ay “58 


* p = .05, one-tailed £ test. 
** > = .01, one-tailed ¢ test. 


ATTITUDES TOWARD MENTAL ILLNESS 


Table 1. Overall changes from pretest to post- 
test were Clearly selective for the various fac- 
tors. The total experimental group showed 
significant decresases (p < .01) on Authori- 
tarianism and Social Restrictiveness, con- 
verging on the treatment staff norms in agree- 
ment with the findings of Manis, Houts, and 
Blake (1963). The total group showed bor- 
derline decreases (p < .05) on Authoritarian- 
ism and Social Restrictiveness, and a signifi- 
cant (p< .01) decrease on Interpersonal 
Etiology. The experimental group remained 
significantly higher on Mental Hygiene 
Ideology and lower on Social Restrictiveness 
than the control group (p < .01). In addition, 
at posttest the control group was signifi- 
cantly more authoritarian than the experi- 
mental group, and the difference on Inter- 
personal Etiology approached significance 
(.05 > p> .01). 

Most of the posttest means of the experi- 
mental and control subgroups remained sig- 
nificantly above those of the treatment staff 
on Authoritarianism, Benevolence, and Social 
Restrictiveness, and significantly below on 
Mental Hygiene Ideology. Exceptions were in 
the student nurse, patient, and VS subgroups. 
The differences between the student nurses 
and the treatment staff on Authoritarianism, 
Benevolence, and Social Restrictiveness were 
no longer significant. In the patient and VS 
subgroups the posttest means on Benevolence 
were no longer significantly above that of the 
treatment staff. 

Individual subgroups showed varying pat- 
terns of change. The student nurse subgroup 
showed a borderline decrease in Authoritarian- 
ism and Social Restrictiveness, and a border- 
line increase in Interpersonal Etiology. This 
partially supported the findings of Lewis and 
Cleveland (1966), who found a decrease in 
Authoritarianism and increases in Mental 
Hygiene and Interpersonal Etiology, among 
student nurses, The patient subgroup showed 
a significant decrease in Authoritarianism, and 
a borderline decrease in Social Restrictiveness. 
The employee subgroup decreased to a sig- 
nificant degree in Authoritarianism and Social 
Restrictiveness, and to a borderline degree 
in Benevolence. 

Three changes in the control subgroups are 
of interest. These are the significant decrease 
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in Authoritarianism and the borderline de- 
crease in Interpersonal Etiology shown by the 
student controls, and the significant decrease 
in Interpersonal Etiology shown by the VS 
controls. The first two may reflect the influ- 
ence of the introductory psychology course 
to which the student controls were exposed 
between pretest and posttest. However, Costin 
and Kerr (1962) found that exposure to an 
abnormal psychology class resulted in stu- 
dent OMI changes in the direction of lower 
Authoritarianism, Benevolence, and Social 
Restrictiveness, and increased Interpersonal 
Etiology. The significant decrease in Inter- 
personal Etiology in the VS controls may 
reflect their experience with foreign customs 
and living conditions, and increased awareness 
of the importance of social and economic fac- 
tors in the etiology of emotional disturbance. 
However, evidence for this interpretation is 
lacking. Further study might reveal whether 
or not the changes observed in these control 
subgroups are fortuitous. 


Order of Change 
The data yielded no evidence that degree 


of change followed the predicted order of 


student nurses > employees > patients. A chi- 
square for the relative ranks of the three sub- 
groups on all five factors failed to approach 
significance (p > .05). The borderline in- 
crease in Interpersonal Etiology exhibited by 
the student nurses may reflect the influence 
of the formal instruction which supplemented 
their hospital experience, although the data 
neither support nor disprove this speculation. 
Indeed, the apparent predominance of atti- 
tudinal change (Authoritarianism, Benevo- 
lence, Social Restrictiveness) over ideological 
change (Mental Hygiene Ideology, Interper- 
sonal Etiology) within the experimental group 
suggests that the changes which did occur 
may be attributable to the effects of increased 
personal familiarity with patient behavior, 
rather than to those of indoctrination or 
instruction. 


Length of Hospital Experience 


In order to test the hypothesis of a rela- 
tionship between degree of change and length 
of experience in the hospital, an analysis of 
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variance was carried out on the scores of the 
members of the employee subgroup, who had 
been retested after 1, 6, 12, or 24 months. 
Since this subgroup exhibited significant 
change only on Authoritarianism and Social 
Restrictiveness, only the scores of these fac- 
tors were analyzed. The F values obtained 
were 1.30 and .80 for the two factors, re- 
spectively, lending no support to the hypothe- 
sis since they were far below the values re- 
quired for statistical significance. These find- 
ings suggest that attitude changes may well 
have occurred in the early stages of Ss’ 
hospital experience and that they were not 
cumulative with time. 

However, since the small N (31) calls into 
question the power of the F test of signifi- 
cance in this instance, this conclusion must be 
tentative. Nevertheless, the present result is 
of interest in view of recent studies such as 
that of Muench, Tutko, and Stuckey (1967), 
and that of Keith-Spiegel and Spiegel (1967), 
in which attitudes of students toward psychi- 
atric patients have been shown to change as 
result of rather extensive exposure to a hos- 
pital environment. The present results sug- 
gest that change might be induced by con- 


siderably shorter periods of exposure than the 


approximately semester-long periods employed 
in these studies. Further study with more Ss 
is needed. 


J. J. SmE 
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DEVELOPMENT OF AN EMPATHY SCALE’ 


ROBERT HOGAN 2 
Johns Hopkins University 


The concept of empathy—the intellectual or imaginative apprehension of an- 
other’s condition or state of mind—is central for understanding a broad range 
of social phenomena including, in particular, moral development. Within this 
latter context, an empathic disposition can be regarded as the capacity to adopt 
a broad moral perspective, that is, to take “the moral point of view.” This 


paper discusses the development of a 64-item self-report measure of empathy, 


constructed by comparing the responses of groups 
using the combined MMPI-CPI item pool. After providing evidence 


empathy, 


with high- and low-rated 


concerning the scale’s reliability and validity, an attempt is made to show its 
relevance for specifically moral conduct by relating empathy scale scores to real 


life indexes of socially appropriate 
validated measures of personality. 


Some form of empathic disposition, role- 
taking ability, or social sensitivity is assumed 
by all approaches to personality which take 
the interpersonal situation as a major focus 
of concern. Accordingly, most writers in the 
fole-theoretical tradition (Cottrell, 1942; 
Gough, 1948; Mead, 1934; Sarbin, 1968) 
have given careful attention to this aspect 
‘of social functioning. Mead, for example, has 
argued that role-taking ability is the key vari- 
‘able in social and moral development; extend- 
ing this line of reasoning he equates the “g” 
factor in intelligence with social sensitivity, 
‘the origins of which can be found in the 
central nervous system. In a similar vein, 
Cottrell and Dymond (1949) also maintained 
that empathy is the basic process in all social 
interaction. 

_ Empathy, seen as an everyday manifesta- 
tion of the disposition to adopt a broad moral 
Perspective, to take “the moral point of view,” 
also becomes important within the context of 


_ ‘This paper is based on a dissertation submitted 
in partial fulfillment of the requirements for the 
degree of Doctor of Philosophy at the University of 
p tio rnia, Berkeley. The research was supported by 
Public Health Service Fellowship 1 Fi MH 30, 
101-01A1 (MTLH), and was conducted at the Insti- 
tute of Personality Assessment and Research, Univer- 


Sity of California, Berkeley. The author wishes to 
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Requests for reprints should be sent to Robert 
ogan, Department of Psychology, Johns Hopkins 
niversity, Baltimore, Maryland 21218. 


behavior and to certain previously well- 


research in moral development. By taking the 
moral point of view, a person is said to con- 
sider the consequences of his actions for the 
welfare of others (cf. Baier, 1958); in its 
social implications, this resembles Mead’s 
notion of taking the role of the generalized 
other. A willingness or, tendency to put one- 
self in another person’s place and to modify 
one’s behavior as a result, is clearly an 
important aspect of moral growth. 

Because empathy plays a central role in 
many phases of social psychology, a valid and 
easily administered measure would be quite 
useful. However, most attempts to develop 
such a measure have not been particu- 
larly well received. Four examples might be 
mentioned. 

Dymond (1948, 1949, 1950) developed a 
measure of empathic ability using rating 
forms. The results of Dymond’s research have 
great intuitive appeal; unfortunately the 
method used to obtain empathy scores is 
cumbersome, and has been questioned on 
methodological grounds (Hastorf & Bender, 
1952). The validity of the Empathy Test 
(Kerr, 1947) has been questioned by review- 
ers (cf. Hall, 1965; Thorndike, 1959). Taken 
at face value, the Test of Social Insight 
(Cassel, 1963) might seem promising. The 
consensus of reviewers (Black, 1965; Bordin, 
1960) appears to be, however, that viewed 
more closely the test lacks validity and has 
little to do with an empathic or perceptive 
capacity for understanding others. More re- 
cently, new evidence has been presented for 
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the validity of the Chapin Social Insight Test 
(Chapin, 1942; Gough, 1965b). Although 
quite interesting and useful, the Chapin test 
seems more closely tied to the realm of social 
and interpersonal effectiveness than to the di- 
mension of empathy per se, although overlap 
certainly exists. 

In spite of the apparent difficulty involved 
in developing a valid and acceptable measure 
of empathy, the theoretical import of the 
concept requires that continuing efforts be 
made. The presént report provides informa- 
tion concerning the origins and initial valida- 
tion of another such attempt. 


Is Empathy an Artificial Concept? 


In view of the apparent gap between theory 
and measurement, the possibility exists that 
the term “empathy” is a creature of Acade- 
mia. The practical appeal of the concept 
would be enhanced if intelligent laymen were 
found to use it in a meaningful fashion. 

The consensus of dictionaries is that em- 
pathy means the intellectual or imaginative 
apprehension of another’s condition or state 
` of mind without actually experiencing that 
person’s feelings. Although this definition im- 
plies a task, it says nothing about accuracy of 
performance. Empathy refers only to the act 
of constructing for oneself another person’s 
mental state; the verisimilitude of the result- 
ing construct is not a necessary part of the 
concept’s meaning. 

Fourteen nonpsychologists (school teachers, 
statistical clerks, etc.) were provided this 
definition of empathy. They were then asked 
to describe, using this definition, their con- 
ception of a highly empathic man with 50 
items selected from the 100-item California 
Q sort (Block, 1961) ; these items were chosen 
for content directly relevant to empathy. The 
14 individual Q-sort descriptions were inter- 
correlated, and the mean interjudge correla- 
tion was .53; the estimated reliability of the 
total composite from the Spearman-Brown 
correction was .94 (cf. Block, 1961, p. 37). 
This finding suggests that the group had a 
common behavioral referent for the concept 
of empathy and could use the term in a more 
or less consistent manner. 

Similarly, to check agreement among psy- 


HOGAN 


chologists, nine graduate students in psychol 
ogy were also asked to contribute a 50-item 
Q-sort description of an empathic man. Her 
the intercorrelations had a mean of .51; th 
estimated composite reliability was .90. 

Finally, to determine the amount of agree, 
ment between laymen and psychologists, the 
two Q-sort composites were correlated, yield. 
ing a coefficient of .86; when corrected for 
attenuation in 711 and 722, this becomes .93, 

These findings suggest that people hold a 
common conception of the behavioral conno. 
tations of empathy. We should also inguin 
about the uniqueness of this conception, thay 
is, to what degree will the composite Q sot 
for an “empathic man” correlate with a simi 
larly derived description of “a good man,’ 
or “a mentally healthy man”? 

Using the same 50 items, 10 undergraduatt 
psychology majors were asked to contribute 
Q-sort descriptions of “a good man,” whet 
good was to be taken in a general moral ol 
ethical sense. The 45 correlations betweell 
their descriptions had a mean of .62, thi 
estimate reliability of the composite de 
scription of “a good man” was .94. The 
correlation between this composite and the 
psychology graduate students’ ‘empathy’ 
composite was .29, .32 when corrected fol 
attenuation. 


members in psychology at Johns Hopkins 
were asked to describe “the mentally healthy 
man,” using the same Q sort as before. The 
average correlation of these descriptions wa 
-65, and the reliability of the composite was 
estimated to be .95. The correlation betwee 
the composite description of the empathic and 
the mentally healthy man was .47, .51 cor 
rected for attenuation. On the basis of role 
theoretical considerations (i.e., an empathic 
disposition leads to effective social functioning 
which, in turn, facilitates mental health), W° 
would expect this latter correlation to P 
moderately high, but certainly less than -? 
The obtained correlation is considered bol 
appropriate and conceptually meaningful. 
(Note: Each of the Q-sort composites “is 
cussed above was developed independently: 
thus there is no overlap in persons among 
the groups.) 


An EMPATHY SCALE 


Method 


From these analyses, it can be inferred that em- 
pathy refers to a relatively discrete social phe- 
nomenon recognizable in the experience of laymen 
and psychologists alike. These findings serve as 
justification for the development of our empathy 
scale, and also provide a basis for its construction. 
The scale was assembled in the following manner. 
First, a criterion for assigning ratings of empathy 
was developed. Second, evidence was gathered for 
evaluating the ratings. Third, the ratings were used 
to define highly empathic and nonempathic groups. 
Fourth, an item analysis was conducted to select 
items for a scale which would discriminate between 
the nominated groups; the resulting scale might 
then serve as a measure of a general empathic 
disposition. 

The empathy criterion was constructed as follows. 
Four faculty and research psychologists and three 
advanced graduate students in psychology at the 
University of California, Berkeley, were asked to 
describe their conceptions of a highly empathic man. 
They used the definition of empathy given above, 
and the full California Q sort (100 items). The seven 
Q-sort descriptions were intercorrelated, the co- 
efficients ranged from .59 to .78, with a mean of .71; 
the estimated reliability of this composite was 94. 
This composite description of a highly empathic 
man served as our empathy criterion. To illustrate 
the criterion, the following are the five items selected 
pot characteristic of an empathic man (Block, 

2): 


1. Is socially perceptive of a wide range of inter- 
personal cues. 

2. Seems to be aware of the impression he makes 
on others. 

3. Is skilled in social techniques of imaginative 
play, pretending, and humor. 

4. Has insight into own motives and behavior. 

5. Evaluates the motivation of others in interpret- 
ing situations. 


The content of these items is clearly relevant to 
empathy: all reflect insight, perceptiveness, and social 
acuity, The five items chosen as least characteristic 
of a highly empathic man were: 


1, Does not vary roles; relates to everyone in the 
same way. 

2. Judges self and others in conventional terms like 
“popularity,” “the correct thing to do,” social 
Pressures, etc. 

3.1s uncomfortable with uncertainty and com- 
plexities. 

4. Extrapunitive; tends to transfer Or project 
blame. 

5. Handles anxiety and conflicts by, in effect, re- 
fusing to recognize their presence; repressive or 
dissociative tendencies. 


ce items embody a complex dimension of con- 
eee antiintraception, and reliance on the 
lefensive techniques of projection and repression. 
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Since 1950 many different groups of people have 
been assessed at the University of California’s Insti- 
tute of Personality Assessment and Research (IPAR). 
Two groups of IPAR Ss were used in the present 
research. The first consisted of 100 military officers; 
the second sample contained 45 research scientists 
and 66 student engineers. Each of these Ss was 
studied by 8-10 skilled Os who recorded their im- 
pressions, formed during weekend living-in assess- 
ments, on the Q sort; composite Q-sort descriptions 
of these persons are on file at the Institute. 

The composite Q-sort description of each indi- 
vidual in the two samples was correlated with the 
empathy criterion described above. The resulting 
correlation coefficient was considered the empathy 
rating for that person. The ratings ranged between 
—.58 and +.68 in the two samples. 

Attempts to validate the criteria used in scale 
development lead logically to an infinite regress; 
thus, serious validational efforts are most appropri- 
ately directed at the measures themselves. None- 
theless, in view of the distinctly a priori nature 
of our empathy ratings, it seemed important to 
examine some of their behavioral and interpersonal 
implications. 

The first indication of these implications comes 
from the following. Using the present sample of 
military officers, Gough (1955) developed a “Total 
Social Acuity Index” based on weighted scores de- 
rived from the Kerr Empathy Test, the Chapin 
Social Insight Test, and the Cline movie procedure 
(Cline, 1953). This index of social acuity correlated 
26 (p < 01) with the military officers’ empathy 
ratings. 

Next, for the 45 research scientists and 40 of the 
student engineers, scores based on overall charades 
performance (Barron, 1954) correlated .61 and .58 
with empathy ratings. These figures are to some 
degree inflated because many of the Os who con- 
tributed to the charades performance scores also 
provided Q-sort descriptions of the same Ss. 

Finally, empathy ratings were correlated in each 
sample with the ‘standard scales of the California 
Psychological Inventory (CPI; Gough, 1964), the 
MMPI (Hathaway & McKinley, 1943), and the 
Chapin Social Insight Test. Empathy ratings were 
only modestly related to these standard measures; 
the largest correlations were with the Capacity for 
Status (.24), Achievement via Independence (.22), 
Intellectual Efficiency (.22), Self-Acceptance (.21), 
and Achievement via Conformance (.20) scales of 
the CPI.3 In general, the correlations were positive 
for the CPI and the Chapin test which stress ef- 
fective social functioning, and negative for the 
MMPI which has the opposite orientation. These 
correlations suggest that empathy ratings define a 
dimension which includes social competence, intel- 
Jectual promise, and feelings of self-worth. 

Our examination of empathy ratings permits two 
conclusions. First, use of the ratings as criterion mea- 


3 In these samples a correlation of .20 is significant 
at the .05 level. 
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sures of empathy seems justified to some degree. 
Second, the ratings reflect an important aspect of 
social performance which is not measured well by a 
variety of widely used personality scales. 

The empathy scale was developed by the standard 
technique of an item analysis of the responses of 
high-rated versus low-rated groups. From empathy 
ratings, Ss in the military officer sample, and in the 
sample of research scientists and student engineers, 
were placed into high (27%), middle (46%), and low 
(27%) subgroups. Studying each sample separately, 
the responses of the high and low subgroups were 
compared for 957 true-false items in the CPI, the 
MMPI, and an IPAR pool of items, using the chi- 
square or Fisher’s exact statistic to evaluate differ- 
ences. From these two analyses, 64 items (32 scored 
true, 32 false) were selected for the final scale, 

Although the foundations of the empathy scale 
rest on a comparison of the responses of high- and 
low-rated groups, final selection of the items was 
not a “blind-empirical” procedure. Items for the 
final scale were retained on the basis of four con- 
siderations, The first was that differences in endorse- 
ment frequencies between the high- and low-rated 
groups be in the same direction in both samples 
(61 items). Second, McClelland (1951) performed 
an analysis of the MMPI item pool against a 
criterion of rated role-playing ability, and Gough 
(1955) item-analyzed the CPI-MMPI pool against 
his Social Acuity Index mentioned earlier. Fifteen 
items which appeared in these two earlier analyses 
also attained significance in the present research and 
were retained. Third, 17 of the items finally selected 
failed to attain statistical significance, but were 
retained on the basis of relevant content. Finally, 
items were chosen with an eye toward balancing the 
scale’s true and false keying. 

The following 64 items constitute the empathy 
scale. Thirty-one are from the CPI, 25 are from 
the MMPI, and the remaining eight items come from 


various experimental testing forms used in studies at 
IPAR in Berkeley: 


A. Scoring of items in the CPI: 4(T), 8(T), 
25(F), 52(T), 67(F), 79(F), 81(F), 84(T), 86(T), 
97(T), 98(F), 100(T), 127(T), 186(F), 191(T), 
194(F), 198(T), 239(T), 247(F), 255(F), 271(F), 
275(T), 287(T), 359(T), 361(F), 363(F), 364(F), 
403(T), 421(F), 442(F), 463(F). 

B. Scoring of items in the MMPI; 15(F), 26(F), 
57(T), 73(F), 78(T), 79(F), 100(T), 129(F), 
170(F), 204(T), 231(T), 244(F), 248(T), 254(T), 
327(F), 336(F), 355(T), 372(T), 399(T), 404(F), 
407(T), 410(T), 417(F), 463(F), 478(F). 

C. Items from IPAR testing devices:4 


1. As a rule I have little difficulty in “putting 
myself into other People’s shoes.” (T) 

2. I have seen some things so sad that I almost 
felt like crying. (T) 

3. Disobedience to the 
justified. (F) 


*The author wishes to thank D. W. MacKinnon 
for permission to reproduce these items. 


government is never 
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4. It is the duty of a citizen to support his 
country, right or wrong. (F) 

5. I am usually rather short-tempered with 
people who come around and bother me with 
foolish questions. (F) 

6. I have a pretty clear idea of what I would 
try to impart to my students if I were a 
teacher. (T) 

7. I enjoy the company of strong-willed peo- 
ple. (T) 

8. I frequently undertake more than I can 
accomplish. (T) 


Validity and Reliability of the Scale 


We turn next to an examination of the, 
validity and reliability of the empathy scale, 
Extensive analyses have been conducted with 
it, only a portion of which will be reported 
here. The first question to ask is, “How well 
does the scale meet its design requirements?” 
The empathy scale was built to predict 0: 
sort-derived empathy ratings. In the samples 
used in its development (N = 211), the aver- 
age correlation between the scale and empathy 
ratings was .62; in an independent sample of 
medical school applicants (NW = 70), this 
figure was .39, | 

A further check of the scale’s validity is 
provided by the following: Five groups of Ss; 
studied at IPAR were rated by the assessment 
staff for “social acuity,” defined as: 


The ability to respond intuitively and empathically 
to others and to group situations. A person high on 
this variable would be described as insightful, per- 
ceptive, and discerning, as having the knack of 
“sizing up” social situations and of making sound 
and dependable evaluations of people. He will be 
receptive to nuances and subtleties of behavior 
which might escape the notice of others, and will 
possess a flair for interpreting and integrating his ; 
observations. 


The initial reliability of social acuity ratings 
for these samples ranged between .52 and .77, 
average .69; applying the Spearman-Brow? 
formula to estimate the reliability of the 
composite ratings, the coefficients varied be- 
tween .68 and .86, average .80. The mean 
correlation between empathy scale scores and | 
tated social acuity in the samples used t0 
develop the scale was .58, This figure is to 
some degree inflated because most of the Os 
who made social acuity ratings provided Q- 
sort descriptions of the same Ss. These Ss, M 
turn, were used to develop the scale. In an 
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independent sample of 70 medical school 
applicants, the empathy scale correlated .42 
with rated social acuity. 

Some evidence also exists for the validity 
of the empathy scale when used with a 
younger population. The scale was adminis- 
tered to 121 junior high school students (51 
boys and 70 girls) in the 13-15 year age 
range.® Two teachers were given the definition 
of social acuity previously cited, then asked 
to rate the five most and five least socially 
acute boys in their respective classes. The 
procedure was repeated for the girls. Com- 
bining boys and girls separately, the empathy 
scale mean and standard deviation for the 10 
“socially acute” boys defined in this manner 
were 33.0 and 4.1. Comparable values for the 
“nonsocially acute” boys were 27.2 and 4.3 
(t= 2.93, df = 18, p < .01). For the 10 most 
and 10 least “socially acute” girls, the means 
and standard deviations were 36.2 and 5.3, 
30.6 and 5.5, respectively (¢ = 2.20, p= 05). 

With a sample of 50 college undergraduates, 
the reliability of the empathy scale estimated 
by a test-retest correlation after a 2-month 
interval, was .84. Applying the KR-21 formula 
to the scores of 100 military officers yields a 
coefficient of .71. 


Psychological and Diagnostic Meaning 


The empathy scale appears to have ade- 
quate reliability and concurrent validity to 
warrant further investigation of its psycho- 
logical and diagnostic meaning. 

If, as Mead (1934) suggested, empathic 
ability underlies social intelligence, then the 
empathy scale should relate to both measures 
of social effectiveness and functional intel- 
ligence. Table 1 provides evidence on this 
point. The empathy scale achieves its highest 
correlations with those scales from the CPI 
(1-5) specifically designed to measure social 
and interpersonal adequacy. The scale is also 
moderately related to Intellectual Efficiency, 
an index of the degree to which a person 
effectively mobilizes his intellectual resources. 

The relationship between the empathy scale 
and intelligence scores is somewhat ambigu- 


id Gratitude is expressed to Homer Jones, Principal 
of Vallejo Junior High School, for his cooperation 
in the testing of these students. 
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TABLE 1 
CORRELATIONS BETWEEN EMPATHY SCALE 
SCORES AND SELECTED VARIABLES 
Group 
Tests and scales 
A B 
CPI 
1. Dominance 48 56 
2. Capacity for Status 62 51 
3. Sociability 58 56 
4, Social Presence 56 60. 
5. Self-Acceptance 34 1 
6. Well-Being 19 AL 
7. Responsibility Al 27 
8. Socialization 05 | —.01 
9. Self-Control —.04 | —.11 
10. Tolerance 42 33 
11. Good Impression 34 07 
12. Communality ,00 05 
13. Achievement-Conformance 30 123 
14, Achievement-Independence 27 AT 
15. Intellectual Efficiency 52 32 
16. Psychological-Mindedness 16 46 
17. Flexibility .23 .29 
18, Femininity —.04 | —.18 
Scholastic Aptitude 
1. MCAT Verbal —.16 
2. MCAT Quantitative - 11 
3. SAT Verbal 32 
4. SAT Quantitative 14 
MMPI 
1. Lie Scale 29 | —.16 
2: F (infrequent items) —.15 | —.25 
3. K (ego functioning) s35 31 
4. Hs (hyponchondriasis) 00 | —.04 
5. D (depression) —.38 | —44 
6. Hy (hysteria) A7 AS 
7. Pd (psychopathic deviate) ll 07 
8. Mf (feminine interests) —.01 16 
9. Pa (paranoia) AS | —.01 
10. Pt (psychasthenia) —.21 | —.05 
11. Sc (schizophrenia) —.11 | —.07 
12. Ma (mania) 44 14 
13. Si (social introversion) —.65 | —.48 
14, A (anxiety) —.40 | —.41 
15. R (repression) —.09 | —.18 
16. Es (ego strength) 34 38 
17. Sd (social desirability) .50 37 
18. TMA (manifest anxiety) —.49 | —.20 


.—Group A: 70 medical school applicants 7 = +23, 
? Roos; r= 30, p < .01. Group B: St female college seniors 
7 2.27, p < 05; 7 = .35, 9 < .01. MCAT: Medical College 


‘aptitude Test. 

ous. The correlations presented between the 
scale, the MCAT, and the SAT in Table 1 
show this ambiguity—coefficients ranging 
from low negative to moderately positive 
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depending on the population and the measure 
of intelligence used. 

The empathy scale has been compared with 
a number of other personality assessment de- 
vices to gain broader understanding of its 
predictive implications. Correlations with 
MMPI scales are also given in Table 1. From 
these it can be inferred that high scorers on 
empathy will be sociable, optimistic, and free 
from unnecessary doubts and worries. 

Table 2 offers comparisons between the 
empathy scale and indexes from the Myers- 
Briggs Type Indicator (Myers, 1962) and the 
Study of Values (Allport, Vernon, & Lindzey, 
1960). 

Certain single personality scales have been 
so widely used in research that any report 
of a new measure must indicate its relation- 
ship to them, Table 1 provides correlations 
between the empathy scale, the Ego Strength 
scale (Barron, 1955), the Manifest Anxiety 
scale (Taylor, 1953), and Edwards’ scale for 
Social Desirability (Edwards, 1957). Table 2 
gives evidence concerning the relationship 
between the empathy scale and Eysenck’s 


TABLE 2 


CORRELATIONS BETWEEN EMPATHY SCALE SCORES 
AND THE VARIABLES INDICATED 


Tests and scales r 
Study of Values* 
1, Theoretical =15 
2. Economic 17 
3. Aesthetic .08 
4, Social 19 
5. Political .00 
6. Religious =.14 
Maudsley Personality Inventory? 
1. Neuroticism —.28 
2. Extraversion 1 
Myers-Briggs Type Indicator? 
1. Extraversion 63 
2. Introversion —.61 
3. Sensation —.16 
4. Intuition 16 
5. Thinking —.18 
6. Feeling 10 
7. Judging Function =.17 
8. Perceptive Function 10 
Wr Smee ay 
NM = 42 males.” = 30,4, = O epee: 
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(1959) factor-pure measures of neuroticism 
and extraversion. 

Next, we must consider the empathy scale’s 
relation to both authoritarian tendencies and 
acquiescence, for which the California F scale 
(Adorno, Frenkel-Brunswik, Levinson, & San. 
ford, 1950; Bass, 1955) is an accepted index, 
In a sample of 48 male undergraduates the 
correlation between the Empathy scale and 
the F scale was —.52; the same index in a 
sample of 29 female undergraduates was 
—.30. Finally, in the same sample of under- 
graduate men, the empathy scale correlated 
—.31 with the Dogmatism scale (Rokeach, 
1960). 

An examination of the correlations between 
the empathy scale and existing personality 
measures shows several moderately large co- 
efficients. Given the size of certain of these 
correlations, it might be argued that the em- 
pathy scale is only a redundant measure of, 
for example, extraversion. Such a_ proposal 
would overlook two important points. First, 
the empathy scale predicts such diverse phe- 
nomena as low anxiety, self-acceptance, and 
lack of authoritarian tendencies as well as it 
does extraversion. Thus one may also argue 
that the empathy scale reflects attributes 
common to a wide range of interpersonal be- 
havior rather than redundantly measuring 
one specific aspect of social functioning. A 
more fundamental indication of the scale’s 
utility, however, is that it predicts rated 
empathy better than all existing measures 
with which comparisons have been made. 

The empathy scale has been scored for 11 
groups of men and three groups of women 
(total W = 1,086). Means and standard de- 
Viations for these groups are presented in 
Table 3. The highest mean score was ob- 
tained by a small group of upper-division men 
enrolled in a personality assessment course 
at the University of California. The class 
contained for the most part students hoping 
to do graduate work in clinical psychology. 
The “Education abroad students” were 
highly selected students chosen to participate 
in the University’s overseas program. Means 
for the two delinquent groups have been 
estimated by scoring the 39 empathy scale 
items found in the CPI. In two comparisons 
(N = 170), this CPI empathy subscale cor- 
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TABLE 3 


NORMATIVE DATA FOR THE EMPATHY SCALE 
ON THE SAMPLES ĪNDICATED 


Group N | M |SD 
Males 
1. Psychology majors 24 | 44.7 |5.2 
2. Education abroad students 18 | 43.2 | 4.8 
3. Medical students 70 | 42.4 | 5.3 
4. Research scientists 45 | 40.3 | 5.4 
5, College students (all levels) 90 | 39.1 | 4.7 
6. Honor students in engineering | 66 | 38.0 |6.4 
7. Military officers 100 | 37.7 | 7.1 
8, Architects 124 | 36.8 | 5.3 
9. Junior high school students 
(Grades 7 and 8) 51 | 31.0 | 5.0 
10. Prison inmates 92 | 30.4 | — 
11. Young delinquents 100 | 29.1 | — 
Females 
1. College seniors 143 | 41.5 | 5.1 
2. College students (all levels) 93 | 40.7 | 5.9 
3. Junior high school students 
(Grades 7 and 8) 70 | 33.7 | 5.2 


related 92 with the full-scale score. Addi- 
tionally, in these groups the mean of the 
item p values for the 39 items is 44, the 
mean p value for the remaining 25 items is 
50, Thus the prorated means underestimate, 
but only very slightly, the appropriate full- 
scale values. It should also be noted that 
females score one to two points higher than 
males on the average; that is, women seem 
to be slightly more empathic than men—a 
point which accords well with conventional 
wisdom on this topic. 


Interpersonal Implications and Psychological 
Basis of Measurement 


Gough (1965a) has presented a strategy 
for discovering and analyzing the psychologi- 
cal meaning of test scores. The method pro- 
ceeds through three stages. In the first we 
ask how well a scale relates to its appropriate 
Criteria. The second stage rests on analyses 
intended to reveal the scale’s underlying basis 
of measurement. The third requires a search 
for unusual or unexpected relationships. From 
Gough’s scheme, data presented thus far are 
necessary for the primary evaluation of a 
scale. The evidence indicates that the empathy 
scale predicts to some degree what it seeks 
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to predict. It is appropriate therefore to move 
to the second level of evaluation, that is, 
exploration of the scale’s psychological basis 
of measurement and personological meaning 
(insufficient evidence is available to permit 
discoveries at the third level). 

A fundamental question at the second level 
of analysis is, “Does the scale meet the 
requirements set by its conceptual base?” 
The empathy scale was developed within the 
framework of a multidimensional theory of 
moral development (Hogan, 1967). In con- 
trast with psychoanalysis, for example, the 
theory predicts that a person who fails to 
internalize parental prohibitions and the rules 
of society will not necessarily become delin- 
quent. Rather, if he learns to recognize the 
existence of the rights of others, if he can 
adopt the moral point of view, then he may 
behave appropriately despite experiencing so- 
ciety’s rules as alien and external. Internaliz- 
ing social prohibitions and learning to take 
the moral point of view are seen as two 
independent stages in moral development. 

From this discussion we predict that there 
will be differences in the behavior of poorly 
socialized groups differing in the degree to 
which they are disposed to take the moral 
viewpoint. To test this expectation, the Social- 
ization(So) scale of the CPI was selected as 
an appropriate index of strength of internal- 
ization of social rules—evidence supporting 
this choice is substantial (cf. Gough, 1965a). 
The empathy scale was assumed to reflect the 
ability to adopt the moral point of view. In 
a sample of 429 nondelinquent Ss, 124 
(40.3% of the sample) had low So scores 
(raw score < 33). From this latter group, 
50 Ss were chosen, of whom 25 had low 
scores on the empathy scale (raw score < 30), 
and 25 had high scores (raw score > 44). 
Persons with low scores for both socialization 
and empathy were placed in one group, low- 
socialization-high-empathy Ss were placed in 
a second group. Using the CPI, a scale by 
scale comparison of the two groups was 
conducted, the results of which appear in 
Table 4. The two groups differed significantly 
on 12 of the 17 scales which could be mean- 
ingfully compared. When the mean scores for 
the two groups are plotted on a CPI profile 
sheet, the resulting profiles are quite distinc- 
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TABLE 4 
Comparison OF Hicu-Empatoy—Low-SoctaizaTion AND LOW-EMPATHY- 
Low-SOCIALIZATION GROUPS on CPI SCALES 
Group A Group B 
Scale t $ 
M SD M SD 

1. Dominance 32.2 4.5 26.9 6.1 3.5 01 
2. Capacity for Status 23.6 2.9 19.2 4.0 4.5 01 
3. Sociability 26.6 4.1 24.0 6.2 1.7 10 
4. Social Presence 42.6 48 35.4 7.2 4.7 01 
5. Self-Acceptance 25.0 2.6 22.4 6.6 1.8 10 
6. Well-Being 37.4 3.8 34.0 4.6 2.9 01 
7. Responsibility 29.4 4.5 27.3 4.6 1.6 ns 
8. Socialization® 28.8 2.4 29.4 4.1 —0.7 ns 
9. Self-Control 25.4 6.0 23.1 7.2 1.2 ns 
10. Tolerance 25.0 4.8 20.6 4.8 3.2 01 
11. Good Impression 16.2 3.8 13.4 6.2 1.9 10 
12, Communality 25.0 2.0 25.5 3.2 —0.7 ns 
13. Achievement-Conformance 28.2 4.0 23.6 3.9 4.1 OL 
14. Achievement-Independence 24.1 3.2 21.8 6.3 1.6 ns 
15. Intellectual Efficiency 43.0 4.1 35.5 7.9 4.3 01 
16, Psychological-Mindedness 15.2 3.6 12.1 3.5 J1 01 
17. Flexibility 14.6 3.7 12.4 4.8 1.8 10 
18, Femininity 16.5 3.0 14.8 5.1 1.5 ns 
Empathy Scale 46.4 2.7 29.4 3.8 18.1 OL 


Note.—Group A: high empathy, low So; N 
^ Scale on which groups were selected for low scores. 


tive. The low-So-high-empathy profile is 
favorable. The group characterized by low 
scores on both So and empathy appears 
marginally adjusted and prone to antisocial 
behavior. These observations, based on infer- 
ences from the profiles, give tentative support 
to the conceptual meaning of the empathy 
scale, 

Extending the implications of the analysis 
given above, if the empathy scale reflects the 
disposition to adopt the moral point of view, 
then legal offenders should receive lower 
scores than nondelinquents. To test this pre- 
diction, the empathy scale was applied to a 
group of 92 inmates at a state prison. In 
this case only the 39 empathy scale items 
appearing on the CPI were scored. The pris- 
oners’ scores on the Army General Classifica- 
tion Test ranged between the 15th and the 
99th percentile. Their mean was at the 52nd 
percentile, indicating that the group was of 
approximately average intelligence. For the 
prisoners, the mean and standard deviation 
on the 39 item empathy subscale were 18.5 
and 5.1. These values can be compared with 


= 25. Group B: low empathy, low So; N = 25, 


those from a group of 100 military officers, 
which were 22.7 and 4.8 on the same scale. 
The difference in the means of these groups 
is highly significant (p< .001). Using 
Tilton’s (1937) method as an estimate, there 
is 39.5% overlap in the scores of the two 
groups. Although the officers are undoubtedly 
more intelligent than the prisoners, it is 
doubtful that the obtained differences in em- 
pathy scores can be explained entirely in 
terms of differences in intellectual endowment. 

A sample of 103 fraternity men was chosen 
to perform a final empirical check on the 
interpersonal meaning of the scale. The 
empathy scale mean and standard deviation 
for this group was 38.6 and 6.2. Each mem- 
ber had also been described by five of his 
peers on the 300-word Gough Adjective 
Check List (Gough, 1960; Gough & Heilbrun, 
1965). A composite description of each 
Person can be obtained from these five indi- 
vidual descriptions by counting the number 
of times each word is checked, and using the 
totals as scores. Thus, if Jones is checked by 
all five of his peers as “friendly,” his score 
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on this adjective will be 5. Jones will have 
then 300 scores, one for each adjective. 

From 33 adjectives with correlations sig- 
nificant at or above the .05 level, those 10 
with the most postive, and the 10 adjectives 
with the most negative correlations are listed 
below. 

The 10 words most descriptive of high 
scorers and the correlations are: 


pleasant (.32 sociable (.26) 
charming (.28) sentimental (.26) 
friendly (.27) imaginative (.26) 
dreamy (.27) discreet (.25) 
cheerful (.26) tactful (.24) 


The high scorer on the empathy scale seems 
likable and friendly, possessing considerable 
charm, poise, and tact. He is outgoing, warm, 
and very much at ease in the interpersonal 
situation—the term urbane might even be 
appropriate. 

The 10 words most descriptive of low 
scorers are: 


cruel (—.37) unemotional — (—.23) 
cold (—.29) unkind (—.23) 
quarrelsome (—.27) hard-hearted (— .23) 
hostile (—.24) argumentative (—.21) 
bitter (—.24) opinionated  (—.21) 


Thus the low scorer on the empathy scale 
appears somewhat aloof, disaffected, and 
disposed to alienate those around him. 


Discussion 


A proper evaluation of the empathy scale 
must answer two questions. First, “To what 
degree do scale scores reflect an empathic dis- 
Position?” This question is obviously directed 
toward the primary validity of the scale itself. 
The second question, “Do highly empathic 
persons, as identified by the scale, adopt the 
moral point of view more often than non- 
empathic persons?” has implications for both 
the scale and the model of moral development 
from which it was derived. 

In reply to the first question, the empathy 
scale appears to work well. From the peer 
ratings, test correlates, and other valida- 
tional evidence presented above, high scorers 
seem to be socially acute and sensitive to 
nuances in interpersonal behavior. Low scor- 
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ers, on the other hand, seem hostile, cold, and 
insensitive to the feelings of others. 

The second question, “Do highly empathic 
persons adopt the moral point of view more 
often than nonempathic persons?” is harder 
to answer. This is due in part to the highly 
inferential nature of the judgment involved; 
that is, behavior which entails adopting the 
moral point of view is often hard to recog- 
nize. Additionally, it is always difficult to 
answer adequately questions about the con- 
struct validity of a personality scale. It would 
appear that delinquents score lower on the 
scale than nondelinquents., Furthermore, low- 
empathy scores seem to have negative impli- 
cations for the social conduct of persons in 
an otherwise normal population. We conclude 
that empathic ability, as measured by the 
empathy scale, seems to be at least one re- 
quirement for taking the moral point of view. 
Obviously more research is required for a 
complete answer to this question. 

It seems fair to conclude at this point that 
both the practical and theoretical value of 
the empathy scale have received a measure of 
support. However, the evidence presented here 
represents only a first step toward a truly 
definitive evaluation. It is hoped that the 
potential merits of the scale will attract the 
attention of psychological researchers so that 
its utility can be properly assessed. 
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Recent studies of the significance of the psychotherapist’s personality are sum- 
marized and discussed with the aim of developing measures for clinical 
research. Ten variables are described, comprising a relatively comprehensive 
and discrete list of potential importance. They include: objectivity, depend- 


ability, sincerity, sureness, directiveness, 


components, i.e., responsiveness and 
warmth, The measurement of conscii 
54 psychiatrists, 77 psychologists, 
method uses a questionnaire requiring 
and provides rankings among 


empathy (both affective and cognitive 
understanding), respect, interest, and 
ous attitudes toward these variables in 


and 78 social workers is described. The 


forced choices between paired adjectives 


the variables. The results indicate both simi- 


larities and differences among therapist groups in relative preferences for these 


characteristics. 


The process of psychotherapy, as revealed 
in the actual interaction between therapist 
and patient, reflects certain background char- 
acteristics of the therapist: his formal pro- 
fessional training, his years of experience, and 
his personal style. Increasing attention is cur- 
rently being paid to the role of the last factor 
in determining whether or not treatment will 
be successful. In the past, most studies in 
this area have focused on a limited view of 
the therapist’s personality, or else have de- 
scribed such characteristics in global terms, 
thereby failing to identify the various discrete 
elements inherent in the general categories. 

This article is a report of an attempt to 
conceptualize these specific elements so that 
a full range of personal characteristics of the 
therapist relevant to the treatment process 
may be identified. To illustrate this approach, 


1The results of this paper were discussed in part 
at the meeting of the American Psychological Associ- 
ation, San Francisco, September 1968. The authors 
are grateful for the cooperation of the individuals in 
the various professional groups who returned the 
completed forms, for the assistance of Chester (o 
Bennett, L. D. Carter, and James Mann in contact- 
ing the student (trainee) groups, and for the sta- 
tistical and clerical help of Sylvia Fleisch, Phyllis 
Garbose, Carol Lindop, and Patricia Batson. 

* Requests for reprints should be sent to Aron Z. 
Spilken, Boston University School of Medicine, Di- 
vision of Psychiatry, 80 East Concord St., Boston, 
Massachusetts 02118. 


some findings will be presented which examine 
the self-reported relative preferences assigned 
to these characteristics by different profes- 
sional groups with varying levels of thera- 
peutic experience. 

Prominent among previous studies in this 
area has been the work of Whitehorn and 
Betz (1960), who used response patterns to 
the Strong Vocational Interest Blank as a 
means of differentiating successful from un- 
successful therapists of schizophrenic patients. 
Therapists designated Type A showed a re- 
semblance in their responding to such voca- 
tional groups as lawyers and CPAs with their 
interest in verbal matters. Type B therapists 
bore a similarity to printers and math-science 
teachers with their interest in mechanical 
matters, These differences have been assumed. 
to reflect variations in degree of understand- 
ing, permissiveness with patients, and ability 
to establish a trusting relationship. 

Strupp, Wallach, and Wogan (1964), in a 
retrospective study of the psychotherapeutic 
process as experienced by the patient, ob- 
served that patients who reported having the 
therapist’s respect and interest tended most 
often both to judge themselves and to be 
judged by their therapists as having been in 
successful treatment. Also contributing to a 
better outcome was the patient’s experience . 
of the therapist as warm, natural, unstudied, 
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and attentive, and not as cold, distant, formal, 
or extremely passive. Patients who were un- 
certain about how the therapist felt toward 
them tended to have a less successful treat- 
ment experience. 

Evaluating therapy behavior by training 
orientation, Sundland and Barker (1962) sur- 
veyed 139 psychotherapists. They found 
that a general factor (analytic-experiential) 
significantly distinguished among therapists 
identifying themselves with different schools 
(Freudian, Sullivanian, and Rogerian). The 
13 scales used to tap this dimension included 
Frequency of Therapist’s Activity, Type of 
Therapist’s Activity, Emotional Tenor of the 
Relationship, Structure of the Relationship, 
and Therapist’s Security. 

McNair and Lorr (1964) analyzed the self- 
reports of 265 professional workers in a Vet- 
erans Administration setting. Psychiatrists, 
psychologists, and social workers indicated 
their preferred treatment techniques, along 
three dimensions derived from Sundland and 
Barker’s (1962) work, which were felt by the 
authors to be critical in describing the thera- 
peutic process. The three factors were ana- 
lytic, impersonal, and directive. Analytic re- 
ferred to the use of traditional psychoanalytic 
techniques, such as analysis of resistance and 
transference, and exploration of early child- 
hood experiences. Impersonal referred to the 
therapist’s preference for remaining reserved 
and uninvolved, as opposed to personalizing 
the relationship. Directive referred to a prefer- 
ence for planning a therapeutic strategy and 
guiding the therapeutic interaction in contrast 
to allowing the patient more freedom in 
shaping the course of treatment. They found 
that preferences for different technique pat- 
terns tended to distinguish among the profes- 
sions, but were not a function of experience, 

Tn contrast to the above studies based on 
patients’ and therapists’ reports, Truax and 
Carkhuff (1965) examined the therapeutic 
interaction directly through the analysis of 
tape recordings, focusing on therapist vari- 
ables of self-congruence or genuineness, accu- 
rate empathy, and unconditional positive re- 
gard. Increases in the level of therapist 
empathy and positive regard within a single 
session were found to be related to depth of 
the patient’s intrapersonal exploration. It was 
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further demonstrated (Truax, Wargo, Frank, 
Imber, Battle, Hoehn-Saric, Nash, & Stone, 
1966) that therapists providing high levels of 
these conditions showed greater patient im- 
provement than therapists providing lower 
levels. ' 

These previous studies employed quite dif- 
ferent methods and focused on limited groups 
of variables which overlapped conceptually, 
The various factors isolated in therapists’ 
reports, patients’ judgments, and direct ob- 
servations should now be related in a single, 
comprehensive schema, describing the range 
of therapists’ behavior. Further, the discrete 
elements inherent in global concepts should 
be identified in order to assess both their 
individual effects and their impact in various 
combinations. 

Consistent with this aim, 10 discrete vari- 
ables, derived both from prior studies and 
from clinical judgments were identified as 
relevant to the therapeutic process. Each 
variable was conceptualized as representing a 
continuum with the midpoint reflecting a 
moderate amount of a therapist characteristic 
and with high and low deviations at the 
extremes. The 10 variables are objectivity, 
dependability, sincerity, sureness, directive- 
hess, responsiveness, understanding, respect, 
interest, and warmth. 

1. Objectivity. This factor is defined as a 
moderate degree of involvement consonant 
with balanced perspective and freedom from 
personal bias. Very low objectivity reflects 
overidentification with the patient, extreme 
closeness, and loss of perspective. Very high 
objectivity reflects aloofness and detachment. 
McNair and Lorr (1964) dealt with similar 
endpoints in their interpersonal dimension, 
without providing for the midpoint. Simi- 
larly, Strupp et al. (1964) dealt with distance 
and formality as distinct qualities, rather than 
as points on a continuum, 

2. Dependability. It may be expected that 
the degree to which the therapist is consistent 
and reliable largely determines the establish- 
ment of a trusting therapeutic relationship. 
At the midpoint of this scale the therapist 
indicates he is responsible and can be counted 
on to do what he says. The high end of the 
scale describes conscientiousness to the point 
of self-sacrifice and martyrdom on the part 
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of the therapist. At the low end, the therapist 
is irresponsible and unreliable. 

3, Sincerity. Sincerity is derived from 
Truax’s concept of genuineness, and Rogers’ 
concept of therapist self-congruence. This re- 
fers to the presentation of a “real” person 
in the therapeutic interaction, and the degree 
of congruence between what the therapist is 
feeling at the moment and what he says. 
Whitehorn and Betz (1960) found that phy- 
sicians effective with schizophrenic patients 
expressed personal attitudes more freely than 
less effective therapists. Truax rates this 
quality from low to high, making the assump- 
tion that the higher score is preferable. On 
our scale, we have allowed for the possibility 
of what might be considered by some as an 
inappropriately high degree of genuineness. 

This factor in its moderate form is defined 
as truthfulness consistent with protection of 
patient’s feelings. The endpoints of the scale 
reflect extreme openness, personal confession, 
and self-revelation at the high end, and 
superficiality, deception, or therapist role- 
playing at the low end. 

4. Sureness. It may be expected that when 
a therapist is unsure, anxious, or lacking in 
confidence, therapeutic progress is disturbed. 
Knupfer, Jackson, and Krieger (1959) found 
an association between therapists’ confidence 
in themselves and their level of competence. 
Bandura (1956) also sees freedom from anx- 
iety as basic to competence and feels that 
the therapist’s ability to respond nonanxiously 
is one of the crucial factors in his ability to 
relieve patient’s anxieties. Excessive sureness, 
at the high end of this scale, is exemplified 
by the therapist manifesting grandiosity and 
appearing pompous and dogmatic. At the low 
end, the therapist seems anxious, inadequate, 
helpless, and indecisive. Moderation reflects 
the therapist appearing to believe in and 
know what he is doing; he appears at ease, 
comfortable, and confident. 

5. Directiveness. Moderation at the mid- 
point, reflects the therapist who appropriately 
Provides structure, guidance, and limit setting. 
At the high end of the scale the therapist is 
conceptualized as authoritarian, controlling, 
and manipulative; the patient is told what 
to do and what not to do both in and out 
Of treatment. At the low end of the scale the 
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therapist adopts a passive, permissive stance, 
allowing the patient under all circumstances 
to proceed at his own rate and behave as 
he pleases. In the McNair and- Lorr 
(1964) study, the authors found that direc- 
tiveness differentiated among the therapeutic 
professions. 

Empathy is the variable perhaps most uni- 
versally accepted as vital to successful psy- 
chotherapy. It has been typically treated as 
a global quality describing a meaningful flow 
of understanding and communication be- 
tween therapist and patient. Truax (1963), 
for instance, following the theoretical orienta- 
tion of Rogers, designed a compound Accurate 
Empathy Scale to measure both the thera- 
pist’s sensitivity to the patient’s current 
feelings and his ability to communicate his 
understanding of these feelings in a way ac- 
ceptable to the patient. Fox and Golden 
(1964) in summarizing the analytic literature 
point out that empathy has been conceptual- 
ized as both an emotional and intellectual 
process. To deal more precisely with empathy 
and to make it applicable to a wider range 
of theoretical approaches, it seemed desirable 
to separate this concept into its components 
of responsiveness and understanding, and to 
enlarge the concept of understanding to in- 
clude the therapist’s conceptualization of the 
role of feelings and defenses in the patient. 

6. Empathy (Responsiveness). At the high 
end of this scale the therapist is seen as 
sympathetic; he feels for and with the patient 
and communicates this to him. At the low end, 
the therapist may have intellectual knowl- 
edge, but no real emotional appreciation of 
the patient’s feelings. At the midpoint the 
therapist has an emotional awareness of how 
the patient feels and communicates his alli- 
ance with the patient, but is not totally 
identified with him. 

7. Empathy (Understanding). At the high 
end of this scale, the therapist imposes his 
understanding before the patient is ready to 
accept it; that is, his interpretation is too 
early or too deep. At the low end, the thera- 
pist is insensitive to patient cues and lacks 
understanding, or is only partially aware 
of what the patient is trying to say. At the 
midpoint, the therapist comprehends implicit 
or latent meanings, and is able to place them 
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in context. His communications are both ac- 
curate and appropriately timed. This factor 
seems most related to the concept of thera- 
peutic skill (Bergin, 1966). 

8. Respect. At the midpoint of a discrete 
respect continuum, the therapist shows appre- 
ciation and acceptance of the patient’s indi- 
vidual worth and uniqueness. At the high 
end, the therapist evidences uncritical or 
naive acceptance of the patient’s reports, 
without adequately assessing the facts. At the 
low end, the therapist does not appear to 
value the patient as an individual and may be 
judgmental, sarcastic, or insulting. Strupp et 
al. (1964) found that if the patient perceived 
the therapist as having respect for him, he 
was likely to report positive outcome. Others, 
such as Rogers (1962), seem to have incorpo- 
rated respect into more generalized concepts. 
Unconditional positive regard, for example, 
seems to include warmth along with a respect- 
ing acceptance of the patient. 

9. Interest. At the high end of this dimen- 
sion, the therapist appears to be prying, 
overly curious, and intrusive. At the low 
end he seems inattentive and disinterested 
in the patient. At the midpoint he is at- 
tentive and alert to the patient’s needs and 
communications. 

10. Warmth. At the midpoint of this con- 
tinuum, the therapist feels comfortable with 
and likes the patient, is reassuring and en- 
couraging. At the high end, the therapist is 
infantilizing, fosters dependency which might 
be difficult to terminate, or promotes intimacy 
that is uncomfortable or embarrassing to 
the patient. At the low end, the therapist 
expresses no liking or affection for the patient 
and seems cold and unresponsive. 

Thus, the authors’ conception of therapists’ 
personal and stylistic characteristics which are 
expected to influence the course of treatment 
includes these 10 variables. As a first step in 
experimentally examining these dimensions, it 
was decided to determine their relative value 
in the therapeutic community, comparing pro- 
fessed attitudes among therapists of different 
schools of training (psychiatry, psychology, 
and social work) and among therapists of 
different levels of experience. These professed 
values can then serve as a basis for compari- 
sons in future studies where these character- 
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istics are observed in actual therapeutic be. 
havior. The general assumptions were that 
these qualities are differentially valued by 
members of the various professions, and that 
experienced therapists feel differently about 
them than do the trainees. 


METHOD 


A scale was designed to determine the relative 
value which a therapist professed to have for each 
characteristic. Because self-ratings were to be ob- 
tained, only the midpoints of each characteristic 
could be expressed since many of the endpoints are 
clearly undesirable and would not be likely to be 
chosen as preferred traits. A list of synonyms repre- 
senting each variable was prepared. Examples of 
these are: 1. objective, impartial; 2. dependable, 
reliable; 3. genuine, sincere; 4. confident, secure; 
5. directing, guiding; 6. and 7. empathic, under- 
standing; 8. accepting, respecting; 9. interested, at- 
tentive; 10. warm, encouraging. It will be noted 
that for purposes of this technique, responsiveness 
and understanding were treated as one variable 
(Concepts 6 and 7). Thus nine concepts were 
evaluated. 

An adjective reflecting each of the nine variables 
was paired with adjectives of every other factor, 
resulting in 72 paired items. Each therapist was 
asked to select one term from each pair representing 
“qualities which are important in a good therapist,” 
as of more value to him than the other. It was indi- 
cated, in the instructions, that this was difficult 
since all items were selected for their basic value 
in therapy. To accomplish this, however, the rater 
was asked to make a forced choice in each pair on 
the basis of his first impressions. The choice was 
structured as a “relative preference” rather than an 
absolute one. To assess the consistency of the ratings 
the therapist was asked to complete a parallel form 
approximately 24 hours later, without referring back 
to the first. The parallel form was essentially the 
same with different combinations of 72 paired syno- 
nyms. In order to obtain as truthful a reaction 4s 
possible, the raters were asked that the forms be 
completed anonymously and that they be returned 
to us in stamped, addressed envelopes which were 
coded to identify subsamples. t 

The sample tested consisted of experienced (clinic 
or hospital staff) and inexperienced (graduate stu- 
dents and residents) members of the therapeutic 
professions: psychiatry, clinical psychology, and 
social work. The vast majority of these were traine 
in the psychoanalytic orientation. Because it was 
possible that the values revealed might merely reflect 
Professional status or education in general, as a 
control for this, dental and law students (i.e, indi- 
viduals with postgraduate education) were also 
sampled. 

Scoring of the Psychotherapist’s Preference Scale 
(PPS) was accomplished by summing the number 
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of adjectives underlined for each category. The sum 
of choices on both Forms A and B was used to 
represent the best estimate of a rater’s preferences. 
The scores for each of the nine concepts were then 
compared statistically across groups, both by profes- 
sion and by experience, in order to determine 
whether the different samples of therapists attached 
greater or lesser value to the various characteristics. 
‘As a follow-up to the two-way analysis of variance, 
Duncan’s multiple-range test (see Edwards, 1966) 
was used to determine precisely which mean scores 
were significantly different among the six therapist 
groups. In order to determine whether all profes- 
sionals value these concepts equally, the 209 psycho- 
therapists were compared directly with the 66 others 
in the sample by means of a one-way F test. Finally, 
in order to obtain an understanding of the relative 
importance attached to each concept by each class 
of raters, the preferences of each group were rank 
‘ordered, and multiple-range tests performed. 


RESULTS 


Table 1 indicates the number of completed 
forms returned by each group and the per- 
centage this represented of total forms dis- 
tributed to members of that group. Two 
hundred and nine psychotherapists were 
sampled, with an average rate of return of 
l %. This suggests that the sample is repre- 
entative of the population of psychothera- 
ists, The rate of return for the 66 law and 
dental students was much lower (42%) indi- 
tating less concern with the study and less 
Tepresentativeness. 

_ Reliability figures also appear in Table 1 
d indicate that for psychotherapists in 


general consistent responses were given to 
both forms. Responses to the two forms were 
reliably in agreement for 86% of these Ss. 
The average rho was .78 for this subsample. 


: dentists and lawyers showed less con- 
tency from Form A to B. Only 55% of 
this group was in reliable agreement. The 
Average rho was .61 for this subsample. These 
differences between level of agreement are 
Statistically significant (chi-square = 27.83, 
Ż< 001). 
s Tn Order to determine whether all groups 
ith postgraduate education equally value 
tain characteristics, psychotherapists and 
psychotherapy professionals were com- 
ited on each individual concept. For all but 
e of the concepts there are highly signifi- 
ant differences between the scores of the two 
ps. Psychotherapists valued empathy, re- 
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TABLE 1 


INFORMATION CONCERNING NUMBER AND RATE OF 
RETURN AND RELIABILITY OF 
COMPLETED FORMS 


, Percent- 
Number age evi-| Average 
return- pacan dencing dee 
Group ing com-| 28° % | signifi- | of cor- 
pleted or d cant |relation 
forms |5@™P!€¢) agree- | (rho) 
ment* 
Social workers 20 95 80 19) 
Social work stu- 
dents 58 67 88 15 
Clinical psycho- 
ogists 50 65 86 19 
Psychology 
trainees 27 79 96 84 
Psychiatrists 26 70 85 i1] 
Psychiatric 
residents 28 74 79 +16 
Dental students 25 53 64 62 
Law students 41 37 49 60 
All therapists 209 71 86 78 
Dental and law 
students 66 42 55 61 


Percentage of Ss in a group achieving rho of at least .60, 
which is significant at the .05 level of confidence for nine paired 
concepts, 


spect, and interest more highly, and objec- 
tivity, dependability, sincerity, sureness, and 
directiveness less highly than nontherapy pro- 
fessionals. The two groups did not differ 
significantly in their preference for warmth, 

The relative preferences for the nine con- 
cepts expressed by each of the various groups 
of psychotherapists appear in Table 2. Each 
concept is listed with its score, ranked ac- 
cording to preference, for each professional 
group and experience level, and for combined 
groups of experienced and inexperienced mem- 
bers of each profession. This last comparison 
involved 96 experienced versus 113 psycho- 
therapists in training. 

The total of the ranks of the concepts for 
the six groupings of therapists shows empathy 
to be clearly most preferred. Dependability 
and interest are tied for second and are 
followed by sincerity, respect, warmth, objec- 
tivity, sureness, and directiveness, in that 
order. A slightly different order of preference 
is produced by summing the scores for each 
concept across all groups, although empathy 
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TABLE 2 


COMPARISON oF CONCEPTS WITHIN SUBJECT GROUPS 


Rank 


Group 


Social workers 
Experienced 
Concept 
Score 


Duncan 
Social workers 
Inexperienced 
Concept 
Score 
Duncan 
Psychologists 
Experienced 
Concept 
Score 


Duncan, 
Psychologists 
Inexperienced 
Concept 
Score 


Duncan 
Psychiatrists 
Experienced 
Concept 
Score 


WARM 
13.19 


Duncan 

Psychiatrists 

Inexperienced 

Concept 
Score 
Duncan 

All social workers 
Concept 
Score 
Duncan 

All psychologists 
Concept 
Score 
Duncan 

All psychiatrists 
Concept 
Score 
Duncan 


EMP 
22.89 


DEP 
20.82 


SINC 
20.29 


EMP 
23.68 


INT 
20.89 


SINC 
19.24 


EMP. 
23.84 


SINC 
21.42 


DEP 
20.97 


EMP 
22.65 


DEP 
21.83 


SINC 
20.70 


INT 
19.82 


RES 
18.96 


INT 
20.75 


INT 
20.13 


DIR 
6.54 


RES 
15.75 


OBJ 
15.54 


SURE 
10.25 


DIR 
7.33 


DEP 
18.56 


WARM 
14.64 


SURE 
9.17 


DIR 
4,33 


RES 
19.18 


WARM 
13.86 


OBJ 
9.33 


DIR 
5.96 


RES 
16.15 


SURE 
10.83 


OBJ 
13.11 


WARM 
12.63 


Note—EMP = Empathy, INT = Interest, RES = Respect, DEP = Dependability, SINC = Sincerity, WARM = warmth 


OBJ = Objectivity, SURE = Sureness, DIR = Directiveness. 


remains the most, and directiveness the least 
preferred of the concepts. This is presented in 
the table under “All psychotherapists.” The 


results of Duncan’s multiple-range test are - 


shown under the means for each S grouping. 
Scores joined by a bar indicate preferences 


which are not significantly different from one 
another. 

Table 3 presents the same means grouped 
by concepts. Duncan’s multiple-range test was 
used to indicate preferences of the different 
professional groups which are not significantly 
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TABLE 3 
COMPARISON OF SUBJECT GROUPS WITHIN CONCEPTS 
Rank F 
Pro- 
1 fa Ve AAP E igual E a | EXP: |p <i Enter’ a 
sion rience action* 

Pi SWe | SWi | Pe PSVe | PSYi | 9.77 | .005 | 4.60 05 6.35 | .005 
15.54 | 11.80 | 11.60 | 10.50 | 9.44 | 9.11 
Pe PSYe | Pi PSYi | SWe | SWi 7.91 | .005 | 1.40 ns 0.85 ns 
22.92 | 21.06 | 20.82 | 20.81 | 18.65 | 18.53 
PSYe | Pe PSYi | Pi SWi | SWe 3.69 | .05 55 ns 1.32 ns 
22.00 | 21.15 | 20.33 | 20.29 | 19.50 | 18.50 
Pe PSYe | Pi SWe | PSYi | SWi 1.34 | ns 2.48 ns 0.08 | ns 
11.46 | 10.44 | 10.25 | 10.15 | 9.74 | 8.83 
SWi | Pi SWe | Pe PSYi | PSYe} 4.90 | .01 4.05 05 0.53 ns 
7.88 | 6.54 | 5.75] 5.35| 4.67 | 4.14 
PSYi | SWe | SWi | PSYe| Pi Pe 2.56 | .10 .80 ns 2.37 10 
25.22 | 24.45 | 23.41 | 23.10 | 22.89 | 22.38 
PSYe | SWe | PSYi | SWi | Pe Pi 14.51 | .005 64 ns 0.20 ns 
19.30 | 19.00 | 18.96 | 18.95 | 16.58 | 15.75 
SWe | PSYi | PSYe | Pe SWi | Pi 1.44 | ns 1.06 ns 1.42 ns 
22.15 | 21.15 | 20.54 | 20.46 | 20.45 | 19.82 
SWi_ | PSYe | PSYi | SWe | Pe Pi 1.99 | ns .00 ns 1,22 ns 
15.02 | 13.98 | 13.63 | 13.55 | 13.19 | 12.11 


Pi = Psychiatrist, inexperienced; 
gist, experienced ; SWi = Soci: 
ession X Experience. 


nt. F values and their significance levels 
esented to the right of the concepts. 

using on the preferences expressed by 
atrists, Table 3 indicates that psychi- 
Tesidents value objectivity more than 
Other groups, all psychiatrists value re- 
tt less than other groups of therapists, and 
enced psychiatrists value sureness more 
| inexperienced social workers. The com- 


Pe = Psychiatrist, experienced; PSYi 
enced i ker, inexperienced ; SWe = Social worker, experienced. 


pe eychologist, inexperienced; PSYe 


parisons for psychologists indicate that in- 
experienced psychologists value empathy more 
than do experienced psychologists and all 
psychiatrists. The comparisons for social 
workers show that all social workers value 
dependability and sincerity less than experi- 
enced psychiatrists and psychologists, experi- 
enced social workers value interest more than 
inexperienced psychiatrists, inexperienced so- 
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cial workers value warmth more than inex- 
perienced psychiatrists, and social work stu- 
dents and psychiatric residents both value 
directiveness more than experienced psychia- 
trists and all psychologists. 

Level of experience does not seem to dif- 
ferentiate relative preferences with the excep- 
tion of directiveness. Objectivity is partially 
related to this variable, but the results reflect 
the interaction with profession; that is, in- 
experienced psychiatrists’ scores seem to de- 
termine the greatest portion of this difference. 


Discussion 


Two central findings of this investigation 
were that (a) therapeutic qualities of em- 
pathy, dependability, interest, sincerity, and 
respect seem to be most valued by all psycho- 
therapists, and (b) within the range of 
preferences certain qualities were differen- 
tially valued according to the training of the 
psychotherapists. 

The general similarity of the preferences of 
all groups of therapists is striking. Greater 
differences might have been anticipated be- 
tween the subsamples because of variations in 
kind and amount of training and to a lesser 
extent in professional self-image and profes- 
sional goals. Social workers, for example, with 
their access to the community, and psychia- 
trists, with their medical training, might both 
be expected to consider themselves more in 
a position for active intervention than would 
psychologists. This may be reflected in the 
directiveness scores, especially for the inex- 
perienced social workers and psychiatrists. 
Just as noteworthy, however, is the fact that 
all therapists placed directiveness at the 
bottom of their list of preferences, It might 
also be expected, because of the structured 
nature of both the analytic approach and 
setting, that psychiatrists would tend to em- 
phasize technique variables, and social work- 
ers, faced with a more fluid and uncontrolled 
setting, would emphasize relationship vari- 
ables, with psychologists falling somewhere in 
between. This was not borne out by the data 
in any consistent way. Previous studies by 

„Fiedler (1950b) and Strupp (1955a) do in 
fact emphasize the similarities between pro- 
fessional groupings in conceptualization of an 
ideal therapeutic relationship or an ideal re- 
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sponse to a specific therapeutic issue. They 
differ, however, by demonstrating an experi- 
ence effect which was not found to be 
significant here. In addition, Fiedler found 
that nontherapists were also able to describe 
an ideal therapeutic relationship while the 
present data show consistent differences be- 
tween the values of psychotherapists and non- 
therapist professionals. 

It may be that psychotherapists, because 
of some self-selection factors, differ in per- 
sonality structure from members of other 
professions, and to a lesser degree, from other 
therapists in related disciplines. The type of 
training received may have been a major 
determinant of the response patterns, which 
in turn may reflect either behavior or merely 
goals or values. Thus far, from our data, it 
can only be stated that there are certain 
general patterns of overall agreement among 
a broad range of psychotherapists regarding 
these concepts in the abstract. In addition, 
there are subpatterns which tend to distin- 
guish therapists with different backgrounds 
and degrees of experience from one another. 

Another possible interpretation of the find- 
ings is that an artifact was produced by the 
context of the test. The consistency with 
which warmth, objectivity, sureness, and di- 
rectiveness are found at the bottom of the 
lists for each group may be taken to imply 
that they are the more value laden concepts 
and therefore more subject to negative inter- 
pretation, These factors may have been read 
as “seductiveness, coldness, dogmatism, and } 
authoritarianism” when paired with possibly 
more moderate or neutral terms. Conse 
quently, such concepts as empathy and de- 
pendability were found to be consistently 
highly rated by the various psychotherapy 
professions. 

Some corroboration regarding these values 
as conscious preferences among therapists 
comes from the work of McNair and Lort 
(1964). The “directiveness” dimensions in the 
two studies can be considered roughly equiva 
lent and “warmth” can be treated as a parti 
reciprocal of their “impersonal” factor. Whe? 
the data in their table presenting profession 
by technique pattern are broken down into 
the single technique factors, social workers 
are seen to prefer directiveness most ofte! 
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(71%), with psychiatrists (66%) and psy- 
chologists (29%) following in that order. 
This agrees with our findings for all social 
workers, psychiatrists, and psychologists. In 
Table 2 it is shown that the mean score for 
all social workers on this variable is highest 
for the three professions, and that all psy- 
chologists score lowest. An examination of 
Table 3 shows that in each case the trainee 
group is more in favor of directiveness than 
are the experienced members of their profes- 
sion. But the order for the experienced and 
inexperienced groups is the same. 

A similar breakdown of their impersonal 
dimension shows psychologists to be highest 
on the personal end of the factor (55%), 
social workers next with 49%, and psychia- 
trists with 34%. As indicated in Table 2, 
social workers are highest on warmth, fol- 
lowed by psychologists and psychiatrists. 
These are data for combined groups, however, 
and McNair è and Lorr’s sample consisted of 
experienced therapists. Table 3 indicates that 
experienced psychologists do in fact choose 
warmth more often on the PPS than do 
experienced social workers and psychiatrists. 

A number of differences in value preferences 
among groups of varying professional training 
or amount of experience have been noted in 
the results. This is the first step in the direc- 
tion of understanding what such differences 
Mean, but a variety of related comparisons 
Must be made before meaningful patterns and 
explanations will emerge. For instance, what 
are the relationships between the attitudes 
Measured and the behavior of therapists in 
the practice of psychotherapy? Do different 
relationships between attitude and practice 
hold for experienced and inexperienced mem- 
bers of the same profession and of different 
Professions? How much can attitudes and 
Practice be modified by training and experi- 
ence? How does this vary with the personal- 
ity of the therapist, the type of attitudes held, 
and the extremeness of the attitude? Which 
therapist attitudes and behaviors relate most 
strongly with positive outcome in psycho- 
Heeb. and how does this interact with pa- 
tient diagnosis? There are findings in this area 
Which might be clarified and systematized 
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by examination in a broader context (Fiedler, 
1950a; Strupp, 1955b). The data gathered 
provide base-line information within an inte- 
grated system suitable for such a broad 
survey of therapists’ attitudes and behavior. 

The authors have developed a comprehen- 
sive set of variables which define a wide range 
of therapist attitudes and behavior. The par- 
ticular instrument used in this study may 
prove of value as a self-rating device, but it 
is limited to aspects of personality which are 
nonoffensive or socially desirable. To measure 
extreme (high or low) features of a therapist’s 
interaction with a patient, other forms should 
be employed, reflecting the full range of these 
10 variables. 

To illustrate, one such technique may be 
used in rating actual therapy behavior from 
taped interviews or observation of the ongoing 
therapy. This form would also be suitable to 
ascertain the ability of supervisors, instruc- 
tors, and peers to predict or know the therapy 
behavior of others. Another technique in- 
cluding choices for moderate as well as exces- 
sive characteristics may be used to record the 
patient’s perception of his therapist. Lorr 
(1965) has already demonstrated the poten- 
tial importance of this technique with the 
finding that patient perception of therapist 
understanding and acceptance is significantly 
related to self-rated improvement and thera- 
pist assessment of client satisfaction. Finally, 
an anticipatory form of the same scale would 
allow the individual seeking treatment to 
record his expectations of what the therapist 
will be like. Heine and Trosman (1960) have 
already shown the usefulness of patient 
anticipations in the prediction of dropout 
from psychotherapy. An outpatient study is 
currently underway which will attempt to 
replicate their findings with the expanded list 
of variables presented here. Conceivably, 
objective measures of transference and coun- 
tertransference issues can be obtained by 
comparing patient anticipations, patient per- 
ceptions of therapists, and therapists’ self- 
ratings with the ratings made by objective, 
trained personnel from actual tape recordings. 
All scales use the same frame of reference, 
so that direct comparisons can be made 
between any or all of the different proposed 


forms. 
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AVERSION TECHNIQUES IN BEHAVIOR THERAPY: 


SOME THEORETICAL AND METATHEORETICAL 
CONSIDERATIONS 


G. TERENCE WILSON? anp GERALD C. DAVISON 
State University of New York at Stony Brook 


Recent psychophysiological research indicates that certain cues may be singu- 
larly appropriate as functional conditioned stimuli for certain response systems. 
Such findings raise into question the trend in the aversion conditioning literature 
away from chemically produced aversion and toward fear produced by painful 
electric shock. Although this shift from the chemical to the electrical is being 
made on sound functional grounds, the physiological research on appropriateness 
of cues suggests that contrary to prevalent assumptions in behavior therapy, 
one must consider most carefully the topographical nature of stimuli and 
responses. At the metatheoretical level, this research makes still more per- 
suasive the argument that behavior modification as a field be concerned with 


areas of psychology outside of learning and conditioning, 


An important issue in the use of aversion 
techniques concerns the nature of the noxious 
stimulus. Earlier versions of aversion treat- 
ment favored the use of chemically produced 
aversion in the form of a nausea-producing 
drug. (cf. Lemere & Voegtlin, 1950), Current 
trends, however, almost exclusively empha- 
size electrical aversive stimulation. Rachman 
(1965) and Rachman and Teasdale (in press) 
detail the reasons which have dictated this 
shift to electrical noxious stimuli. At a prac- 
tical level, the use of nausea-producing drugs 
results in an unpleasant and unsavory experi- 
ence for client and clinical staff alike. At a 
more theoretical level, Eysenck and Rachman 
(1965) have suggested that these drugs act 
as central nervous system neural depressants, 
and thereby are likely to impede the condi- 
tionability of the client in accord with Ey- 
senck’s personality theory. Yet these factors 
are really secondary to the most cogent of 
Rachman and Teasdale’s points, namely, the 
difficulty in adhering to the traditional condi- 
tioning paradigm (in both classical and an- 
ticipatory avoidance conditioning) when using 
chemical aversion. The crucial parameters 
here are temporal, that is, getting optimal 
interstimulus intervals; sequential, that is, 
having the CS precede the UCS; the fre- 

Requests for reprints should be sent to G. 
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quency of repetition of CS-UCS pairings; and 
finally, precision in specifying the intensity 
and duration of the stimuli. Variable and 
fluctuating individual differences in reactions 
to emetic drugs virtually proscribe the pre- 
cision required by the conditioning paradigm 
for successful application. 

Given their premise of the traditional con- 
ditioning paradigm, Rachman and Teasdale’s 
points are both well taken and compelling. 
However, some recent findings from the 
physiological psychology literature raise ques- 
tions for current practices in aversion therapy, 
and suggest also some metatheoretical con- 
siderations for behavior therapy. 

Garcia and his associates have recently 
demonstrated that the procedure of pairing a 
perceptible cue with an effective reinforcer 
does not lead automatically to effective asso- 
ciative learning. Rather, it seems that the 
cue must be “appropriate” for the conse- 
quences that ensue. Thus, Garcia and Koelling 
(1966) have shown that avoidance learning 
with gastrointestinal disturbances produced 
by ionizing radiation as the UCS readily 
transferred to a gustatory stimulus, but not 
to audiovisual and tactile stimuli. On the 
other hand, avoidance learning with electric 
shock as the UCS transferred to the audio- 
visual and tactile stimuli, but not to the 
gustatory stimulus. Garcia and Koelling 
(1967) reported the same differential effect 
following the injection of a drug. Garcia, 
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McGowan, Ervin, and Koelling (1968) paired 
either flavor or size of food pellets as condi- 
tioned stimuli with either malaise induced by 
X ray or electric shock as the UCS in four 
groups of rats. The combination of flavor and 
illness resulted in a significant conditioned 
decrease in food consumption, but that of size 
and illness did not. Conversely, the combina- 
tion of size and pain produced an inhibition 
of eating while flavor and pain did not. 
The authors argue that since flavor is closely 
related to the chemical composition of food, 
natural selection would favor associative 
mechanisms relating flavor and olfaction to 
the aftereffects of ingestion; and they sug- 
gest how effective associative learning depends 
on central neural convergence of the paired 
afferent input. 

These data appear to discredit the com- 
monly made procedural prescription that, in 
the treatment of an alcoholic, for instance, 
shock be used to condition aversion to the 
sight, smell, and taste of alcohol, to exoge- 
nous as well as to endogenous stimuli. It 
seems that fear responses from a shock UCS 
may be conditioned only to nongustatory 
attributes of alcohol in our example, and 
not to the taste and/or smell of alcohol. 
Any radical change in these conditioned 
stimuli, such as could be produced by a 
different environmental complex, could be 
expected to lead to the “spontaneous recov- 
ery” of the consumption of alcohol (Estes, 
1955). The best strategy would seem to be 
to create a chemically based aversion to the 
taste and/or smell of alcohol and not to the 
complex of visual, personal, and other stimuli 
defining the treatment situation. 

Garcia, Ervin, Yorke, and Koelling (1967) 
and Revusky (1968) have extended this no- 
tion of the peculiar “appropriateness” of cues 
to reinforcers, finding that reinforcement can 
be delayed well beyond the time interval 
posited as mandatory for effective reinforce- 
ment by the traditional S-R model of associ- 
ative learning. It will be recalled that one of 
the major arguments in favor of electrical 
aversion relates to the temporal control it 
allows. 

Interestingly enough, Lazarus (1968) 
has independently and from a purely clinical 
standpoint expressed similar views, He reports 
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the case of an alcoholic whose alcohol con- 
sumption remained recalcitrant to faradic 
shock but disappeared rapidly when a singu- 
larly foul-smelling admixture of smelling salts 
was substituted as the noxious UCS. He goes 
on to observe that whereas faradic shock 
seems “appropriate” when the concern is with 
visual and/or tactile stimuli, as would be the 
case in a handwashing compulsion, it may 
be inappropriate in handling overeating and 
alcoholic consumption. Of course these and 
similar clinical observations (e.g., Cautela, 
1967; Davison, 1968) are in line with Garcia’s 
psychophysiological findings. 

The foregoing raises also some interesting” 
issues on the metatheoretical level. The 
learning-oriented behavior modification litera- 
ture, especially the operant, emphasizes the 
overriding importance of functional definitions 
of problems in distinct contrast to topo- 
graphical (e.g., Ferster, 1965; Staats & Staats, 
1963). In the attempt to extrapolate from 
research with infrahumans to research with 
humans, learning theory oriented behavior 
modifiers seem, on reflection, to have had little 
choice but to stress the functional to the 
virtual exclusion of the topographical, nay, 
physiological nature of stimuli and responses. 
Thus, one terms a response as topographi- 
cally complex as “walking to the door” an 
“operant” if it can be shown to relate func- 
tionally to antecedent and consequent stimuli 
in the same fashion as more carefully deline- 
ated operants in the Skinner box. The same 
holds true for behavior therapy approaches 
which stress the classical conditioning para- 
digm: Pavlov (1928) himself proposed that 
“every imaginable phenomenon of the outer 
world affecting a specific receptive surface of 
the body may be converted into a CS [p. 881: 

The Garcia research obviously raises the 
serious question of this preoccupation with 
functional identities to the exclusion of the 
specific nature of the particular stimuli 
and responses within one’s presumed condi- 
tioning paradigm. This work also illustrates 


modification” as deriving from “modern learn- 
ing theory” or, more properly, learning prin- 
ciples. Familiarity with such psychophysiO- 
logical work as Garcia’s (inter alia) makes 
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clear the desirability of expanding the field 
of behavior modification to include general 
experimental psychology as a whole (cf. 
Davison, in press). 
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ENCODING OF ATTITUDE BY A SEATED COMMUNICATOR 
VIA POSTURE AND POSITION CUES* 


ALBERT MEHRABIAN?2 anp JOHN T. FRIAR 
University of California, Los Angeles 


In this study, Ss were requested to imagine themselves in situations involving 
different kinds of addressees and to sit in the ways in which they would if 
they were actually interacting with these addressees. The four independent 
factors in the experiment were encoder-communicator attitude, addressee status, 
addressee sex, and encoder sex. The dependent measures were eye contact, dis- 
tance, head orientation, shoulder orientation, leg orientation, arm openness, 
leg openness, and hand, foot, and trunk relaxation. The findings suggest that 
the most important variables for the communication of positive attitude are 
small backward lean of torso, close distance, and more eye contact. Also, there 
is less sideways lean and more eye contact in communications with high-status 
as compared to low-status addressees. In addition, for female encoders, less 
arm openness is an indicator of a higher status of the addressee. 


The present investigation is an attempt to 
explore the salient postural, orientation, and 
distance cues which have been considered 
relevant to the communication of attitudes. 
There have been a number of informal dis- 
cussions of cues which can be relevant in such 
communication. Hall (1959, 1963, 1964) 
delineates several variables which he sub- 
sumes under the rubric of proxemics. Some 
proxemic variables are distance between a 
communicator and his addressee, degree of 
directness of orientation of a communicator 
toward his addressee (i.e., the degree to which 
a communicator’s body is turned in the direc- 
tion of his addressee versus away from the 
direction of his addressee), and the presence 
or absence of touching, or eye contact, between 
communicator and addressee. Thus, the con- 
cept of proxemics subsumes variations in 
postural and distance variables which relate 
to the degree of directness or immediacy of 
interaction between a communicator and his 
addressee (Mehrabian, 1967). In addition to 
eye contact and directness of body orienta- 
tion, Machotka (1965) also denotes acces- 
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sibility of a communicator’s body to the ad- 
dressee (e.g., openness of arms) as a relevant 
affect or attitude communicating variable. 
Directness of body orientation can also be 
construed as a proxemic variable. 

Argyle and Kendon (1967) summarized 
some of the research literature related to the 
effects of the above variables in communica- 
tion. For instance, the importance of eye 
contact in communication involving positive 
versus negative attitudes has been demon- 
strated in several studies. Kendon (1967) 
suggested that the amount of eye contact 
which a communicator allots to his addressee 
can serve the regulative function of allowing 
varying degrees of emotionality, or expres- 
sion of attitudes between the communica- 
tors. Exline and Winters (1965) found 
that when Ss were interviewed simultaneously 
by two Es, they tended to have more eye 
contact with the Æ whom they preferred. 
They also found that Ss have more eye con- 
tact with an interviewer when they receive 
a positive treatment from the interviewer 
than when they receive a negative treatment 
which produces feelings of dislike. Argyle and 
Kendon (1967) report an unpublished study 
by Weisbrod ë who studied eye contact pat 
terns in a group. She found that a speaker 
rates those of his addressees who look at him 
most as being instrumental to his goal aM 

3R. M. Weisbrod. Looking behavior in a dis 
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: as valuing him more than addressees who 
look at him less. She also found that a 
speaker feels more powerful when he receives 
more eye contact from his addressees. The 
latter finding can also be interpreted to indi- 
cate that greater eye contact with a speaker 
communicates a feeling of higher status to the 
speaker. 

Sommer’s (1967) review indicates the rele- 
vance of variations in distance and orientation 
to attitudes of the communicator. Studies by 
Leipold (1963), Little (1965), Argyle and 
Dean (1965), and Rosenfeld (1965) consis- 
tently provide support for the hypothesis that 
closer positions are assumed to persons toward 
whom attitude is positive than those toward 
whom attitude is negative. Rosenfeld also 
found that his Ss who were assigned an 
approval seeking role did not assume a 
significantly more direct body orientation 
toward their addressees than Ss who were 
assigned an approval avoiding role. Steinzor 
(1950), however, observed that in a discussion 
group, most of the interaction took place 
between those who were sitting opposite each 

other (i.e. direct orientation). There are 
two possible interpretations of this relation- 

_ ship: Ss tend to sit across from people with 
whom they like to interact and/or sitting 
directly opposite someone leads to interaction 
with that person. The first interpretation is 
More plausible. Since the individuals in 
Steinzor’s group met for several sessions, the 
choice of a sitting position in the group could 
be determined on the basis of attitudes among 
Members of the group. Some of Mehrabian’s 

(1967) findings also provide support for the 
first interpretation. He found that when eye 
Contact is present, a less direct body orienta- 
tion of a communicator is interpreted by the 
addressee as an indication of a less positive 
attitude toward herself than when there is 

More direct body orientation. Thus, the exist- 
Ing evidence relating orientation to attitude is 
equivocal, 

Machotka’s (1965) findings suggest some 
relationship between openness of arms and 
attitude. His Ss rated line drawings display- 
ing different degrees of arm openness of nude 
figures, Compared to the figures with moder- 
ate or very open arm positions, the figures 

with closed arm positions were judged as 
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being cold, rejecting, unyielding, shy, and 
passive. 

In addition to the above investigations 
which relate the proxemic variables to atti- 
tudes, there are some studies of the signifi- 
cance of relaxation—tension in the communica- 
tion of affect, attitude, or status differences, 
between communicators, Schlosberg (1954), 
denoted the dimension of sleep-tension as 
being relevant to the expression of emotions. 
Ekman (1965) asked Ss to rate photo- 
graphs of communicators in an interview 
situation on a tension-relaxation dimension. 
He found that body cues, relative to head 
cues, are more suited for the communication 
of intensity of affect. In the present context, 
the relationship of body relaxation to atti- 
tudes or status relationships is of interest. 
Goffman (1961) found that in psychiatric 
staff meetings, tension is greater in lower 
status participants (interns) than in higher 
status participants (psychiatrists), The latter 
finding implies that the degree of body relaxa- 
tion of a speaker is greater when he com- 
municates to a lower status addressee than 
when he communicates to a higher status 
addressee. 

Mehrabian (1968) explored the main and 
interactive effects of several proxemic vari- 
ables on inferred attitudes. In the experi- 
ments, various combinations of posture, orien- 
tation, and distance of communicators were 
presented to Ss who inferred attitudes from 
these cues. In the photographic stimuli pre- 
sented to Ss, in addition to postural cues, 
communicator characteristics such as age or 
sex were also varied. The findings indicate 
that when a communicator sits in a relaxed 
posture, a relatively forward lean of his 
posture toward his addressee communicates 
a more positive attitude to him. An open posi- 
tion of his legs and arms communicates a more 
positive attitude than a closed position of his 
legs and arms. When a communicator is re- 
laxed and standing, a relatively closer position 
to his addressee communicates a more posi- 
tive attitude. A relatively direct orientation to 
his addressee can communicate either positive 
or negative attitudes, depending on the age 
and sex of the communicator and the addres- 
see. 

The study to be reported employs an en- 
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coding methodology. In this study Ss were 
requested to imagine themselves in situations 
involving different kinds of addressees and to 
sit in the ways in which they would if they 
were actually interacting with these ad- 
dressees. There were four independent factors 
in the experiment. They were encoder atti- 
tude, addressee status, addressee sex, and 
encoder sex. There were two levels of each of 
these four factors. The two levels of the 
attitude factor were communicated to S by 
asking him to imagine an addressee whom he 
liked versus an addressee whom he disliked. 
The two levels of the addressee status were 
communicated to S by asking him to imagine 
an addressee who was of a higher status than 
himself versus an addressee who was of a 
lower status than himself. There was a total 
of 11 dependent measures in the study. These 
were eye contact, distance, head orientation, 
shoulder orientation, leg orientation, arm 
openness, and leg openness. Additional depen- 
dent measures relating to relaxation were 
hand relaxation, foot relaxation, and trunk 
relaxation as measured by backward and 
sideways angles of lean, 

Drawing on the interpretations of research 
findings previously reviewed, the hypotheses 
were that there is more eye contact, smaller 
distance, and greater relaxation in communica- 
tions with liked than with disliked addressees, 
and that there is more eye contact and less 
relaxation in communications with high-status 
than with low-status addressees. 


METHOD 
Subjects 


Forty-eight University of California undergradu- 
ates served as Ss, The Ss were equally divided as 
to sex and served in the experiment as part of 
introductory psychology course requirements. 


Procedure 


The procedure employed in the experiment was simi- 
lar to one reported in Mehrabian (1968). The S who 
was left alone with a set of instructions assumed 
eight different postures which he considered to be 
typical of his postures while interacting with 
Liked-Disliked X High-Low Status X Male-Female 
Addressees, The S’s behavior was observed through 
a one-way mirror by three raters who independently 
rated the behavior on the 11 dependent measures. 
The order in which S carried out the eight condi- 
tions was counterbalanced and was not known to 
the raters. 
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RESULTS 

Reliability 

Three judges rated the behavior of S on 
each of 11 dimensions. Product-moment cor- 
relations were obtained between the ratings 
of each pair of judges for each of the 11 di- 
mensions. Mean correlations over the three 
pairs of judges for each of the 11 dimensions 
indicate the following reliability estimates: 
.81 for eye contact, .99 for distance, .82 for 
head orientation, .93 for shoulder orientation, 
.88 for leg orientation, .90 for arm openness, 
.94 for leg openness, .27 for hand relaxation, 
-16 for foot relaxation, .85 for backward lean 
of torso, and .82 for sideways lean of torso, 
Since the comparisons of the ratings of the 
three judges indicate significant consistency 
among their judgments (except in the case of 
hand and foot relaxation), mean judgments 
over judges for each of the dependent vari- 
ables were employed in the analyses to be 
reported. 


Eye Contact 


A score of zero was assigned for no eye 
contact and a score of unity was assigned 
for the presence of eye contact. A 2 X 2 X 2 
X 2 X 24 analysis of variance was performed 
on the mean judgments of eye contact ob- 
tained from the three judges. In this design, 
there were two levels each of attitude, ad- 
dressee status, addresee sex, and encoder sex, 
with 24 Ss nested under each level of the 
encoder sex factor and repeated measures on 
the first three factors. 

The results of the analysis of variance of 
the eye contact scores indicate significant 
effects due to attitude (F = 17.4, df = 1/46, 
p< 01), addressee status (F = 14.8, df= 
1/46, p < 01), addressee sex (F = 10.6, df 
= 1/46, p < .01), Attitude x Addressee Sex 
(F=11.4, df=1/46, p< .01), Addressee 
Status x Addressee Sex (F = 9.5, df = 1/46, 
p< 01), Attitude x Addressee Status X En- 
coder Sex (F = 10.4, df = 1/46, p < .01), and 
Attitude X Addressee Sex x Encoder Sex 
(F = 9.4, df = 1/46, p < 01). 

The main and interaction effects involving 
the attitude factor will be considered first. 
The significant effects of attitude, Addressee 
Sex X Attitude, and Encoder Sex x Addressee 
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Sex X Attitude are hierarchically related; 
theréfore, a consideration of the Encoder 
Sex X Addressee Sex X Attitude Effect will 
summarize all the necessary findings related 
to these significant effects. 

Application of the Newman-Keuls tech- 
nique (Winer, 1962) for comparisons of the 
eight means involved indicates the following 
significant differences among the means at the 
05 level. For male encoders, the cell means 
for eye contact indicate that irrespective of 
addressee sex, there is less eye contact with 
disliked addressees (.68) than with liked ad- 
dressees (.86). For female encoders, there is 
significantly less eye contact with disliked 
male addressees (.53) than with any of the 
other three addressee groups (i.e., liked males, 
88; liked females, .81; and disliked females, 
81). In other words, for female encoders, the 
degrees of eye contact with liked male, liked 
female, or disliked female addressees do not 
differ significantly from each other, while all 
three differ significantly from the degree of 
eye contact with disliked male addressees. 

The main and interaction effects involving 
the addressee status factor will next be con- 
sidered. The significant effects of addressee 
status, addressee sex, Addréssee Status X Ad- 
dressee Sex are also hierarchically related; 
therefore, a consideration of the Addressee 
Status x Addressee Sex effect will summarize 
all the necessary findings relating to these 
three effects. Application of ¢ tests for com- 
parison of the four means involved indicates 
the following significant differences among 
the means at the .01 level. Irrespective of 
encoder sex, there is significantly less eye 
Contact with low-status male addressees (.60) 
than with any of the other three addressee 
groups (i.e. low-status females, .79; high- 
oi males, .83; and high-status females, 

Finally, the Attitude x Addressee Status X 
Encoder Sex effect needs to be considered. 
Application of the Newman-Keuls technique 
for comparisons of the eight means involved 
Indicates the following significant differences 
among the means at the .05 level. For male 
encoders, there is more eye contact with 
liked high-status adressees (.98) than with 
disliked high-status addressees (.71), liked 
low-status addressees (.74), or disliked low- 
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status addressees (.64). The latter three 
means do not differ significantly among them- 
selves. For female encoders, there is less eye 
contact with disliked low-status addressees 
(.54) than with disliked high-status ad- 
dressees (.80), liked high-status addressees 
(.83), or liked low-status addressees (.85)— 
the latter three means not differing signifi- 
cantly among themselves. 


Distance 


The straight-ahead and lateral distances 
were used to obtain the total distance of S$ 
from the addressee. The unit of measurement 
for distance was the length of a side of the 
9X9 inch tiles covering the floor of the 
experimental room. A 2X 2X2X2X 24 
analysis of variance was performed on the 
mean judgments of distance obtained from 
the three judges. The design employed for the 
present analysis, as well as all subsequent 
analyses of variance, was identical to that 
already reported for the analysis of the eye 
contact measure. The results of the analysis 
of variance of the distance scores indicate 
significant effects due to attitude (F = 49.5, 
df = 1/46, MS. = 5.42, p< 01) and ad- 
dressee sex (F = 9.9, df = 1/46, MS, = 1.95, 
p< .01). The mean distance to liked ad- 
dressees (5.57) is less than the mean distance 
to disliked addressees (7.24). Also, the mean 
distance to female addressees (6.18) is less 
than the mean distance to male addressees 
(6.63). 


Orientation Measures 


All orientation measures were obtained in 
such a way that greater angles correspond to 
Jess directness of orientation of the encoder’s 
head, shoulders, or legs toward his addressee. 
The product-moment correlation of the head 
orientation scores with the shoulder orienta- 
tion scores is .90; the correlation of head 
orientation scores with the leg orientation 
scores is .86; and the correlation of the 
shoulder orientation scores with the leg orien- 
tation scores is .97. While separate analyses 
of variance were performed for each of these 
three dependent measures, due to the high 
intercorrelations among these measures, only 
the analysis of variance performed on the 
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average of head, shoulder, and leg orientation 
scores will be reported. 

Analysis of variance of the average orienta- 
tion scores shows significant effects due to 
attitude (F = 4.2, df = 1/46, MS. = 438.2, 
b < .05), Addressee Sex X Encoder Sex (F = 
10.8, df = 1/46, MS, = 588.4, p < .01), and 
Attitude X Addressee Status X Addressee Sex 
(F = 4.9, df = 1/46, MS, = 464.9, p < .05). 
The main and interaction effects involving the 
attitude factor will be considered first. The 
significant effects of attitude and Attitude x 
Addressee Status x Addressee Sex are hier- 
archically related; therefore, a consideration 
of the triple interaction effect will summarize 
all the necessary findings relating to these ef- 
fects. Application of the Newman-Keuls tech- 
nique for comparisons of the eight means in- 
volved indicates only one significant difference 
at the~.05 level. Irrespective of encoder sex, 
the average orientation away from liked fe- 
male addressees of low status (39.0°) is 
greater than the average orientation away 
from disliked male addressees of high status 
(24.3°). 

In considering the Addressee Sex X En- 
coder Sex interaction effect, ¢ tests were ap- 
plied for comparison of the four means in- 
volved and the following significant differences 
were obtained at the .05 level. The average 
orientation of male encoders away from 
the female addressees (31.9°) is greater than 
the average orientation of male encoders away 
from male addressees (24.4°). Also, the aver- 
age orientation of female encoders away from 
male addressees (40.5°) is greater than the 
average orientation of female encoders away 
from female addressees (31.7°). In short, this 
interaction effect is due to a greater degree 
of average orientation of an encoder away 
from an addressee of the opposite sex than 
that away from an addressee of the same sex. 

While, as noted above, the results of the 
separate analyses of variance performed for 
each of the head, shoulder, and leg orienta- 
tion measures are congruent and need not be 
separately discussed, there is one effect ob- 
tained from the separate analysis of variance 
of the leg orientation measure which is of 
some interest. In that analysis a significant 
effect due to the encoder sex factor was ob- 
tained (F = 96.7, df = 1/46, MS. = 3,170.9, 
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p< 01). The leg orientation of female en- 
coders away from their addressees (39.7°) is 
greater than the leg orientation of male en- 
coders away from their addressees (28.8°), 


Openness Measures 


All openness measures were obtained in 
such a way that greater openness corre- 
sponded to larger numbers. The product- 
moment correlation of arm openness scores 
with the leg openness is —.02. 

Analysis of variance of the arm openness 
scores shows significant effects due to ad- 
dressee status (F = 5.8, df = 1/46, MS,= 
.73, < .05) and Addressee Status X En- 


\coder Sex (F= 5.8, df = 1/46, MS, =.73, 


p< .05). Since these two effects are hier- 
archically related, a consideration of the 
interaction effect will summarize all the neces- 
sary findings relating to the effects. Applica- 
tion of ¢ tests for comparison of the four 
means involved indicates the following sig- 
nificant differences at the .05 level. The mean 
score for arm openness of female encoders 
communicating to high-status addressees 
(1.62) is significantly lower than the mean 
scores for arm openness in the female encoder 
and low-status addressee (2.05), male encoder 
and high-status addressee (1.93), and male en- 
coder and low-status addressee (1.93) condi- 
tions. The latter three do not differ signifi- 
cantly from each other. 

Analysis of variance of the leg openness 
scores shows only a significant effect due to 
encoder sex (F= 51.8, df= 1/46, MS.= 
2.87, p < .01). The mean degree of leg open- 
ness of male encoders (2.05) is greater than 
the mean degree of leg openness of female 
encoders (.80). 


Relaxation Measures 


These measures were obtained such that 
greater numbers correspond to greater degrees 
of relaxation. Since the judgments of hand 
and foot relaxation is unreliable, only the 
results for the trunk relaxation indexes (i.€; 
backward and sideways lean) will be reported. 
The number of degrees that an S leaned back- 
ward from the vertical served as one index. 
Analysis of variance of these scores shows 
significance due to attitude (F = 22.4, df= 
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1/46, MS, = 269.9, P< .01) and encoder sex 
(F = 19.6, df = 1/46, MS, = $23.7, p < 01). 
The mean angle of backward lean with liked 
addressees (1.4°) is less than the mean angle 
with disliked addressees (9.3°). Furthermore, 
the mean angle for female encoders (.2°) is 
less than the mean angle for the male encoders 
(10.5°). These results may be summarized by 
noting that the torso lean is more backward 
for disliked addressees than for liked ad- 
dressees and that the lean of the torso of 
male encoders is typically further back than 
the lean of the torso of female encoders. 

Analysis of variance of the second trunk 
relaxation index (i.e., the angle of sideways 
lean of S’s torso away from the median plane 
of the chair in which he sits) shows signifi- 
cant effects due to addressee status (F = 26.1, 
df = 1/46, MS, = 65.9, p < .01). Both male 
and female encoders show greater trunk relax- 
ation in communicating to low-status ad- 
dressees (8.5°) than in communicating to 
high-status addressees (4.3°). 


DISCUSSION 


This study was designed to help determine 
the salient posture and position variables in 
the communication of affect to addressees of 
different status and sex. The study was under- 
taken with a primarily empirical rather than 
theoretical orientation and indicated that 
some of the dependent measures which were 
employed were significant indexes of atti- 
tude or status differences, whereas others 
Were either unreliable or insignificant in the 
experiment. 

Relaxation in the posture of a commmuni- 
Cator indicates both his attitude and his rela- 
tive status to his addressee. An integration of 
findings obtained in the present study with 
findings obtained in Mehrabian’s (1968) 
study suggests the following postural indexes 
for increasing degrees of relaxation. Least 
relaxation: obvious muscular tension in hands 
and rigidity of posture as exhibited by a 
straight back and upright trunk; moderate 
relaxation: subtle indicators of relaxation 
such as trunk position, for example, a 20° 
arara lean and a less than 10° sideways 

an, curved back and in the case of females 
an open arm position; extreme relaxation: 
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obvious trunk relaxation as indicated by a 
greater than 20° backward lean and a greater 
than 10° sideways lean. The findings of the 
present study, together with the findings of 
Mehrabian’s (1968) study, suggest that there 
is a curvilinear relationship between the atti- 
tude of a communicator toward his addressee 
and the degree of relaxation manifested by 
the communicator. Degree of relaxation is 
either very high or very low for a disliked 
addressee and is moderate for a liked ad- 
dressee. It would seem that a very low degree 
of relaxation, that is, extreme tension, would 
occur with threatening addressees and a high 
degree of relaxation would occur with non- 
threatening addressees who are disliked. 

In contrast to its relationship to attitude, 
relaxation seems to exhibit a linear relation- 
ship with status, as follows: Of the three 
degrees of relaxation (i.e., obvious tension, 
moderate relaxation, and extreme relaxation) 
only the latter two degrees are relevant for 
describing the relationship of relaxation to 
status of an addressee: there is a high degree 
of relaxation with a low-status addressee, a 
moderate degree of relaxation with a high- 
status addressee, and an intermediate degree 
of relaxation with peers. These findings are 
consistent with Goffman’s (1961) observa- 
tions. 

In addition to relaxation, distance can also 
be seen as constituting an indicator of atti- 
tude and status relationships with an ad- 
dressee. The findings of the present study 
together with findings of earlier studies 
(Leipold, 1963; Little, 1965; Mehrabian, 
1968; Rosenfeld, 1965) suggest that there 
is an inverse linear relationship between 
distance and attitude. The relationship of dis- 
tance to status can be conceptualized with the 
use of the findings in the present study and 
with Lott and Sommer’s (1967) study which 
found that distance to addressees who are of 
either a higher or a lower status than the 
communicator are greater than distances to 
peers. In the present study, it was found that 
distance to high-status, relative to low-status, 
addressees is not significantly different. It may 
therefore be hypothesized that distance to an 
addressee is a curvilinear function of the 
status of the addressee relative to the self and 
that it is equally great for low-status and high- 
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status addressees and relatively small for 
peers. 

Eye contact and directness of orientation 
toward an addressee can also serve as indexes 
of attitude and status toward the addressee. 
The findings are consistent with those of 
Exline, Gray, and Schuette (1965) and Exline 
and Winters (1965) and indicate that eye 
contact generally increases with more positive 
attitude toward an addressee. They also indi- 
cate eye contact is greater for high-status, 
relative to low-status addressees. As in pre- 
vious studies (Mehrabian, 1968; Rosenfeld, 
1965), directness of orientation toward an 
addressee is not as clearly an index of atti- 
tude and status toward an addressee, It was 
found that orientation is least direct for 
female addressees of low status and most 
direct for disliked male addressees of high 
status. This finding would suggest that direct- 
ness of orientation might be partially deter- 
mined by the threatening quality of the ad- 
dressee for the communicator and suggests 
further exploration of this particular inter- 
personal variable as a determiner of posture 

“and position cues. 

The above findings were obtained with a 
role-playing methodology which requested Ss 
to assume that they were in specific situations, 
instead of attempting to create such situations 
in the experiment. It is interesting to note 
that whenever findings from other investiga- 
tions involving different kinds of methods are 
available, the results of the present study 
are congruent with those already available. 
This is reassuring for the status of the re- 
maining findings of the present study for 
which corroborative evidence is absent. 
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Twenty maladjusted and 20 carefully matched adjusted boys and their parents 
were administered TATs and interviewed. Objective measures of achivement, 
affiliation, power, aggression, and experience balance were obtained from the 
TATs. The two groups of families differed less, motivationally, than expected 
on the basis of the clinical literature. Paternal’ aggression and maternal extra- 
tensiveness in the maladjusted group did suggest some parental conflict. The 
maladjusted boys perceived their mothers as/' distant and subtly controlling. 
There was no evidence of a positive relationship between motivational measures 
in members of the same family, There was an inverse relation between the 
maladjusted boy and his parents, particularly the mother. 


In the recent focus on the family of mal- 
adjusted individuals, it is frequently observed 
that the needs and conflicts of the parents are 
of central importance in the development of 
pathological behavior in the child (Ackerman, 
1958; Handel, 1967; Lidz, Fleck, & Corneli- 
son, 1965; Spiegel & Bell, 1959). Thus, for 
example, a child’s difficulty in school may be 
related to an excessive or a deficient motiva- 


tion to achieve on the part of one or both 
Patents. Furthermore, it has been suggested 


that each family has a core problem revolving 
around some basic motive that is manifest 
directly or indirectly in each family member. 
Unfortunately, there is relatively little direct 


evidence for these phenomena other than 
clinical impressions. 
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One of the strategies for studying the needs 
and conflicts of the family has been the 
administration of projective tests, particularly 
the Thematic Apperception Test (TAT), to 
several members of the same family. Earlier 
exploratory studies (e.g., Fisher & Mendel, 
1956; Rosensweig & Isham, 1947) suggested 
both direct and inverse relationships between 
TAT themes in members of the same family. 
However, there have been few systematic 
studies using this technique. One attempt was 
made to relate measures of aggression from 
TATs administered to members of normal 
families (Hess & Handel, 1956). A level of 
aspiration score from the TAT was found 
useful in differentiating the parents of normal, 
neurotic, and schizophrenic adults (Fisher, 
Boyd, Walker, & Sheer, 1959). The conjoint 
TAT, in which several family members tell 
stories together, has also been found to dif- 
ferentiate normal and disturbed families 
(Fisher et al, 1959; Winter, Ferreira, & 
Olson, 1965, 1966). 

The purpose of the present study was to 
compare the motivational patterns in the 
families of maladjusted boys with those of a 
well-matched group of families with adjusted 
boys. The maladjusted boys were typical 
of the intact, neurotic-level population fre- 
quently referred for treatment by the ele- 
mentary schools. The TAT was administered 
individually to the boys and their parents 
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and then scored for the following motives: 
achievement, affiliation, power, and aggres- 
sion. These were selected for two reasons: 
first, they were typical of the kinds of motives 
described in the clinical literature on family 
interaction; and, second, they had the most 
objectively scored and best validated TAT 
measures. A TAT measure of experience bal- 
ance was added because of its relevance to 
external versus internal sources of motivation. 
Additional information was obtained through 
interviews. 

It was anticipated that the two groups 
would be differentiated on the basis of motiva- 
tional measures for individual family mem- 
bers, It was also expected that there would 
be a relationship between members of the 
same family on each of the measures. On the 
other hand, the level of development of theory 
in this area was judged as not warranting the 
statement of directional hypotheses at this 
time, 


METHOD 
Subjects 


A group of 20 families with a maladjusted boy 
was obtained through a Child Guidance Center. They 
were intact, white, lower-middle- or upper-lower- 
class families, about half Protestant and half Cath- 
olic. The fathers were working and the mothers were 
at home. The parents averaged 12 years of schooling. 
The boys were between the ages of 8 and 12, of 
normal intelligence, with problems such as academic 
failure, anxiety symptoms, rebellious behavior, and 
social rejection. Boys with mental retardation, neuro- 
logical involvement, legal delinquency, or psychotic 
symptoms were eliminated. 

A group of 20 matched well-adjusted families 
were obtained through the school systems. Each ad- 
justed boy was selected from the same grade and 
school as the maladjusted boy with whom he was 
matched. The principals eliminated all boys with 
academic or social adjustment problems. The match- 
ing was done on the basis of age, intelligence, social 
class, parent’s education, and religion. Intelligence 
was estimated on the basis of standard tests in the 
records, Three graduate students determined social 
class using the Hollingshead and Redlich Scale (1958) 
with a 91% agreement. The two groups of families 
did not differ significantly on any of the matching 
variables, 7 


Procedure 


All Ss were tested and interviewed individually by 
psychology graduate and senior honors students, The 
maladjusted families were seen as part of the intake 
procedure at the Child Guidance Center. The ad- 
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justed families were contacted by phone and seen 
at the University. a 

The TAT was administered with the standard 
instructions. The cards were presented in the follow- 
ing fixed order: 2, 10, 4, 5, 7BM, New, and 16, The | 
New card showed two boys playing. The stories wer 
tape-recorded, then transcribed and coded for blind 
scoring. 


TAT Measures 


Measures were obtained from the coded TAT 
stories for four motives—achievement, affiliation, 
power, and aggression. The first three of these are 
described in detail in Atkinson (1958). The measure 
of aggression was based on a dissertation by Mac- 
Casland (1961). The main score is the ratio of 
direct plus indirect aggression to indirect aggression | 
plus inhibition of aggression. 

‘The four motives were scored by four independent 
teams of two graduate students each. Each team was 
carefully trained with the aid of published instruc- 
tions and practice stories. The reliabilities for the 
scoring of the four measures with the stories of the 
families in this study ranged from 86% to 93% 
agreement. The two groups of families did not differ 
significantly on length of story. 

The stories were also scored for a TAT analog of 
the traditional Rorschach experience balance devel- | 
oped by Palmer and Lustgarten (1962). The extra= | 
tensive score is based on descriptions of the external | 
environment and external sources of motivation, The 
intratensive score is based on expression of internal” 
needs and feelings as well as on well-developed story 
plots. The final measure used was the ratio of extra= 
tensive to intratensive scores. The coded stories were 
scored by a senior honors student. A reliability check 
showed 80% agreement. 


Interview Measures 


Two measures were obtained from six open-ended: 
questions about parents asked of the boys. The first” 
was a rating of the parent-child relationship as close | 
or distant. Two judges agreed on 86% of the ratings.” 

The second measure concerned the control tech 
niques used by the parents. “Physically Aggressive 
included hitting and spanking, while “Verbally Ag- 
gressive” included scolding and yelling. “Nonaggtes~ 
sive” included more subtle control techniques suc 
as restriction and deprivation. Two judges agreed 
on 87% of the ratings. The data for one child were 
missing. 


RESULTS 
TAT Measures 


The TAT measures for the fathers in both 
groups are shown in Table 1. The results wer 
evaluated by two-tailed ¢ tests for correlated” 
measures. The measure of aggression clearly” 
differentiates the groups. There are no signifi- 


FAMILIES oF ADJUSTED AND MALADJUSTED Boys 


cant differences in achievement, power, or ex- 
perience balance but there is a trend for a 
greater affiliation need in the maladjusted 
fathers (p < .10). Thus, the results suggest 
that fathers of maladjusted boys show a 
strong aggression motive and, possibly, more 
affiliation motivation than fathers of adjusted 
boys. 

The TAT measures for the mothers of 
both the maladjusted and adjusted boys are 
shown in Table 2. It can be seen that the 
average scores for the two groups of mothers 
on affiliation, aggression, achievement, and 
power are quite similar. None of these dif- 
ferences is significant or even approaches sig- 
nificance. The one measure that clearly dif- 
ferentiates the two groups of mothers is the 
TAT experience balance. The mothers of 
maladjusted boys had a high Extratensive- 
Intratensive ratio, suggesting that they are 
relatively more responsive to external than 
internal cues. 

The results for the TAT measures with the 
neurotic and normal boys are shown in 
Table 3. The groups do not appear to be 
significantly different on these measures, with 
the exception of an almost significant trend 
for the neurotic boys to show more of an 
aggression motive (p < .06). 


Correlational Analysis 


A correlational analysis was also made of 
the TAT measures and is shown in Table 4. 
Product-moment correlations were computed 
for the relationship between individuals in 
each family role, in each of the two groups, 


TABLE 1 


Mean TAT MEASURES FOR FATHERS OF 
MALADJUSTED AND ADJUSTED Boys 


Fathers | Fathers 
of mal- | of ad- 
Measure adjusted | justed i 
boys boys 
Achievement motive 1.00 95 03 
liation motive 12.05 9.40 1.82 
wer motive 6.80 6.05 88 
pubtession motive 1.82 140 | 2.54* 
Xperience balance 38 33 85 


x 
? <.05, df = 19, two-tailed test. 
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TABLE 2 
Mean TAT MEASURES FOR MOTHERS OF 
MALADJUSTED AND ADJUSTED Boys 
Mothers | Mothers 
Wessun of mal- | of ad- i 
7 adjusted | justed 
boys boys 
Achievement motive 1.85 1.35 38 
Affiliation motive 14.15 13.00 73 
Power motive 5.55 7.22 1.58 
Aggression motive 1.82 1.62 ve} 
Experience balance A3 HA) 3.26" 


*p <.01, df = 19, two-tailed test. 


on each of the five measures. An examination 
of the pattern of correlations for the adjusted 
families suggests very little relationship be- 
tween the TAT measures for the fathers, 
mothers, and boys. At best the pattern sug- 
gests a weak positive relationship but only 
one correlation is significantly different from 
zero. 

An examination of the father-mother cor- 
relations in the maladjusted group suggests 
that there is nothing remarkable about the 
parental pattern. Except for power motiva- 
tion, the correlations between the maladjusted 
parents are not significantly different from 
zero. Furthermore, the patterns of father- 
mother correlations in the maladjusted and 
adjusted groups are quite similar with no sig- 
nificantly different correlations on any of the 
five measures. 

On the other hand, the pattern of correla- 
tions between the parents and boys in the 
maladjusted groups does appear to be dif- 
ferent with a strong tendency toward inverse 


TABLE 3 


MEAN TAT MEASURES FoR MALADJUSTED 
AND ADJUSTED Boys 


Malad- 


Measure justed sai t 
boys poe 
Achievement motive | —2.10 —.60 1.39 
Affiliation motive 7.70 8.10 50 
Power motive 6.35 5.15 1.12 
ion motive 2.31 1.75 2.05 
Experience balance .20 A7 1.01 
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TABLE 4 


Propuct-MoMENT CORRELATIONS OF TAT MEASURES BETWEEN MEMBERS WITHIN THE MALADJUSTED 
AND WITHIN THE ADJUSTED FAMILIES 


E. J. Murray, A. SEAGULL, AND D. GEISINGER 


Father-Mother Father-Son Mother-Son 
Measure 
Maladjusted| Adjusted | Maladjusted| Adjusted | Maladjusted | Adjust 

Achievement motive 24 —.24 AL 12 —.10 

Affiliation motive 16 18 —.13 19 =:00"* 

Power motive .48* 37 —.88** .00 —.66** 
Aggression motive —.06 —.16 —.18 —.30 —.47* 

Experience balance —.03 .20 18 .47* .02 


* p <.05, df = 18, two-tailed test. 
™ > <.01, df = 18, two-tailed test. 


relationships. The negative correlations for 
the power motive between maladjusted fathers 
and boys is substantial and significantly dif- 
ferent from zero. The maladjusted father-son 
power correlation is also significantly different 
from the corresponding one for adjusted 
fathers and sons (z ='4.01, p < .001). 

The results are even more striking for the 
maladjusted mother-son correlations. The cor- 
relations for affiliation, power, and aggression 
show a significant inverse relationship, Two 
of these are also significantly different from 
the corresponding correlations in the adjusted 
group (for power, z = 2.06, p < .01; for ag- 
gression, z = 2.14, p < .05). Thus, in con- 
trast to the weak positive association in the 
adjusted group, the motivational measures for 
the maladjusted boys appear to be inversely 
related to the parental measures, particularly 
those of the mother. 

The intercorrelations for the five TAT mea- 
sures for each family role in each of the 
groups ranged from low negative to medium 
positive. However, only 5 of these 60 correla- 
tions were statistically significant, a result 
that could be attributed to chance. In addi- 
tion, the correlations between each TAT mea- 
sure for a given family role and each different 
measure for each of the other family roles 
for each group were also of a chance order. 
No meaningful patterns within or between 
groups were noted among the few significant 
correlations. With correlations this low it is 
unlikely that a combination of measures 
would increase the differentiation between the 
adjusted and maladjusted groups of families. 


Interview Measures 


The results from the rating of the boy 
relationships with their parents as close 
distant are shown in Table 5. It can be 
that more maladjusted than adjusted 
describe distant relationships with their 
ents. The difference between the maladjuste 
and adjusted boys is significant for the rel 
tionship with the mother (x? = 5.7, df= 
$ < .02). A similar trend can be seen for th 
boy’s perception of the father but the diffe 
ence between the groups does not quite t 
statistical significance (p < .07). ; 

The measures for the type of control tech: 
niques used by the parents, based on the 
terviews with the boys, are shown in Table 
It can be seen that the fathers in the tm 
groups do not differ significantly in typi 
control used (y?=.9, df= 3, ns). On # 
other hand, the distributions of control te 
niques used by the maladjusted and adjus 
mothers are significantly different (x? = 12: 
df = 3, p < 01). The adjusted mothers tet 


TABLE 5 


NUMBER or MALADJUSTED AND ApyusTEp Boys | 
DEscrrBinc THEIR RELATIONSHIP WITH 
THEIR PARENTS AS CLOSE or DISTANT 


Relationship 


A with father 
Description by 


Close | Distant] Close 
Maladjusted boys 11 8 12 
Adjusted boys 17 3 19 
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TABLE 6 


NUMBER OF MALADJUSTED AND Apyustep Boys DE- 
SCRIBING THEIR PARENTS CONTROL TECHNIQUES AS 
PHYSICALLY AGGRESSIVE, VERBALLY AGGRESSIVE, 
OR NONAGGRESSIVE 


Parental control techniques, 
Descrip-] Description 
ji y Physi- | Insuffi- 
ge p Nonag- vapaa cally | clent 
gressive | ĉ8gre™ | aggres- |informa- 
sive tion 
Maladjusted 
boys 5 6 5 Kt 
Father 
Adjusted 
boys 5 8 3 4 
Maladjusted 
boys 8 5 3 3 
Mother 
Adjusted 
boys 0 14 3 3 


to use verbal aggressive techniques, such as 
scolding and yelling. The maladjusted mothers 
tend to use the more subtle, nonaggressive 
techniques of restricting and depriving. 


DISCUSSION 


Perhaps the most striking result of this 
study was that the motivational differences 
between the parents in the two groups were 
considerably fewer and smaller than antici- 
pated on the basis of the clinical literature. 
Only paternal aggression and maternal extra- 
tensiveness differentiate the groups at a satis- 
factory statistical level. It is possible that 
other motivational variables would differenti- 
ate the groups. Nevertheless, within the limits 
of the measures used in this study it would 
not appear that parents of maladjusted chil- 
dren are grossly different, motivationally, than 
parents of adjusted children. This supports 
Frank’s (1965) conclusion that when objec- 
tive measures and adequate controls are used, 
it is difficult to distinguish between families 
of emotionally disturbed and normal children. 

Although the parents in the two groups are 
Not grossly different, the two significant vari- 
ables are of some interest. The higher aggres- 
sion score of the fathers in the maladjusted 
group is consistent with studies showing 
Steater hostile interaction in maladjusted than 
adjusted families (Gassner & Murray, 1969; 
Winter et al., 1966). Since these studies used 
Interaction measures, it was difficult to dis- 
tinguish the main source of hostility. The 
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present findings suggest that the more overt 
hostility is displayed by the father and, to 
some extent, the boy. The mother may play 
a more subtle role. The higher extratensive 
score suggests some difficulty in reacting to 
internal feelings. In line with this, the mal- 
adjusted boys describe their mothers as rela- 
tively distant and inhibited in overt aggres- 
sion. Instead, these mothers seem to use 
subtle control techniques that could enter into 
the type of hostile interaction described by 
Spiegel (1957). 

The results from the correlational analysis 
provide little support for the view that each 
family has a core problem, at least a core 
problem captured by these motivational mea- 
sures. In the adjusted group there is essen- 
tially no relationship between family members 
in regard to motives. The slightly positive 
trend is similar to the results of Hess and 
Handel (1956) for a set of TAT aggression 
measures from normal families. Hess and 
Handel found the same order of relationship 
between individuals from the same or different 
families. In the maladjusted group there is 
also little relationship between the motives of 
the parents. In contrast to these findings, the 
TAT measures for the maladjusted boys tend 
to be negatively related, substantially in some 
cases, to the measures for their parents, par- 
ticularly the mother. Thus, the results provide 
no support for the idea of direct relation- 
ships between motives in the same family and 
support for the idea of inverse relationships 
only in the case of maladjusted boys and 
their parents. 

Does any of this help us understand the 
nature of the difficulties of the maladjusted 
boys? First of all, it should be noted that 
while the boys in the two groups behave 
differently, they do not appear to be very 
different motivationally. Furthermore, the 
parents in the two groups are not categorically 
different. Nevertheless, there does seem to be 
some hostile tension in the maladjusted fami- 
lies and the boy does seem to be alienated 
from and reacting against the parents, par- 
ticularly the mother. Conceivably, the rela- 
tively mild psychopathology in the present 
sample is related to the comparatively small 
differences in parental variables and the 
child’s reaction to these. 
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REALITY ATTENTIVENESS-INATTENTIVENESS AND 


EXTERNALIZATION-INTERNALIZATION 
IN DEFENSIVE STYLE’ 


MARSHALL BUSH,? ROBERT HATCHER, anp MARTIN MAYMAN 
University of Michigan 


This study attempts to link the degree of attentiveness or inattentiveness one 
characteristically displays toward his immediate environment and the locus of 
conflict in subjective experience. It was hypothesized that Ss who avail them- 
selves of opportunities to withdraw attention from reality when such an option 
is an adaptive alternative in a particular situation would also tend subjectively 
to experience conflict as lying within themselves, whereas Ss who remain atten- 
tionally anchored in their immediate surroundings—even when the task require- 
ments do not demand sustained environmental attentiveness, would tend to 
experience conflict as lying between themselves and something else. Several 
measures of different aspects of reality attentiveness-inattentiveness were all 
found to correlate significantly with an index of externalization-internalization 


in defensive style among a large sample of female college students. 


Several lines of investigation (Guilford & 
Guilford, 1939; Guilford, 1959; Lacey, 1967; 
Lacey, Kagan, Lacey, & Moses, 1963; 
Pribram, 1967; Singer, 1966) point toward 
the feasibility of conceptualizing a bipolar 
stimulus-response continuum which is bounded 
at one end by the cognitive processing of in- 
ternally produced stimuli and at the other end 
by the cognitive processing of externally pro- 
duced stimuli. A convenient way of formu- 
lating such a dimension is in terms of a 
directional parameter of attention deployment 
along an internal-external stimulus coordinate. 
Tn very general terms, internal stimuli refer 
to such things as memories, images, thoughts, 
and feelings, while external stimuli refer to 
the immediate environmental objects of sensa- 
tion and perception. Voth and Mayman 
(Mayman & Voth, Voth, 1962; Voth & 


1This paper is based on a dissertation submitted 
to the graduate school of the University of Michigan 
in candidacy for the degree of Doctor of Philosophy. 
The first author wishes to thank his thesis com- 
mittee, Joseph B. Adelson, Martin Mayman, Fred- 
erick Wyatt, Ira Miller, and M. Clemens Johnson, 
for advice and assistance. 

? Requests for reprints should be sent to Marshall 
Bush who is now at Mount Zion Hospital, Depart- 
Ment of Psychiatry, 1600 Divisadero Street, San 
Francisco, California 94115. 

3M. Mayman and H. M. Voth. Reality-closeness, 
phantasy, and autokinesis: A dimension of cognitive 
style. Unpublished manuscript, 1967. 

. Mayman and H. M. Voth. Reality-closeness 
and autokinesis: Experimental test of an hypothesis 


Mayman, 1963, 1967) have proposed that 
there are stable individual differences in the 
distribution of attention cathexes in terms of 
the priorities assigned to impinging internal 
and external classes of stimuli. A systematic 
presentation of this dimension as a cognitive 
control principle has been attempted else- 
where (Bush, 1968). 

Previous research has demonstrated that 
the study of cognitive control principles can 
provide access to a more refined understand- 
ing of the attentional processes and the cogni- 
tive structures underlying such classical de- 
fense mechanisms as repression, denial, isola- 
tion, and intellectualization (Gardner, 1962, 
1964; Gardner, Holzman, Klein, Linton, & 
Spence, 1959; Gardner & Long, 1962; Holz- 
man & Gardner, 1959; Witkin, 1964). The 
intent of the present study was to demonstrate 
the direct relevance of a directional param- 
eter of attention deployment for a basic di- 
mension of defensive style—externalization- 
internalization. Clinical evidence for such a 
relationship has been provided by Voth and 
Mayman (1966, 1967). 

Mayman and Voth (see Footnote 3) have 
coined the term “reality-closeness-reality- 
distance” in referring to an individual’s char- 
acteristic degree of attentional embeddedness 
in his surrounding environment, The reality- 
to explain inter- and intra-individual differences in 
receptivity to autokinetic illusion. Unpublished manu- 
script, 1968. 
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closeness-reality-distance principle describes 
the directional priorities in attention deploy- 
ment along an internal-external stimulus di- 
mension in situations where S is “free” to 
attend to “outer” or “inner” sources of stim- 
uli, that is, in situations where the adaptive 
requirements of the task do not demand active 
attention to external stimuli or active atten- 
tion to internal stimuli. The relationship be- 
tween this approach to individual differences 
in attentiveness toward the “inner” and 
“outer” world and other related typologies 
has been discussed elsewhere (Bush, 1968; 
Voth & Mayman, 1963). 

For purposes of this study externalization- 
internalization was conceptualized as the 
locus of conflict in subjective experience. An 
externalizer is an individual for whom con- 
flict experiences are environmentally em- 
bedded to an extreme degree, that is, conflict 
is experienced as lying between the self and 
something else. An internalizer is an indi- 
vidual for whom the experience of conflict is 
strongly embedded within the self. This con- 
ception of externalization-internalization deals 
not with particular “mechanisms of defense” 
but with the phenomenology of conflict 
(in which a variety of defense mechanisms 
may be implicated). This formulation of 
externalization-internalization bears much in 
common with Horney’s (1945) discussibn of 
externalization. The hypothesis to be tested 
is that reality attentiveness (attentional prior- 
ity to external stimuli) will be positively 
related to externalization, while reality in- 
attentiveness (attentional priority to internal 
stimuli) will be positively related to inter- 
nalization in defensive style. 

The underlying rationale for this prediction 
is that defensive operations employ the same 
ego apparatuses and structures that are used 
in nondefensive cognitive functioning. From 
one standpoint defenses represent the exploita- 
tion of adaptive reality testing processes to 
reduce awareness of internal sources of threat 
and distress. Both on a phenomenological and 
a process description level externalization may 
be viewed as an outgrowth of a high degree 
of attentional anchorage in one’s immediate 
external reality. Not only does the atten- 
tional priority given to external stimuli sensi- 
tize the individual to reality representations of 
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internal conflict, but the resulting phenomeno. 
logical world contingent on extreme reality 
attentiveness is deficient in the Categories 
of subjective experience that are requisite for 
the apprehension of internal conflict, 

Whereas a high degree of continuous reality 
attentiveness creates a set to refer conflict 
experiences to environmental circumstances, 
a high degree of attentiveness to internal 
thoughts and feelings creates the disposition 
to recognize and experience conflict as lying 
within oneself. The individual who habitually 
gives attentional priority to internal stimuli is 
better equipped with the experimental struc- 
tures for cognizing and tolerating conflict 
within the self. That the externalization- 
internalization distinction in defensive style is 
of more than academic interest has been 
shown by Voth and Mayman’s ( 1966, 1967) 
findings about the course of psychotherapy 
with these two different kinds of individuals. 

This article reports that portion of a larger 
investigation (Bush, 1968) which attempted 
to relate several different aspects of reality 
attentiveness (rather than several parallel 
measures of the same variable) to externaliza- 
tion-internalization in defensive style. In par- 
ticular the distinction was drawn between 
response potential as demonstrated in labora- 
tory tasks and frequency of behavioral mani- 
festations of this potential in everyday activi- 
ties. It was felt that an individual’s capacity 
to withdraw attention from environmental 
stimuli in an experimental situation would not 
necessarily show a perfect correspondence 
with the degree of reality in attentiveness he 
characteristically exhibits in his daily behav- 
ior, although the two should be positively 
related. 


METHOD 
Subjects 


The Ss in this study were 83 female college stu- 
dents at the University of Michigan. The sample 
included 38 freshmen, 35 sophomores, and 10 juniors. 
Their ages ranged from 17 to 22 with a modal age 
of 19. All Ss were drawn from the introductory 
Psychology courses and received course credit for 
their participation. 


Measures of Reality Attentiveness and 
Inattentiveness 


The first three measures to be described are based 
on the autokinetic test and replicate the procedures 


— 
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employed by Mayman and Voth (see Footnote 3, 
1967). For a detailed description of these measures 
the reader is referred to Mayman and Voth (see 
Footnote 3, 1967) and Bush (1968). 

Extent of perceived autokinetic movement. The 
yationale for use of the autokinetic test as a measure 
of reality attentiveness is presented in a series of 
articles by Voth and Mayman (Mayman & Voth, see 
Footnote 3; Voth, 1962; Voth & Mayman, 1963, 
1966, 1967, 1968). For purposes of the present study, 
autokinetic responsivity was thought to represent a 
psychological (and perhaps also neurophysiological) 
predisposition toward greater ease in the withdrawal 
of attention from external reality and a heightened 
sensitivity to inner sources of stimulation. The extent 
to which such a constitutionally based predisposition 
will reflect itself in overt behavior is strongly influ- 
enced by defensive needs and character style (Voth 
& Mayman, 1967, 1968). 

In the autokinetic test S viewed a stationary pin- 
point of light (.0006 foot candles in intensity) from 
a distance of 9 feet in a totally darkened room. 
During the 10-minute test period, S was asked to 
trace with a pencil whatever movement she observed. 
The S’s score was the logarithmic conversion of the 
length of line drawn to record the apparent motion 
of the light. (The logarithmic conversion is needed 
to normalize the distribution of autokinetic scores.) 
A test-retest reliability coefficient of .83 has previ- 
ously been reported for this measure of autokinetic 
perception (Voth & Mayman, 1963). 

Loss of awareness of the experimental room. This 
measure, and the following one to be described, was 
based on a structured interview conducted immedi- 
ately following the autokinetic test. Part of the 
inquiry was intended to ascertain the degree to 
which S preserved a mental image of the experi- 
mental room during the 10-minute test period. This 
Procedure was devised by Mayman and Voth (see 
Footnote 3, 1967) to provide a direct experimental 
Measure of reality-closeness-reality-distance. The de- 
gree to which S remains aware of the presence of 
the room, despite its visual obliteration by darkness, 
is a concrete experimental manifestation of the ease 
with which he can sustain attention to, or, con- 
versely, withdraw attention from, his immediate 
Surroundings when either alternative is an adaptive 
Option offered by the test situation. 

The S was asked such questions as, “Can you 
remember your impressions of the room as you were 
Watching the light?” “To what extent did you 
temain aware of some of the things in the room?” 
and “If the room seemed different to you, to what 
extent did it continue to seem like a room at all?” 

sponses varied greatly from the extreme of losing 
all Sense of being in any type of enclosure to 
Temaining well aware of the presence and contents 
of the room throughout the entire 10-minute auto- 
netic test period. 

a he protocols were later rated blindly by two of 

e Es* using a 5-point scale. A 1-point rating indi- 
fated minimal loss of awareness of the room and a 


‘Marshall Bush and Martin Mayman. 


345 


5-point rating extreme obliviousness to the presence 
of the room. The index of rater agreement (as mea- 
sured by the product-moment correlation coefficient) 
was 83. 

Degree of increased subjectivity. This measure was 
intended to assess the degree to which S became 
immersed in a subjective experience (such as a 
fantasy or a reminiscence of a past experience) while 
viewing the autokinetic light. The relevant interview 
questions inquired about the contents of S’s thought 
processes during the 10-minute autokinetic test period 
(e.g, “What did the light make you think of?” 
“When it seemed like you were watching a 
where did you seem to be 
watching it from?” and “What other thoughts 
passed through your mind during the time you were 
looking at the light?”). 

The protocols were later rated blindly by one of 
the Es and an advanced graduate student in clinical 
psychology. A 5-point scale was employed to indi- 
cate the degree to which S shifted from a reality- 
oriented to a highly subjective frame of reference. 
The 1-point scale position denotes a “realistic” experi- 
ence of the autokinetic situation while the 5-point 
scale position indicates that the autokinetic test was 
transformed into a highly subjective imaginary ex- 
perience. The correlation between the scores assigned 
by the two raters was .70. 

These two interview-based measures, while focus- 
ing on different aspects of attentional withdrawal 
from one’s immediate reality situation, are not com- 
pletely independent of each other. Overlapping data 
were used in making each of the ratings. Moreover, 
a high degree of increased subjectivity necessarily 
implies some loss of awareness of the experimental 
room. However, there were enough cases of disparity 
between the two measures to warrant keeping them 
separate. 

The Reality -Attentiveness—Reality -Inattentiveness 
(RA-RI) Questionnaire. A 26-item questionnaire 
(Bush, 1968) was used to measure an individual’s 
characteristic degree of environmental attentiveness. 
The questions were exclusively restricted to everyday 
behavioral manifestations of attentiveness or in- 
attentiveness in a variety of situations. The range 
of behavior covered in the item content included 
time estimation, spatial orientation, calendrical orien- 
tation, experiences of “engrossment,” and a variety 
of memory phenomena which could be attributed to 
reality attentiveness or inattentiveness. Care was 
taken to keep all of the items well within the range 
of normal behavior and to phrase the statements 
in a way that minimized their social desirability 
value. 

Each item consisted of a statement which S was 
asked to endorse on a 4-point scale (rarely, some- 
what, fairly, or very characteristic of you). Seventeen 
items were worded in the direction of reality in- 
attentiveness, 12 items in the direction of reality 
attentiveness. The following examples typify each 
kind of item. 
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Reality inattentiveness (RI items): 


a. You can become so absorbed in something that 
you completely lose track of the time. 

b. You momentarily forget what day of the week 
it is. 

c. Unless you are especially interested in something 
your mind easily wanders from the topic of 
conversation, 


Reality attentiveness (RA items) : 


d, You find it easy to remember the names of people 

you have just been introduced to. 

You know almost exactly how much money you 

have with you at all times, 

f. You are a naturally good score keeper at games 
and other activities. 


6 


Separate scores were obtained for those items 
worded in the direction of reality attentiveness and 
for those items worded in the direction of reality 
inattentiveness. A composite total score was obtained 
by summing these two-part scores. Thus each $ had 
an RA score, an RI, and a total RA-RI score, In 
calculating the total score, the scoring was reversed 
for the RA questions so that a high total score 
indicates reality inattentiveness and a low total score 
reality attentiveness. The odd-even split-half cor- 
relation coefficient for this questionnaire was .80. 
Application of the Spearman-Brown formula pro- 
duced a reliability coefficient of .89. 


Measure of Externalization-Internalization 


The Conflict Situations test. In order to assess 
externalization-internalization in defensive style a 
projective technique was devised, the Conflict Situa- 
tions test. This test was presented in the form of 
an opinion survey wherein Ss were asked to illus- 
trate through written stories what they felt were 
the three most difficult crises or personal problems 
faced by girls during their college years. The amount 
of time allotted for this test, 30 minutes, proved to 
be insufficient given the nature of the task. Several 
Ss could not complete three stories in the time 
allowed. Consequently only the first story is used 
in the data analysis, 

The first story written by each S was given two 
ratings. The first rating was based on a 5-point scale 
indicating the degree to which the experience of 
conflict is portrayed as occurring within the self, as 
opposed to being primarily situational, that is, em- 
bedded in particular relationships, situations, or 
events. The second rating, again using a 5-point scale, 
was based on how attention is distributed within the 
story between the internal and external referents of 
the conflict situation. A high rating here indicates 
that a large proportion of the story was devoted to 
describing the subjective side of the conflict experi- 
ence in the central character. The S’s score was 
simply the sum of these two ratings. A high score 
represented internalization, and a low score externali- 
zation in defensive style. 
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Both ratings were made for every S’s first story 
by two experienced clinical psychologists. The mean 
of the externalization-internalization scores assigned 
by the two raters served as the measure of defensive 
style for each S. Due to several cases in which the 
raters showed wide disagreement, the correlation be- 
tween the scores assigned by the two raters was 
only .57. To evaluate the effect of eliminating those 
Ss whose stories produced the largest rater dis- 
agreement, a separate analysis was performed after 
excluding all cases in which the raters showed a 
disagreement of four or more points in the assign- 
ment of externalization-internalization scores. The net 
result was to eliminate nine Ss (leaving an N of 74) 
while increasing the reliability of the measure of 
defensive style for the rest of the sample from $7 
to .71. 


Procedure 

The Ss were seen for two experimental sessions, In 
the first 1-hour testing session, Ss were individually 
administered the autokinetic test, an autokinetic test 
interview, and the Conflict Situations test. The 
RA-RI Questionnaire was administered during a 
second testing session. The participants had no prior 
knowledge about the purpose of the experiment nor 
of the fact that the two testing sessions were part 
of the same study. A few Ss did not complete their 
first testing session, while several others were unable 
to attend the second session, with the result that only 
partial data were available in nine cases. 

{ 


RESULTS 


Table 1 presents the correlations between 
the measures of reality attentiveness-inatten- 
tiveness and the measure of defensive style. 
Despite the rather low reliability of the 
externalization-internalization score, it none- 
theless correlates with four of the measures 
of attentiveness-inattentiveness at the .05 level 
of significance, and with the remaining two 
measures at the .01 level of significance (two- 
tailed probability). When Ss producing the 
largest rater disagreement are eliminated from 
the samples (which increases the reliability 
of the measure of defensive style from .57 to 
.71) a slight increase occurs in the magnitude 
of the correlations between the measures of 
attentiveness-inattentiveness and the external- 
ization-internalization score. In all cases the 
correlations in Table 1 are in the predicted 
direction, confirming the predicted relation- 
ship between attentional anchorage in ones 
immediate environment and the tendency t° 
externalize the experience of conflict. 

While it was not possible to directly control 
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r the effects of intelligence on the observed 
s, verbal ability scores and cumulative 
point averages were obtained for all Ss 
ie samples. The verbal ability score from 
Scholastic Aptitude Test correlated sig- 
ficantly with the RA subscore and the total 

e on the RA-RI Questionnaire (—.36 and 
24, respectively), but did not correlate sig- 
“nificantly with the measure of defensive style 
or the other measures of reality attentiveness- 
inattentiveness. Cumulative grade point aver- 
age correlated significantly (—.29) with only 
variable, degree of increased subjectivity. 
either verbal ability, nor academic achieve- 
ment seems to play a significant role in 
the obtained relationships between attention 
deployment and defensive style. 

“Table 2 presents the intercorrelations be- 
tween the several measures of reality atten- 
tiveness-inattentiveness. Disregarding for the 
Moment the correlations involving the auto- 
netic score, the other measures of reality 
 attentiveness-inattentiveness (with one excep- 
ion) all show significant intercorrelations in 
the predicted direction. The .59 correlation 
between the mean rating for loss of aware- 
mess of the room and the mean rating for 


TABLE 1 


Mean exter- 
Mean exter- | nalization-in- 
nalization-in- | ternalization 
Measure i ternalization score with 
score using all | Ss producing 
data on largest rater 
all Ss* disagreement 
removed 
tog autokinetic tracing .26* 32A 
‘Hoss of awareness of 
; m (mean rating) .29* ol 
gree of increased 
ibjectivity (mean 
ting) .28* Bae 
Questionnaire 
al score 34** 38** 
subscore 31** 36** 
Subscore ie —.27* 
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degree of increased subjectivity indicates that 
these two interview-based measures are closely 
related to each other; the withdrawal of at- 
tention from the surrounding environment 
appears to be accompanied by an internally 
directed attentiveness to one’s own thoughts, 
fantasies, memories, and feelings. 

The correlations between the mean rating 
for loss of awareness of the room and the 
questionnaire measures of habitual reality 
attentiveness-inattentiveness (.34, .30, —.30) 
indicate that a person’s attentional priorities 
in a laboratory task are reflective of his char- 
acteristic behavior in everyday life. This rela- 
tionship is, however, a modest one. Other 
factors would seem to mediate the degree to 
which the response potential evidenced in an 
experimental situation becomes a prominent 
feature of one’s everyday behavior. The mean 
rating for degree of increased subjectivity cor- 
relates significantly with only the RA items 
on the RA-RI Questionnaire, although the 
correlation with the RI items is in the ex- 
pected direction, The —.47 correlation be- 
tween the RI and RA items is only moderate, 
suggesting that the “reality inattentive” ques- 
tions and the “reality attentive” questions, 
while tapping an overlapping domain, still 
possess some independent sources of variance 
not shared by the other. 

One unexpected finding in Table 2 was the 
lack of significant correlations between the 
autokinetic score and the other measures of 
reality attentiveness-inattentiveness. No satis- 
factory explanation for this result is available. 
Further analysis (Bush, 1968) showed that 
the relationship between the amount of per- 
ceived autokinetic motion and the RA-RI 
Questionnaire was markedly curvilinear. The 
Ss who perceived the least amount of move- 
ment (2-35 inches), as well as those who 
perceived the most movement (71-400 
inches), received high scores for reality atten- 
tiveness and low scores for reality inattentive- 
ness, while the middle group (36-70 inches) 
received low scores for reality attentiveness 
and high scores for reality inattentiveness. 
To put the matter another way, a high score 
in the direction of reality inattentiveness on 
the RA-RI Questionnaire was associated with 
a moderate to high degree of autokinetic per- 
ception, whereas a high score in the direction 
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of reality attentiveness could be associated 
either with a low degree or with an extremely 
high degree of autokinetic perception. 

Some preliminary efforts to control for the 
effects of individual differences in drawing 
style on autokinetic tracings yielded a most 
unusual finding. A drawing style factor ap- 
peared to be operative only among Ss record- 
ing the greatest amounts of autokinetic move- 
ment. Among the 19 Ss who recorded more 
than 70 inches of movement, the correlation 
between the index of drawing style and the 
autokinetic score was .46 (.05 level of sig- 
nificance). For the remainder of the sample, 
the correlation between drawing style and 
autokinesis was .05. The effects of eliminating 
from the sample the 19 Ss recording the 
highest degree of movement are presented in 
Table 3. While the relationships between the 
autokinetic test, loss of awareness of the room, 
and degree of increased subjectivity remain 
essentially unchanged, the relationship be- 
tween the autokinetic scores and the RC-RD 
questionnaire is made linear. The autokinetic 
test now correlates significantly with the total 
RA-RI Questionnaire score and the RI sub- 
score, and there is an increase in the correla- 
tion between the RA-RI Questionnaire and 


TABLE 2 


INTERCORRELATIONS AMONG MEASURES oF REALITY 
ATTENTIVENESS-INATTENTIVENESS USING ALL 
AVAILABLE DATA ON ALL SUBJECTS 


Measure 2 


PEE 


1. Log autokinetic 
tracing Siea- EONO 01 

2. Loss of aware- 
ness of the 
room (mean 
rating) 

3. Degree of in- 
creased sub- 
jectivity 
(mean rating) .27* 20 

4, RA-RT Ques- 
tionaire total 
score 

5. RI subscore 

6. RA subscore 


-SOMFF 34re 30H) — 30+ 


=31m 


game — 75408 


— AE 


Note.—W varies between 75 and 83 due to mi: 
* p <.05, two-tailed. Ce 
** > < .01, two-tailed, 
"D> <.001, two-tailed. 
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TABLE 3 


INTERCORRELATIONS AMONG MEASURES oF REALTY 
ATTENTIVENESS-INATTENTIVENESS AFTER SUBJECTS 
WITH AUTOKINETIC SCORES ABOUT 70 INCHES 
Have BEEN ELIMINATED 


Measure 2 3 4 


s | 6 


1. Log autokinetic| 
tracing «18 14 

2. Loss of aware- 
ness of the 
room (mean 
rating 

3. Degree of in- 
creased sub- 


SOE AZ 41H — 33% 
jectivity 
(mean 
rating) 128s i 21 —.33* 
4. RA-RI Ques- 
tionnaire 
total score garre OAN 
ji 


5. RI subscore =.50* 
6. RA subscore 


Note.—N varies between 58 and 63 due to missing data. 
* p <.05, two-tailed. 
** p <.01, two-tailed. 
> <.001, two-tailed. 


loss of awareness of the room. One implication 
of these results which needs to be pursued in 
future research is that the degree and type 
of relationships exhibited by the autokinetic 
test, as well as the influence of a drawing 
style factor, may vary according to the range 
of scores within which one is working. 


Discussion 


The results, in general, suggest that a rela- 
tionship does exist between the attentional 
priorities one assigns to internally and exter- 
nally produced stimuli and the locus of con- 
flict in subjective experience. The Ss who 
readily withdraw attention from their immedi- 
ate environment and attend instead to inter- 
nal states—when such behavior is an adaptive 
option afforded by the response demands in 
a particular situation—also subjectively €x- 
perience conflict as something which is em- 
bedded within themselves. Conversely, Ss who 
sustain attention to their external surround- 
ings when such behavior is not necessitated 
by the adaptive requirements in a particular 
situation, also subjectively experience conflict 
as lying between themselves and something 
else. In describing conflict they focus on the 
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external trappings of the situation, rather 
than on the internal sources of tension. The 
externalization-internalization dimension ap- 
pears to be of decisive importance in deter- 
mining an individual’s capacity to respond 
to analytically oriented “insight” therapy 
(Voth & Mayman, 1966, 1967). Externalizers 
view the source of their problems as stemming 
from without and find it very difficult to 
think psychologically and make effective use 
of insight. 

It is noteworthy that all of the measures of 
reality attentiveness-inattentiveness correlated 
to a comparable degree with the externaliza- 
tion-internalization score, despite the fact that 
they correlated only very modestly with each 
other. It seems likely that we are dealing here 
with different but related aspects of atten- 
tional priorities for internal and external 
sources of stimulation. The nonsignificant re- 
lationships between the autokinetic test and 
the other measures of attention deployment 
is very puzzling in light of previous experi- 
mental results (Mayman & Voth, see Foot- 
note 3) and not readily explainable. One 
factor which seems to have played an im- 
portant part in this finding was the fact that 
this sample, in contrast to previous ones, 
lacked Ss who perceived no autokinetic move- 
ment. This restriction on the range of auto- 
kinetic scores at the ‘“low-movement” end of 
the continuum may well have prevented the 
expected relationships from appearing. Ina 
More heterogeneous sample the relationships 
which were observed might be found to be 
stronger than appeared in this study. Another 
unknown stemming from the use of a very 
homogeneous sample of female college stu- 
dents is the possible effect of sex differences 
on the variables under investigation. Clarifi- 
cation of this issue must await further study. 

We are dealing here with two domains of 
behavior which have been sampled in only a 
preliminary and incomplete manner. The locus 
of conflict in subjective experience is only 
One aspect of externalization-internalization 
in defensive style. Other related aspects of 
Coping with conflicts include how responsibil- 
ity for personal difficulties gets distributed, 
the kind of symptomatology one is likely to 
develop, the kind of resolution which is sought 
after, and expectancies concerning the control 
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of reinforcement (Rotter’s, 1966, internal- 
external control dimension). Similarly, with 
regard to the kind and degree of attentiveness 
one displays toward his inner states and outer 
environment, we have dealt with only a lim- 
ited segment of a very complex area of human 
behavior. People differ in the areas of reality 
toward which they remain attentive and in 
the modalities through which they attend, 
just as they differ in the areas of self- 
experience toward which they are charac- 
teristically attentive or inattentive. As the 
dimensionality of reality attentiveness-inat- 
tentiveness and externalization-internalization 
in defensive style is further explored and 
understood, it should be possible to link 
particular components of attentiveness-inatten- 
tiveness with specific facets of externalization- 
internalization, A more differentiated un- 
derstanding of the enduring parameters of 
attention deployment will not only shed light 
on the defensive operations (and other ego 
structures and functions) which use these 
attentional processes, but may in addition 
provide the key to a more adequate under- 
standing of the individual’s phenomenological 
world. 
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VALIDATIONAL STUDY OF MARITAL STATUS AND 


correlations between the criterion 


. j y prognosis were discussed. 
a 
The process-reactive continuum has re- 
ceived much attention in reducing hetero- 
‘geneity and predicting prognosis of patients 
diagnosed schizophrenic (Garmezy, 1965; 
Herron, 1962). The measurement of this di- 
mension has been based on either the Elgin 
(Becker, 1956; Wittman, 1941) or Phillips 
(1953) scales. Both scales are time consum- 
require a great deal of effort, and are 
ect to extraneous sources of variance, for 
ple, the varied quality of social and 
medical records. These limitations were over- 
‘tome by Ullmann and Giovannini (1964) who 
developed a 24-item, true-false self-report 
Scale which obtains biographical information 
to reflect the nature of the patient’s prior 
Social interaction with his environment. The 
mocedure for selecting items was based on 
face validity as judged by three clinicians 
and by internal consistency criteria. No 
Gxternal criterion was used either in the 
" Scale’s development or to demonstrate its pre- 
ive validity. Construct validity for the 
: self-report scale was provided by Johnson and 
Reis (1967) and Held and Cromwell (1968) 
who found a significant correlation between 
; self-report scale and Phillip’s scale and 
Ober (1966) who found a significant cor- 
‘elation between the self-report scale and the 
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THE SELF-REPORT SCALE FOR PROCESS- 
REACTIVE SCHIZOPHRENIA 


DONALD H. MEICHENBAUM + 
University of Waterloo, Ontario 


Evidence for both predictive and construct validity for the self-report scale 
and maritus status was obtained for 62 male schizophrenic patients. Six-month, 
1-year, and 2-year follow-ups indicated that the self-report scale and marital 
status were equally efficient in predicting length of hospitalization. The inter- 
measures indicate that length of present 
hospitalization is the best predictor of length and future hospitalization. A con- 
sistent finding for both male schizophrenic and male normals was a significant 
relationship between level of cognitive ability and marital status. Married and 
divorced males, both schizophrenic and normal, 
on the proverbs test and gave more common 
Possible factors contributing to the efficacy of marital status in predicting 


performed on a superior level 
associations than single males. 


Zigler and Phillips (1962) Social Competence 
Scale. 

The purpose of the present investigation 
was to examine the predictive and construct 
validity of the process-reactive self-report 
scale. A second purpose was to compare the 
predictive effectiveness of the self-report scale 
with marital status. 


METHOD 
Subjects 


Sixty-two male schizophrenics from the Veterans 
Administration Hospital, Danville, Illinois, who had 
been previously involved in other research (Meichen- 
baum, 1966a) were selected from acute admission 
wards and back wards. The Ss represent a stratified 
sample, in terms of age, level of education, and 
marital status, of patients diagnosed schizophrenic 
at the hospital. The Ss whose diagnosis of schizo- 
phrenia may have been questionable were not in- 
cluded. At the time when the self-report scale was 
administered the average age of sample was 38.40 
with a range of 24-48; average length of present 
hospitalization was 12.50 months with a range of 
1-48 months; mean number of hospitalizations was 
one and one-half; and average level of education 
was 11.96 years with a range of 6-16 years. The 
present sample represents a broad range of pathology 
including a number of extreme process and reactive 


schizophrenics. 


Procedure 


Administration of self-report scale. Pilot research 
indicated that schizophrenic patients often misinter- 
preted the questions, or could not support the answer 
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they offered. Thus, the self-report scale? was ad- 
ministered verbally to each S. This procedure was 
followed to insure the likelihood of accurate report- 
ing and to expand the range of Ss to whom the self- 
report scale could be administered. Belligerent and 
disoriented patients who were unwilling or unable 
to take the test on their own were testable in this 
manner. 

Marital status. One of the major classes of items 
in many prognostic scales such as the Phillips (1953), 
Modified Elgin Scale (Becker, 1956; Wittman, 1941), 
and the self-report scale (Ullmann & Giovannini, 
1964) concerns the heterosexual behavior and marital 
status of the patient. Several investigators (Farina, 
Garmezy, & Barry, 1963; Garfield & Sundland, 1966) 
have reported that marital status has proved a 
successful prognostic indicator. Thus, the relative 
effectiveness of marital status in predicting outcome 
was examined. The relationship between a 3-point 
marital scale (1-married, 2-divorced, 3-single) and 
the criterion was examined. In the present sample 
18 Ss were married, 10 were divorced, and 34 were 
single. 

Criterion of predictive validity. The criterion of 
predictive validity employed was the total number 
of days the patient spent in a neuropsychiatric 
hospital (TDH) from the day the self-report scale 
was administered until a 6-month, 1-year, and 2-year 
follow-up was obtained. Social service and medical 
files were examined to keep track of hospital dis- 
charge and possible rehospitalizations. The discharge 
of the patient from the hospital was entirely inde- 
pendent of his score on the self-report scale. A 
positive correlation between process-reactive scores 
and TDH means that those Ss who scored toward 
process end remained in the hospital longer. A posi- 
tive correlation between marital status and TDH 
means that those Ss who are single remain in the 
hospital longer. 

Assessment of construct validity. One kind of evi- 
dence for the construct validity of prognostic indi- 
cators of the process-reactive continuum is a signifi- 
cant correlation with level of thinking pathology, 
namely, those Ss who indicate a more adequate level 
of prior social adjustment do better on conceptual 
tasks; whereas those Ss who evidence less adequacy 
do more poorly on conceptual tasks. In order to 
assess this relationship, three cognitive measures were 
administered: the Vocabulary subtest of the 1960 
Stanford-Binet, Form L-M; the Similarities Subtest 
of the WAIS; and Kaufman Proverbs Test (1960), 
Form I. The size of the sample for the assessment 
of construct validity was expanded to include 48 
additional schizophrenic Ss from the same hospital 


2 Question 15 was changed to read “More than 
once in the last year after some group meeting I 
have talked with some other members about some- 
thing that went on.” The phrase “I have stayed on” 
was deleted because the patients in group meetings 
at the Veterans Administration Hospital are typically 
ushered out of the meeting room at the completion 
of the group session. 
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who were used in another research project (Meichen- 
baum, 1966b). The additional Ss were more chronic 
and manifested a greater degree of schizophrenicity, 
especially pathognomic verbalizations. The inclusion 
of these additional Ss provides a broader range of 
pathology to determine the relationship between 
cognitive functioning and process-reactive scores, 
The relationship between cognitive functioning and 
marital status was also examined. 


RESULTS 


Table the 


1 presents intercorrelations 
between process-reactive self-report scores, 
marital status, and the criterion total number 
of days hospitalized (TDH). Four TDH 
scores are reported, namely, the number of 
days hospitalized in first 180 days; the 
number of days hospitalized in first 365 days; 
the number of days hospitalized in the second 
365 days; and the total number of days hos: 
pitalized over the 2-year period. The set of 
intercorrelations helps to answer three ques- 
tions: (a): What is the relative effectiveness 
of the process-reactive self-report scale and 
marital status in predicting TDH? (b) 
What is the relationship between the self- 
report scale and marital status? (c) What is 
the relationship between length of present 
hospitalization and length of future hospitali- 
zations? 

The significant correlations between self- 
report scale and TDH, and marital status and 
TDH (p< .01, df = 60), indicate that both 
the self-report scale and marital status pre 
dict the length of hospitalization of patients 
diagnosed schizophrenic. The differences M 
the correlations between process-reactive 
scores and the criteria, and marital status 
and the criteria, were not significant, although 
the self-report scale had a consistent trend 
of higher relationships with the criteria. h 


SCORES, MARITAL STATUS, AND TOTAL 


TABLE 1 
INTERCORRELATIONS OF PrOCESS-REACTIVE 
Days HOSPITALIZED 


Marital | TDH | TDH | TDH 


Correlates status | 1-180] 1-365] 366-730 


Process-reactive scale | .776 | 411 | .441 | .341 
Marital status +325 | .389 323 
TDH 1-180 1904 | -711 
TDH 1-365 1817 
TDH 366-730 


one- 


Note.—N = 62; all correlations significant, p < 005, 
tailed. 
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The high-positive relationship (r= .776) 
between process-reactive scores and marital 
satus is supported by the mean process- 
reactive scores for married (N = 18), di- 
reed (N = 10), and single (N = 34) pa- 
nts, respectively, 6.11, 10.40, and 15.62, 
all significantly different from each other 
(p<.01). Four of the 24 items on the self- 
report scale concerns marital status and sev- 
eal items are highly correlated with marital 
satus, contributing to the high correlation 
between the two measures. 

‘The high intercorrelations between the re- 
spective TDH measures indicate that length 
of present hospitalization is a good predictor 
of length of future hospitalization. The cor- 
telation of .904 between TDH 1-180 and 
TDH 1-365 is somewhat inflated because of 
the redundancy of information in the latter 
measure. However, when one partials out the 
first 180 days from the first year score, the 
Correlation between first 6 months and second 
6 months drops only to .820, still a signifi- 
cantly high relationship. 

One strategy to assess the relative useful- 
ness of two prognostic instruments is to com- 
pate their respective correlations with the 
criterion. On this basis marital status was 
as efficient as process-reactive scores. A second 
Strategy is to determine the number of hits 
and misses in predicting length of future hos- 
Pitalization. In employing this strategy one 
must select a cutting score for the ordinal 
measure (self-report scale) and compare the 
Consequent hits and misses with the nomi- 
Nal categories of marital status. The conclu- 
Sions from such a comparison must be limited 
Since one should cross-validate cutting scores 

ona different population. A likely cutting 

Point on the self-report scale which has been 
Used for the assignment of Ss (Johnson & 
Reis, 1967; Meichenbaum, 1966b) is a median 
“Split, namely, those Ss who score 13 and 

above versus those who score 12 and below. 

Table 2 presents a comparison of the fre- 
quency distribution of self report scores and 
Marital status for TDH 1-730 which has been 
divided into quartiles. An almost identical 

Picture of efficiency is demonstrated for both 

Process-reactive scores and marital status in 

Predicting length of future hospitalization. 

oth measures are most efficient in predicting 
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who leaves the hospital earliest and who 
remains longest. Thus, using both the correla- 
tion with the criterion and number of mis- 
placements, marital status is as efficient as 
the 24-item true-false self-report scale in 
predicting length of hospitalization. 

Construct validity. Table 3 presents the 
intercorrelations between  process-reactive 
scores, marital status, and cognitive measures. 
The significant negative correlations between 
premorbid adjustment, as assessed by the self- 
report scale, and marital status, and the simi- 
larities and proverbs test provide evidence for 
construct validity of these prognostic indi- 
cators. The present results are consistent with 
the findings of Ullmann and Eck (1965) who 
found a significant relationship between the 
self-report scale and cognitive performance as 
assessed by the Holtzman Inkblot Test. The 
finding that patients with poorer premorbid 
adjustment do poorer on cognitive tests was 
also found by Becker (1965), Little (1966), 
and Meichenbaum (1966a). The schizo- 
phrenic patients’ scores on the proverbs, 
vocabulary, and similarities tests were not 
significantly related to any of the TDH 
measures (N = 62). 

An analysis of cognitive performance by 
marital status revealed a consistent picture 
of married and divorced individuals both dif- 
fering significantly from singles, but not from 
each other. Married and divorced patients 
performed at a superior level to single patients 
on the proverbs test and similarities subtest 
and on the Kent-Rosanoff word association 


TABLE 2 


FREQUENCY DISTRIBUTION ON Toran Days HOSPITAL- 
IZED AS A Function oF SELF-REPORT SCALE AND 
MARITAL STATUS 


Self-report scale | Marital status 
N . 
gs 12and | 13 ana | Married oe 
below | above aivakced| 
2-178 | 15 11 4 11 4 
198-470 | 16 6 10 i 9 
525-720 | 16 8 8 6 10 
725-730 | 15 4 11 3 12 
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TABLE 3 


INTERCORRELATIONS OF Process-REACTIVE SCORES» 
MARITAL STATUS, AND COGNITIVE MEASURES 


r Binet | Similari- 
Correlates Si vocab- | ties sub- froyo 
ulary test 
Process-reactive | .683** | —.100 | —.225** | —.345** 
Marital status —.089 | —.187* | —.323** 
Vocabulary 516**| .543** 
Similarities -708** 


Note.—N = 110. 
* p <.05, one-tailed. 
** p <.01, one-tailed. 


test, but they were not significantly differ- 
ent from each other. The married, divorced, 
and single patients did not significantly differ 
on the vocabulary test or on level of educa- 
tion, suggesting that level of intelligence or 
education does not explain differences in 
cognitive performance between groups. 

Meichenbaum (1966b) tested 20 hospital- 
ized male medical patients who had no his- 
tory of psychiatric illness on these same cogni- 
tive measures. Of the 20 patients, 10 were 
married, 1 divorced, and 9 single. The medi- 
cal patients who were single performed at a 
significantly inferior level ( < .05) on the 
proverbs test and gave less common associa- 
tions on the Kent-Rosanoff word association 
test than married and divorced patients. As 
with the schizophrenics, there were no differ- 
ences on vocabulary scores or level of educa- 
tion between single and nonsingle individuals. 

In summary, a consistent negative rela- 
tionship appears between level of premorbid 
adjustment, as assessed by self-report scale 
and marital status, and cognitive ability for 
both male schizophrenics and male normals. 
It does not appear that level of intelligence 
or education accounts for this relationship. 


Discussion 


Evidence for the predictive validity of the 
self-report scale and marital status was found. 


8The Kent-Rosanoff word association test was 
administered only to the additional 48 schizophrenics 
(Meichenbaum, 1966b). A common association re- 
sponse was defined as one given by at least 100 of 
Russell and Jenkins (1954) Ss (see Ullmann, Krasner, 
& Edinger, 1964). 
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The correlation of these prognostic indicators 
and the criterion of length of hospitalization 
and the number of misplacements indicated 
that marital status was as efficient as the 
24-item true-false process-reactive self-report 
scale. Garfield and Sundland (1966) have 
also found that marital status alone predicts 
length of hospitalization as well as prog- 
nostic scales designed to measure the process- 
reactive continuum. It appears that marital 
status (married, divorced, single) is as effi. 
cient in predicting prognosis as the prognostic 
scales (e.g., Elgin, Phillips, or self-report 
scales) thus far developed. Ullmann (1967) 
and Ullmann and Gurel (1964) found greater 
discriminability by dividing married patients 
into two groups, namely, those who list wives 
as people to notify in case of emergency and 
those who did not. 

The reasons why marital status proves s0 
effective in predicting prognosis are not 
known, Garfield and Sundland (1966) indi- 
cate two general sources of hypotheses which 
have been offered, namely, those which em- 
phasize premorbid personality and those 
which stress social and environmental factors. 
In other words, there is something about the 
married individual which contributes to his 
leaving the hospital (e.g., he may be les 
ill to begin with), or the environmental pres 
sures which surround the married individual 
contribute to his discharge. The data on the 
differences between married and single indi 
viduals on cognitive measures for both schizo- 
phrenic and nonschizophrenic males under- 
score the possibility that the married male 
may have a cognitive syle which may Con 
tribute to better prognosis. A consistent all 
significant finding was that single males pet 
formed on an inferior level on proverbs an 
gave less common associations than martie 
or divorced males, with intelligence held co™ 
stant. A common characteristic of the pe 
formance of the single individuals on the 
proverbs test was an obsessiveness in giving 
responses. Comparing random samples of mat- 
ried and single males, singles gave signi” 
cantly more words in their answers to prov: 
erbs resulting in more literal interpretation’ 
and an inferior performance. These resu ts 
suggest that unmarried males in their late 
thirties, either schizophrenic or normal, a 
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likely to manifest thought processes 
may be described as ruminative, obses- 
and less common than marrieds. One 
not know whether these thought proc- 
contribute to the fact that males are not 

arried or result from the absence of mar- 
us The heterosexual and familial respon- 
| bilities of marriage in most instances re- 


that the male use language in an 
astrumental manner, assert interpersonally, 
‘convey and validate thoughts and strategies, 
a make decisions. The unmarried male has 
less of an opportunity for interpersonal rela- 
onships and reciprocity of thoughts and thus 
‘is more prone to internalize decisions in the 
m of self-dialogue with the consequence of 
ming more ruminative and obsessive. 
us, marital status may be seen as a reflec- 
ion of a style of thought processes of the 
dividual which correlates with amount of 
improvement and length of hospitalization. 
further research needs to be conducted on 
Ognitive styles of married versus single indi- 
Viduals in normal and maladaptive popula- 
ions. The relationship between cognitive 
le and marital status in females may be 
te different than in males. 
The other major source of hypotheses de- 
Signed to explain the efficacy of marital status 
asa prognostic indicator focuses on the social 
le and environmental situation of married 
id single individuals. These hypotheses sug- 
t that married individuals leave the hos- 
tal because there are greater pressures for 
spital discharge in order to return him to 
‘the community. At present we do not know 
if such pressures exist or how they influence 
decision-making process. The emphasis in 
€ development of prognostic scales has been 
| the instrument designed to measure the 
t’s premorbid adjustment and little at- 
ition has been paid to the factors that 
fect the criterion. An investigation of the 
ges (psychiatrists, psychologists, and social 
kers) may reveal why marital status plays 
an important role in predicting prog- 
Stern, Stein, and Bloom (1956) have 
ý ibed a procedure one may follow to 
investigate the role marital status may play 
M influencing judges (see their discussion on 
analytic model). 
summary, an examination of the cogni- 


Fact: 
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tive style of married versus single males, and 
an investigation of the criterion of the judges 
may contribute to an understanding of why 
marital status is as efficient in predicting 
prognosis as more complex scales. 
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PATTERNS OF PERFORMANCE ON WISC SIMILARITIES 


IN EMOTIONALLY DISTURBED AND 
BRAIN-DAMAGED CHILDREN * 


LEON P. HALL? anp LAVERNE LaDRIERE 
Oakland Schools, Pontiac, Michigan 


One hundred and twenty emotionally disturbed and 120 brain-damaged children 
in a public school setting were matched for WISC Full Scale 1Q, CA, and sex. 
An analysis was made of the Similarities subtest responses using the error- 
analysis approach previously used for differential diagnosis of institutionalized 
adults’ performance on the WAIS Similarities. The Wilcoxon matched-pairs 
signed-ranks technique was used in analyzing the data. Brain-damaged Ss made 
significantly more restrictive errors while emotionally disturbed Ss produced 
significantly more inadequate attempts at conceptualization. The discriminatory 
power of this error type achieved significance at the .01 level when comparing 
males alone, while it achieved only the .05 level of significance when applied 
to the female subgroups. Frequency of errors of the descriptive type and of 
positive correct responses did not distinguish between the two groups. Dynamic 


implications of findings were discussed. 


Of the many attempts to differentiate vari- 
ous diagnostic subgroups on the basis of test 
performance, several studies have concen- 
trated on the process of conceptualization, 
especially as related to perception of common 
elements or of relationships, as a- possible 
critical diagnostic index. The work of Gold- 
stein and Scheerer (1941) stressed the deficit 
in abstraction as observed in object sorting 
tasks as characteristic of brain-damaged indi- 
viduals. Grassi (1953) distinguished between 
the performance of brain-damaged and of 
schizophrenic Ss on a block substitution test 
n terms of an abstract-concrete schema. 
McGaughran and Moran (1956) reported 
that conceptual communality or social agree- 
ment of concepts provided a criterion for dis- 
ctiminating between a schizophrenic and a 
Tonpsychiatric population. A further study by 
McGaughran and Moran (1957) emphasized 
degree of abstraction as a major difference in 
the conceptual processes of brain-damaged 
Patients who “underabstract” and schizo- 
phrenic patients who “overabstract.” This last 
is closely related to Cameron’s (1938) report 
of schizophrenic “overinclusiveness.” The re- 
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search of Levanthal, McGaughran, and Moran 
(1959) analyzed the conceptual performance 
of brain-damaged, schizophrenic, and non- 
psychiatric groups on object sorting, block 
substitution, and similarities tests on the 
basis of simple-complex, public-private, open- 
closed, and abstract-concrete schematic axes. 

Tutko and Spence (1962) noted that any 
approach involving analysis of correct re- 
sponses primarily reiterates the already well- 
accepted premise that the hierarchy of re- 
sponse adequacy progresses from brain dam- 
aged to schizophrenic to normal, respectively, 
along an inferior-superior continuum. These 
authors further distinguished between two 
types of schizophrenic disturbance. The first 
of these is the process-type of schizophrenia 
in which onset is slow and insidious, pre- 
morbid social history is poor, and etiological 
factors may be somatogenic or organic in 
nature. The abstractive performance of these 
schizophrenics should approximate that of 
organic patients. In contrast is the reactive- 
type of schizophrenic syndrome in which onset 
is rapid and often related to some precipi- 
tating stressful event, premorbid social history 
is adequate, and etiology is assumed to be 
primarily psychogenic in nature. Abstractive 
performance of this group should follow that 
attributed to schizophrenic populations usu- 
ally described in the literature. Tutko and 
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Spence found significant differences in per- 
formance on the Goldstein-Scheerer Sorting 
Test between brain-damaged Ss and process 
schizophrenics whose protocols contained a 
greater percentage of restrictive errors (ise; 
descriptive, nonconceptual, stimulus bound, 
concrete responses) and reactive schizophren- 
ics who presented protocols yielding the high- 
est percentage of expansive errors (i.e, over- 
inclusive, overabstractive, poor, vague, per- 
sonal responses) of all groups tested. Their 
study used a modified form of error analysis 
based on a previous study (Rapaport, Gill, & 
Schafer, 1945), In a more recent study, 
Spence (1963) extended this approach to a 
comparative analysis of incorrect responses on 
the WAIS Similarities subtest of schizo- 
phrenic, organic, and normal adult males. 
Significant differences were in the expected 
direction, with the organic group giving the 
greater number of restrictive errors. Addi- 
tional data revealed that the protocols of the 
brain-damaged group also showed the high- 
est frequency of “Don’t Know’ responses, 
whereas the schizophrenic group tended to 
avoid this response. 

Korman’s (1960) recent study has indi- 
cated that the type, extent, and location of 
damage is directly related to psychological 
performance in organic patients. Similarly, the 
importance of sample homogeneity in com- 
parisons of behavioral dynamics between dif- 
ferent diagnostic groups of more psychogenic 
origin is underlined by distinctions such as 
the one made by Spence (1963) herself 
between reactive and process schizophrenics. 

Stimulated by Spence’s findings, Watson 
(1965) analyzed error-type frequencies across 
10 WAIS subtests (Digit Symbol excluded) 
in the protocols of 40 male schizophrenic and 
40 male organic patients. Watson questioned 
the usefulness of this approach in differential 
diagnosis not only due to the fact that only 
3 of the 40 possible differences between 
groups achieved significance, but also due to 
the progressive disappearance of any such 
differences as a direct function of length of 
hospitalization. Watson’s recategorization of 
etror types as well as his application of these 
error groupings to the aggregate of 10 WATS 
subtests instead of to the Similarities sub- 
test on which they were developed reduced 
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Spence (1963) and obscured the dynamic 
rationale of Spence’s analysis. 

The present study was initiated to examine 
further the diagnostic potential of Spence’s 
approach. In order to assure comparability 
of data, error analysis in the present study | 
was restricted to the Similarities subtest | 
alone and all of Spence’s original error group- 
ings were used. Most previous studies using} 
error analysis had restricted themselves to 
adult male Ss. It was felt that an evaluation | 
of the generality and validity of Spence's | 
approach could be best served by its applica- 
tion to a younger noninstitutionalized popula- 
tion. Therefore Ss used in the present study 
range in age from 6 to 15 years. The matched- 
pair technique was used to control the factors 
of age, sex, and intellectual ability. With 
these observations in mind, it was hypothe- 
sized that a significantly greater proportion 
of restrictive errors would occur in the proto- 
cols of the organic sample; whereas expan- 
sive errors were expected with significantly 
greater frequency in the responses of the non- 
organic, psychiatric sample. Finally, compari- 
sons within syndromes were undertaken to 
determine the extent of influence of such fac- 
tors as chronological age, level of measured | 
intelligence, and sex on the diagnostic value} 
of error analysis. 


the comparability of his data with that 


METHOD 
Subjects 


WISC protocols were obtained on Ss referred t0 
the Psychological Services of Oakland Schools. The! 
organic youngsters (BD) were diagnosed on the bast 
of evidence of probable physiological trauma due to 
injury and/or illness in life histories as well as devel- 
opmental and behavioral signs of organicity. Many 
of these records also included neurological and/ot i 
EEG data or supportive medical diagnosis. The non: 
organic psychiatric Ss (ED) were diagnosed 0n the 
basis of parallel criteria including bizarre and/or ab 
normal behavior, disturbed development, and history 
of psychologically traumatic circumstances. often 
these records were supplemented with psychiatn | 
diagnoses of serious disturbance. In no case wt 
psychometric signs of either organicity or emotio 
pathology used for sample selection. All youngst 
used in this study were public school referrals ar 
although many were under medical and/or PSY p 
atric care, none had been institutionalized up t° °"| 
time of testing. t 

It should be noted that the group in the preser | 
study is referred to not as schizophrenic but as @ 


EMOTIONALLY DISTURBED AND BRAIN-DAMAGED CHILDREN 


emotionally disturbed or as the nonorganic psychi- 
atric sample. This is not to equate the diagnosis of 
adult schizophrenia with that of serious emotional 
disturbance in children. Further, since the syndrome 
qassified as schizophrenia in adults has much broader 
implications and is applied to a considerably wider 
range of symptoms and dynamics than that of 
childhood schizophrenia, it was felt that the emo- 
tionally disturbed sample provided a better approxi- 
mation of the adult reactive schizophrenic psychiatric 
sample in the literature. This assumption was en- 
hanced by the fact that the great majority of young- 
sters included in the emotionally disturbed sample 
were characterized by symptoms of withdrawal, 
bizarre behavior, and social isolation. They were 
diagnosed as severely disturbed with no known 
organic symptoms or history. 


Sampling Procedures 


The Ss were drawn from a data pool of 342 
referrals which had been definitely diagnosed as 
either organic or emotionally disturbed. Of these, 171 
children (103 boys and 68 girls) were emotionally 
disturbed, while 171 children (113 boys and 58 girls) 
were organics, The matched-pair technique was used 
in selecting the following subgroups from this popu- 
lation for consideration:* 

1. One hundred and twenty emotionally disturbed 
and 120 organic youngsters, matched for WISC Full 
Scale IQ, chronological age, and sex, were compared 
in terms of relative frequency of error type across 
syndrome. This constituted the sample from which 
the major conclusions derived from the hypothesis 
were drawn, To permit further comparisons with 
previous data on the male population alone, both 
the organic and the emotionally disturbed samples 
were broken down according to sex into comparative 
Subgroups of 80 boys and 40 girls (the other match- 
ing data remaining constant). 

2. Since the secondary purpose of the present study 
constituted an evaluation of the relative influence 
of chronological age, sex, and ability on the diag- 
nostic value of the error analysis approach, the 
following subgroups were pulled from the data pool 
to accomplish this purpose: 

2a. Sex Differences: 40 organic boys were com- 
pared with 40 organic girls, matched for chronological 
äge and Full Scale IQ, in terms of Similarities error 
frequencies. Similarly, 40 emotionally disturbed boys 
were compared with 40 emotionally disturbed girls, 
again paired according to chronological age and 

ull Scale IQ, regarding various error frequencies. 

_ 2b. IQ Differences: The relative error frequencies 
in the protocols of 40 organic boys of average-to-low 
ee 


* The complete matching data for all comparisons 
made in this article are available in tabular form on 
Tequest to the authors, Tests for biasing influences 
es Sample heterogeneity and regression effects due to 
inclusion of IQ as one of the matching criteria have 
een evaluated. This information is also available on 
request, 
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normal intellectual ability (ie., Full Scale IQ of 80 
or better) were compared with those of 40 organic 
boys of poor-to-retarded intellectual ability (ie, 
Full Scale IQ of 79 or less). Similar comparisons 
using the same cut off of Full Scale IQ of 80 were 
made of error frequencies in protocols of 40 average- 
to-slow versus 40 poor-to-retarded emotionally dis- 
turbed boys. In both comparisons, youngsters were 
paired according to sex and chronological age, but 
were deliberately chosen to be as far apart as 
possible in intellectual ability. 

2c. Chronological Age Differences: WISC Simi- 
larities error frequencies were compared in the 
protocols of 40 organic boys of 114 years or older 
versus those of 40 organic boys under that age. 
Similar comparisons were made between responses of 
40 emotionally disturbed boys above 114 years and 
40 emotionally disturbed boys under 114 years. All 
comparisons were based on youngsters paired in 
terms of WISC Full Scale IQ and WISC Similarities 
scores as well as on sex; however, youngsters were 
selected to make the age gap as wide as possible. 


Scoring Procedures 


Incorrect responses (no-credit) on the WISC 
Similarities Items 5 through 16 (Items 1-4 are 
analogies, not similarities, and were, therefore, ex- 
cluded from scoring) were relegated into one of six 
basic error categories, following Spence’s (1963) 
outline: 


1, Inadequate Conceptual (C—) attempts specifi- 
cation of common properties that are too vague, 
inaccurate, or idiosyncratic to be considered 
correct. 

2. Narrative and Descriptive (ND) responses con- 
cerning one or both of the items separately, but 
with no specification of common properties. 

3. Don’t Know (DK) responses in which S explic- 
itly indicates he did not know how the two 
items were similar. 

4. No Response (NR) given to the question. 

5. Don’t Know-No Response (DKNR)* This 
error category includes both the explicit Don’t 
Know responses noted in Category 3 and the 
Don’t Know responses implicit in the No 
Response or Category 4 error type. 

6. Denials (De) of similarity. 


All protocols were scored by two psychologists 
independently. Of the 1,085 total items scored, dis- 
agreement occurred in only 19 (1.25% of the cases). 


4 Tutko and Spence (1962) included both narrative 
and descriptive errors (the present study’s Cate- 
gory 2) and no-response categories (4 and 5) in their 
definition of restrictive-type errors. The same defini- 
tion of restrictive errors was retained herein. Thus, 
restrictive errors were operationally defined in terms 
of Error Categories 2 and 5 (the latter including the 
DK errors as part of the NR group), again attempt- 
ing to parallel the Tutko and Spence (1962) work. 
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RESULTS AND DISCUSSION 


The major purpose of the present study 
was to extend Spence’s error analysis ap- 
proach by evaluating its diagnostic potential 
when applied to children of both sexes. It was 
hypothesized that expansive errors (the 
present study’s Category 1, represented in 
Spence’s analysis as the C— error category) 
would occur with significantly greater fre- 
quency in the protocols of children diagnosed 
as emotionally disturbed while restrictive 
errors (related by Spence to the ND and 
DK-NR error categories) would be made sig- 
nificantly more often by children diagnosed 
as organic. Table 1 presents the frequency of 
occurrence of each error category of matched 
pairs for both diagnostic groups as well as 
the comparisons which significantly discrimi- 
nated between these diagnostic groups as 
determined by application of the Wilcoxon 
matched-pairs technique. Because most of the 
literature has presented data solely on male 
populations, it was felt that it would be mean- 
ingful to present a comparison of the fre- 
quency of occurrence of each error type be- 
tween diagnostic groups for boys and for girls 
within those groups separately. This would 
not only allow greater comparability with the 
literature by providing an exclusively male 
population, but also presents the opportunity 
to evaluate the differentiating potential of 
error analysis of this type in terms of female 
samples. These data, too, are included in 
Table 1. 

Perhaps the most outstanding finding re- 


TABLE 1 


FREQUENCY OF OCCURRENCE OF THE DIFFERENT 
Error TYPES BETWEEN DIAGNOSTIC GROUPS 


Error category 


Group | N 
C— |ND| DK |NR|DKNR| De 
Total BD | 120 | 142) „„| 110 | 136 |30| 166 |23 
Total ED | 120 | 180 76 |116 |35| 151 |24 
BD boys | 80] 111 el 72| 94 NESA (n 113-41 
ED boys | 80 |134 47| 71 21 92 |20 
BD girls | 40| 31 38| 42 11 53.112 
# ED girls | 40| 46 29| 45 14 59 4 
* 
> Sor 
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vealed in Table 1 is that the C— error occurs 
significantly more often in the emotionally 
disturbed group than in the organic group, 
both when sexes are combined and for each 
sex considered separately. This directly con- 
firms the hypothesis of the present study and 
offers strong support for Spence’s findings 
concerning the diagnostic value of this type 
of error. It also conforms to the generally 
accepted relationship of the expansive error 
to nonorganic psychiatric groups. Perhaps one 
generalization may be that the vague, in- 
appropriate, idiosyncratic, and conceptually 
inadequate responses of the C— or expansive 
type reflect loss of reality contact, intrusion 
of personal (often autistic) material into the 
thought process, and the overresponsive, over- 
abstractive behavior characteristic of severe 
psychogenic disturbance. 

The only other error category in which 
error frequency distinguished significantly 
between the two syndromes is the restrictive 
error, DK. This error occurs with significantly 
more frequency in the protocols of organic 
boys than in those of their psychiatric coun- 
terparts. The tendency represented by this 
type of error is consistent with the literature 
and is generally interpreted to reveal the 
diagnostic characteristics of the organic, 
notably, impairment of abstractive ability, 
concreteness, and deficient conceptualization. 
However, any interpretation of this kind must 
be restricted by the fact that the DK error 
appears with close-to-equal frequency in the 
protocols of emotionally disturbed and of 
organic girls alike. It fails to discriminate 
between them to such an extent that when 
the data of the sexes for each diagnostic group 
are combined only a moderate tendency for 
organics to make more DK errors is observed. 
The inclusion of the NR error frequencies 
with those of the DK category, as done by 
Spence, appears only to obscure the limited 
diagnostic value of the DK response. It would 
seem safe to assume that the NR error 
dynamically represents something other than 
an implicit Don’t Know and should not be 
arbitrarily combined with it. The rationale 
for the specific separation of this combined 
classification into its two component error 
types (apparently not done by Spence or 
Watson) lay in the direct assumption in the 
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literature that the specific DK response was 
dynamically related to the organic syndrome 
while the NR response might have repre- 
sented something other than an implicit DK 
g., negativism) and, in so doing, might 
have contaminated the DK category, diluting 
possible significant differences between groups 
inherent in the DK classification. 
It is also obvious from Table 1 that the 
ND error category fails to discriminate sig- 
icantly between the organic and the psychi- 
attic Ss in the present study. This agrees with 
Spence’s (1963) data which also revealed no 
significant differences between her two diag- 
nostic groups in the frequency of the ND 
response. Since the ND error has usually 
fallen into the restrictive category as a chain- 
ing or naming response, stimulus bound and 
concrete, Spence had initially anticipated its 
greater frequency of occurrence in her organic 
group. Its failure to discriminate between the 
two groups suggested the necessity of a 
broader dynamic interpretation of this type 
of incorrect response. It may indeed have 
represented the stimulus bound, nonintegra- 
tive, perseverative naming and chaining proc- 
ess characteristic of the organic mental 
process. However, it may have been equally 
indicative of the overinclusiveness process 
ften noted in the nonorganic psychiatric pa- 
tient. Theoretically, at least, this second pos- 
bility would allow for equal occurrence of 
his kind of error in both syndromes. How- 
ver, a closer examination of the frequency 
on of the present study does not indicate 
Mat this is quite the case. It is easily seen 
that there is a strong tendency for greater 
frequency of the ND response among organics 
both when the groups are combined or when 
the sexes are considered separately. It would 
Stem that this error category has potential 
Magnostic significance. Perhaps further refine- 
Ment in scoring procedures to separate the 
erative naming of the organic from the 
‘ombination responses characteristic of psy- 
fiattic patients might eliminate the overlap 
and tealize this potential. 
_ The final error type for consideration is 
the relatively rare De response. Spence re- 
‘this type of error to several possible 
‘namics. First, it could be related to the 
Phrenic’s indifference to reality stan- 
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dards, his unwillingness to give a DK re- 


sponse, and, in such a frame of reference, 


might be a downward extension of the C— 
response. Second, it might represent the 
suspiciousness in the schizophrenic personal- 
ity such as was noted by Weiner (1957). 
Finally, it might be simply negativism. In all 
of these interpretations the De response 
dynamically fits the psychiatric patient, and 
Spence’s data showed that although rare and 
therefore statistically tentative due to small 
N, for her sample, the De response occurred 
significantly more frequently in the protocols 
of schizophrenics than those of organics, The 
present data fail to confirm this finding, nor 
do they reveal any tendency in that direc- 
tion. Further, on the assumption that the 
NR response might as easily represent nega- 
tivism or psychotic muteness as the implicit 
Don’t Know assumed by Spence, the De and 
NR categories were combined, to the advan- 
tage of neither. It might be asserted that due 
to the relative rarity and to the general fail- 
ure to discriminate diagnostically of the NR 
error and possibly of the De category, both 
are of little value in the general application 
of error analysis. 

In order to better define the relationships 
indicated in Table 1 the conduct of this re- 
search turned to an analysis of the influence 
of the variables of chronological age, sex, and 
intellectual ability on the diagnostic value 
of the error-analysis approach, Comparisons 
were made within each diagnostic syndrome 
to test the effect of these parameters on the 
frequency with which the various types of 
errors occurred. Again, the matched-pairs tech- 
nique was applied in the selection of these 
subgroup samples from the data pool. The 
Wilcoxon test was used to measure the sig- 
nificance of the differences in error frequencies 
between comparison groups. The frequency 
data for these subsidiary comparisons can be 
found in Table 2 along with indications of 
significance wherever it appeared. 

The first set of comparisons attempted to 
determine whether or not sex was a major 
factor influencing error frequencies. As can be 
seen from Table 2 the comparison of error 
responses of organic boys versus organic girls 
yielded no significant differences in any error 
type between the sexes. Similar comparison 
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TABLE 2 


COMPARISON OF THE INFLUENCE OF SEX, IQ, AND, 


CHRONOLOGICAL AGE ON ERROR FREQUENCIES 
WITHIN DIAGNOSTIC GROUPS 


Error category 


Group N 
C— | ND |NR| DK |DKNR| De 
BD male 40|45 [31 |13]42 | 55 fi 
BD female 40 | 33 35 15 | 41 56 11 
ED male 40 78 \ os 32 | 12 | 30 es 8 
Bp female 10 |do] 321, 421a 2 [3t bal dole | A 
male—low I! 40 | 32 
BD male—high 10| 40 | 56 J * 26)" 6|61j* ye 0 
ED male—low IQ |40| 71 | 34 | 6| 40 ran 9 
ED male—high IQ |40 | 67, |24 |12|56 | 68 3 
BD male—older — | 40 ae) 32 |10|42 | 52 5 
BD male—younger| 40 | 33/™*| 31 | 9|45 |54 2 
ED male—older |40| 68 |26 |12|44 | 56 9 
ED male—younger| 40 | 68 |32 |14|33 | 47 6 
*p < 05. 
**p < 01 


of error frequencies in protocols of emotion- 
ally disturbed boys versus those of emo- 
tionally disturbed girls revealed significant 
differences between the sexes in the frequency 
of both C— and DKNR error types. Exami- 
nation of the frequency data showed that the 
emotionally disturbed boys made nearly twice 
as many C— responses as did their feminine 
counterparts, while the emotionally disturbed 
girls gave significantly more restrictive errors 
of both the DK and the NR type. These 
findings reflected those presented in Table 1 
in that although the C— response occurred 
with significantly greater frequency in the 
protocols of the emotionally disturbed Ss than 
in those of the organic sample, regardless of 
sex, the discriminatory power of this error 
type achieved only the .05 level of significance 
when applied to the female subgroups. These 
data also help to explain why the restrictive 
error category of DK and DKNR failed to 
discriminate between psychiatric and organic 
groups with the one exception of the DK 
error type which occurred significantly more 
frequently among organic than among emo- 
tionally disturbed boys. It would seem that 
while emotionally disturbed girls did reflect 
their disturbance in their significantly greater 
use of C— responses than did their organic 
pairs, they also tended to resort with signifi- 
cantly greater frequency to the DK and 
DKNR errors than did boys within the same 
syndrome. The restrictive errors of the psychi- 
atric girls equalled the frequency with which 
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the organic females responded thus masking 
the full meaning of the diagnostic potential of 
the DK error. This was not true when the 
sexes were considered separately. These results 
suggest greater homogeneity as well as greater | 
response determination by syndrome dynamics 
among the emotionally disturbed boys than 
among girls within the same syndrome, Fur- 
ther they again call into question, at least 
among female samples, the assumption that 
DK and NR errors must be restrictive errors 
representing organic concreteness, and present 
the alternative possibility, mentioned above, 
that such responses might also reflect the 
negativism and/or autistic muteness of the 
psychiatrically disturbed youngster. 

Comparisons were then made within each 
diagnostic syndrome separately to investigate 
the influence of chronological age and intel) 
lectual ability on the frequency of occur) 
rence of various error types. Due to limited 
number of girls available, these comparisons 
were based on all male samples. These data) 
also appear in Table 2. 

When responses of brain-damaged boys of! 
average-to-dull normal intelligence compared 
with those of brain-damaged boys of equal 
age but low intelligence, it was noted thal 
the C— error occurred with greater frequency 
in the protocols of the more intellectually 
able youngsters. Similarly, older organic boys 
gave significantly more C— responses than 
did their younger counterparts when matched 
for ability. This seemed to indicate thal 
within the organic group, the C— error is not 
only a function of intelligence but also of 
age. Thus, brighter and/or older and mot 
experienced youngsters tended to attempt 
both the generalization and greater adequacy 
of response that the C— error category iM 
plies regardless of their organic limitation 
than do younger or more retarded organic $ 

This position that syndrome dynamle 
might be considerably influenced by age and 
ability factors is further supported by Ù, 
fact that the brighter organic boys gave Sf 
nificantly more DK responses than did th 
more retarded group, while the Jow-ability 
group tended to produce significantly mo 
ND responses. If the DK and ND respons 
do, as assumed throughout the literatur, 
represent dynamically the stimulus-bound 4 
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perseverative behavior characteristic of or- 
ganicity alone, then no significant differences 
within the organic syndrome should be ob- 
served. Thus, the fact that the brighter or- 
ganic group gave significantly more DK re- 
sponses than the less able group must suggest 
the importance of ability as well as syndrome 
dynamics in the frequency with which this 
error occurred. This finding could be inter- 
preted along the same line as was the greater 
frequency of the C— error with brighter 
youngsters. Thus, it would seem that the 
brighter organic youngster not only attempted 
to make more adequate and abstract responses 
but also was more aware of what he did and 
did not know than his slower counterparts. 

It might be argued that these ability- 
related differences merely represent the gen- 
eral tendency of bright youngsters to give 
more responses of all kinds than slower 
youngsters. This argument would have ap- 
peared a great deal stronger had not the 
tetarded organic group given almost twice as 
many ND responses than the more able 
group. The brighter youngster tries for re- 
sponses with greater generality and possesses 
greater awareness of reality and personal limi- 
tation; whereas the duller organic youngster 

. evidenced the fragmented and perseverative 
and stimulus-bound responses assumed to be 
universally characteristic of organicity. Thus, 
it would seem that the diagnostic value of 
rror analysis relative to organics is not 
Simply a direct function of syndrome dynam- 

ics but is also related to age and ability where 
the C— and DK error types are concerned, 
and represents an inverse function of ability 
in regard to the ND error types. 

A strong contrast to the above was apparent 
When similar comparisons of the effect of abil- 
ity and age variables on error frequencies in 
the protocols of emotionally disturbed boys 
Were made. These data, also presented in 
able 2, reveal no significant differences rela- 
tive to chronological ages and only one sig- 

nificant difference relative to ability in error 

Tequencies across the error categories under 

Consideration. This suggests not only greater 
dynamic homogeneity within the emotionally 
disturbed sample, but also greater overall in- 

“fluence of syndrome dynamics on error fre- 
quency patterns than was observed in the 
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protocols of the organic group. This interpre- 
tation of response determination based pri- 
marily on syndrome dynamics is underlined 
by the uniformly high occurrence of the C— 
error type across age and ability subgroups 
within the emotionally disturbed sample since 
the C— error type is the one most associated 
with psychiatric disturbance. The single case 
in which significant differences relative to abil- 
ity occurred was that of the error category 
DKNR. Responses of this type were given 
more often by brighter emotionally disturbed 
boys. As in the above, this cannot be related 
simply to greater overall response frequency 
in protocols of brighter children since the 
frequency data do not confirm this. Rather, 
it might be suggested that the brighter sample 
of emotionally disturbed boys use this type 
of response as an indication of either better 
awareness of personal limitation and reality 
or of greater negativism, suspiciousness, and 
resistance, than is existent in the less intel- 
lectually able emotionally disturbed boys. 


SUMMARY AND CONCLUSIONS 


In summary, the data of the present study 
generally support the validity and diagnostic 
usefulness of the error analysis approach, The 
fact that the expansive (or C—) error type 
occurred with significantly greater frequency 
in the responses of the psychiatrically dis- 
turbed group than in those of the organic 
group regardless of sex provides strong con- 
firmation of Spence’s (1963) findings and of 
the hypothesis of the present research based 
thereon. The diagnostic merit of this type of 
error appears even more powerful when it is 
noted that it is not test-specific, but equally 
effective whether measured on the WAIS or 
the WISC. It is concluded that this error 
category is primarily a reflection of syndrome 
dynamics with diagnostic potential. Strong 
trends in the predicted direction of occurrence 
appeared in the data with regard to the 
restrictive-type error category (i.e, DK, NR, 
DKNR), and the DK error frequency did 
discriminate significantly between organic 
versus emotionally disturbed boys. It is sug- 
gested that these findings support those of 
Spence, but the diagnostic value of this type 
of error is of a more limited nature than that 
of the C— category. The restrictive error 
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type ND revealed a mixed picture suggesting 
that this error category needs further con- 
sideration and refinement before its diagnostic 
value can be validly assessed. Finally, differ- 
ential effects of chronological age, IQ, and 
sex were analyzed and discussed not only in 
terms of their direct influence on error analy- 
sis but also relative to their added interpreta- 
tive value and restrictions in the use of this 
approach to differential diagnosis. 
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Two cognitive-perceptual measures on psychological tests, one from the WAIS 


and the other from the Rorschach, are fr 
toward oppositionality and negativism. 


equently assumed to reflect tendencies 
Oppositional behavior is considered 


when the number of digits remembered in a backward order exceed the number 
of digits remembered in a forward order or when there are White Space re- 


sponses on the Rorschach. A significant 


relationship was found between these 


two independent measures and this finding was discussed in relation to the 
validation of interpretative assumptions of psychological tests. 


On the Wechsler Digit Span, the usual pat- 
tern is for a person to remember more digits 
in forward order than they remember in 
reverse order. Rapaport, Gill, and Schafer 
(1945, p. 180) suggest that a pattern of a 


greater number of digits remembered in a re- 


verse order rather than in a forward order may 
reflect, among other things, negativism and a 
tendency to be oppositional. In the Rorschach 
literature, there is the assumption that a 
tendency toward negativism and oppositional- 
ity can also be reflected in the White Space 
(S) responses, Rorschach (1942) believed 
that S responses always indicated “opposi- 
tional tendency whether they occurred in large 
or small numbers. In normals they indicate 
argumentative, willful, obstinate and queru- 
lous types; in schizophrenics they indicate 
blocking, negativism and eccentricity [p. 
40].” Schafer (1954) in discussing obsessive- 
compulsive defensive operations comments 
that “the underlying negativistic, rebellious, 
stubborn impulses of persons characterized by 
tigid reaction formations against hostility fre- 
quently find expression through S, . . . R and 
Dr may increase greatly . either high or 
low R may result from hostile impulses and 
demanding attitudes . . . [p. 353].” Rapa- 
Port, Gill, and Schafer (1946, p. 178) point 
Out that space and Dr responses tend to 
accumulate in the same records. They also 
Indicate that the difficulty in validating the 
assumptions about test responses, which indi- 

*Requests for reprints should be sent to Sidney 


a Blatt, Department of Psychiatry, Yale University, 
3 Cedar Street, New Haven, Connecticut 06510. 
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cate oppositional tendencies, is a function of 
the many alternative ways such tendencies 
can occur in behavior and in test records 
(1946, pp. 176-177). 

Several research strategies could be used to 
test these assumptions about indicators of 
negativism in the various psychological tests. 
Ideally, test variables might be studied in 
relation to overt behavior, but it is often 
difficult to obtain reliable measurements of 
the degree of negativism in overt behavior 
that occurs in a complex social matrix or in an 
experimental procedure. At least as a first 
step, however, one can test assumptions about 
the psychological test indications of nega- 
tivism by examining the relationship between 
the balance of digits forward (DF) to digits 
backwards (DB) and Rorschach S responses. 


METHOD 


From a file of approximately 700 psychological 
test records, all patients who had DB in excess of 
DF were selected for study. The diagnoses of these 
patients ranged from psychoses through neuroses but 
records were excluded if there were suggestions of 
central nervous system impairment. The experimental 
group consisted of 14 records (7 males and 7 
females) and two control groups were matched with 
the experimental group for age, Sex, and Full Scale 
IQ. The first control group consisted of Ss with 
DB=DF and the second control group consisted 
of Ss where DB was less than DF by one or two 
digits. These three groups were compared on a 
number of Rorschach variables and WAIS subtest 
scores. 


RESULTS 


As presented in Table 1, the data indicate 
that the number of S and the number of Dr 
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TABLE 1 


RELATIONSHIPS OF THE BALANCE OF Dicts 
Forwarp AND Backwarp To WAIS 
SUBTEST SCORES AND RORSCHACH 


VARIABLES 
Variables DB>DF|DB=DF|DB<DF| F* 
Control 
Age 27.0 24.8 26.4 ns 
Full Scale 10 109.00 | 108.00 | 110.21 | ns 
WAIS 
Comp 12.50 11.43 10.50 | 1.88 
Info 11.29 11,21 11.29 | .00 
D Sp 11.86 11,00 9.93 | 1.25 
Arith 10.79 10.57 9.86 | .27 
Simil 10.86 10.64 10.50 | .07 
Vocab 11.86 11.93 11.71 | .02 
PA 11.21 10.57 10.86 | .20 
PC 10.57 9.86 11.64 | 2.39 
BD 11.21 11.00 11.71 | .31 
OA 10.86 10.43 11.86 | .99 
D Sy 9.71 10.79 11.29 | 1.53 
Verbal 10 110.00 | 108.57 | 107.29 | .14 
PI 106.36 | 105.71 | 112.14 | 1.53 
Rorschach 
R 34.21 27.71 25.57 | 1.78 
w 6.64 8.14 9.71 |117 
D 18.93 16.57 12.50 | 1.69 
Dd 71 14 157 DES 
Dr 5.50 2.07 1.57 | 8.33%" 
Do «21 -36 14 | .42 
De +36 0 14 | 1.04 
S 1.29 «29 150 |3.67* 
S (+ additionals) 3.36 1.85 1.00 | 3.28* 
s 236 0 :21 | 1,00 
DW 107 J4 4 | 1S 
M 1.57 1.36 2.00 | .76 
zC 3.86 3.14 3.89 | .22 
A 66.50 70.64 61.64 | 1.08 
F% Ext 91.07 93.07 86.00 | 1.77 
ig % 67.14 70,50 69.57 | .19 
F +% Ext 67.71 71.86 67.64 | .47 


Note.—N = 14 for each group. 
adf = 2/39. 

*> <.05 fz = 3.24). 
> <.01 (F = 5.21). 


responses relates to the balance of DF to DB. 
No differences were found between the three 
groups on any WAIS subtests or on traditional 
Rorschach variables other than the number 
of S and Dr responses. 


DISCUSSION 


The significant relationship between two 
independent cognitive-perceptual measures 
which were assumed to reflect oppositionality 
and negativism tends to support the hypothe- 
ses about the psychological processes involved 
in S responses and the balance of digits back- 
ward to digits forward. It is also consistent 
that the groups should also differ significantly 
on Dr responses since Rapaport et al. (1946) 
commented that Dr and S responses tend to 
co-occur, 

Though there is a significant relation- 
ship between these independent cognitive- 
perceptual measures which are assumed to 
reflect negativism and oppositionality, support 
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for the assumption about test indicators of 
negativism can only be inferential since no 
direct assessment was made of manifest overt 
behavior. Yet the expectation that there 
would be a relationship between the variables 
on the Wechsler and the Rorschach was based 
on independent clinical observation and 
theory. The relationship of test variables to 
particular psychological processes may require 
a demonstration of a relationship to manifest 
behavior but the relationship between internal 
psychological states and overt behavior is a 
complex and difficult problem. There is little 
evidence of a one-to-one, isomorphic relation- 
ship between internal psychological states and 
specific overt behavior (Skolnick, 1966). 
Psychological processes are only one determi- 
nant of behavior in a complex social matrix 
(Engel & Blatt, 1963) and though we may 
understand indications of psychological states 
and tendencies toward certain types of be 
havior, we may not be able to predict how 
these internal states will specifically manifest 
themselves in social situations. The more we 
attempt to place constraints and controls on 
the variations of the social matrix so there isa 
limited range of possible behavior, the more 
the behavior moves toward becoming only re- 
sponses to other psychological test procedures. 
The correlation of dimensions on independent 
psychological procedures, as in this study be- 
tween the WAIS and Rorschach, may be one 
way to test assumptions about the interpreta- 
tion of psychological test variables. 
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SENSATION SEEKING SCALE AS A PREDICTOR OF NEED 
FOR STIMULATION DURING SENSORY RESTRICTION 


JOHN HOCKING? ann MALCOLM ROBERTSON 


Western Michigan University 


The study tested the hypothesis that the Zuckerman Sensation Seeking Scale 
(SSS) predicts strength of need for stimulation under conditions of sensory 
restriction. During a 3-hour isolation period, 15 high- and 15 low-SSS 
scorers were free to button press for visual, auditory, and kinesthetic stimula- 
tion. Except for visual stimulation high-SSS scorers button pressed at a higher 
rate than low-SSS scorers, but the differences were not significant. Low-SSS 
scorers showed a significantly greater need for visual stimulation than high-SSS 

Í scorers. High-SSS scorers showed a significant preference for kinesthetic stimu- 
lation, while low-SSS scorers had a significant preference for visual stimulation. 
Results were discussed in the light of future research strategy. 


= A number of writers have discussed the 
‘concept of optimal stimulation, by which they 
"mean that under conditions of sensory depriva- 
tion or overstimulation the organism seeks to 
increase or decrease the sensory input to a 
level conducive to optimal functioning (Duffy, 
1962; Fiske & Maddi, 1961; Hebb, 1955; 
Leuba, 1955; Schultz, 1965). With few ex- 
ceptions, experimental work has been di- 
tected toward the organism’s behavior in 
“response to conditions of restricted sensory 
‘input (Ansfeld, 1965; Bexton, 1954; Gold- 
Stein, 1965; Vernon & McGill, 1960; 
Zuckerman & Haber, 1965). One major 
Tesearch limitation has been the lack of 
base-line data on an individual’s level of 
Optimal stimulation. To correct for this 
deficiency, Zuckerman, Kolin, Price, and Zoob 
(1964) developed the Sensation Seeking Scale 
(SSS) to provide a quantitative measure of 
the strength of need for stimulation. The SSS 
‘WS a forced-choice questionnaire in which the 
items, derived from factor analysis, pertain 
to preferences for intensities of sensation 
eat, cold, noise, taste, colors), familiarity 
as Opposed to novelty, routine as opposed to 
Ìtregularity, and security as opposed to ad- 
Venture, Zuckerman hypothesized that an 
i individual who scores high on the SSS would 
‘Rave a strong need for stimulation and would 
Seek a relatively high level of sensory input. 


si 

* Requests for reprints should be sent to John 
oe who is now at the Kalamazoo Child Guid- 
mce Clinic, Kalamazoo, Michigan 49001. 
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Conversely, an individual who scores low on 
the SSS would have less need for stimulation 
and would seek a relatively lower level of 
sensory input. 

The present study tested the hypothesis 
that the SSS predicts strength of need for 
stimulation in a sensory restricted environ- 
ment. Specifically, high-SSS scorers were 
expected to request more stimulation than 
low-SSS scorers during 3 hours of sensory 
restriction. 


METHOD 
Sample and Apparatus 


The SSS was administered to 214 male under- 
graduates, and the 15 highest and lowest scorers 
comprised the sample. Out of a maximum score of 
26 on the SSS, the mean of the high scorers was 23 
(SD = 90), and the mean of the low scorers was 
8 (SD=1.8). Approximately 1-3 weeks later, each 
S was retested with the SSS (rko = .93), and then 
spent 3 hours on a bed in an isolation cubicle. The 
interior of the cubicle was totally dark, and the 
only audible sound was the monotonous blower noise 
of an air-conditioner. 

During the isolation period S could button press 
for visual, auditory, and kinesthetic stimulation. 
Three door bell buttons, distinguished by indenta- 
tions, were mounted on the bed approximately 18 
inches from S’s head. The buttons were connected 
to three indicator lights located in an adjacent E’s 
station. 

Visual stimuli consisted of lighting the cubicle 
by a Penetray’s motorized color wheel which was 
illuminated by a 150-watt reflector spotlight. The 
color wheel rotated amber, blue, green, and red at 
4 revolutions per minute. An Amperite delay relay 
switch limited the length of visual stimulation to 
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15 seconds. Auditory stimulation consisted of four 
piano notes which were played on a tape recorder. 
Each time the auditory button was pressed, the tape 
lasted for 15 seconds or one revolution. A button 
press for kinesthetic stimulation permitted S freedom 
of movement both on and off the bed for 15 seconds. 
Since the bed springs emitted a squeaking noise 
whenever S moved, a record could be kept of move- 
ments during periods when the kinesthetic button 
was not pressed. 


Procedure 


After each S had been retested with the SSS and 
completed a brief autobiographical questionnaire, he 
was given the following instructions along with a 
demonstration of each form of stimulation. 


You are asked to remain in the room for a period 
of time not more than 3 hours or until you volun- 
tarily leave. You are asked to lie on the bed on 
your back. You are to stay awake. Please keep 
movements to a minimum and do not talk. 


You will be able to get several forms of stimula- 
tion if you push a certain button. If you push the 
button with one mark on it, a light will go on 
for a period of time. If you push the button with 
two marks on it, tones will be played for a period 
of time. If you push the botton in the middle po- 
sition, you will be permitted to move from your 
position for a period of time. That is you may 
move around or get off the bed until a buzzer 
is sounded. When the buzzer sounds, please return 
to the supine position on the bed. The use of the 
buttons is entirely up to you. This experiment is 
not an endurance test or test of will power. People 
react in many ways and there is no correct pattern 
that we are looking for. The experimenter will 
remain next door in the observation room through- 
out the period of time. You will not be hurt in 
any way. Do you have an questions? 


Analysis 


The experimental period was divided into eighteen 
10-minute units, and the button pressing responses 
for the three types of stimulation were totaled for 
each 10-minute period. The response variable was 
quite skewed in that most Ss made a low number 
of button presses, while a few Ss made a very high 
number. Consequently, nonparametric methods were 
used to analyze the data. The Mann-Whitney U test 
compared the high- and low-need Ss on the total 
number of button presses for each type of stimula- 
tion, as well as the total of all button pressing 
responses. Within each group, differences in type of 
stimulation requested were determined by a Friedman 
two-way analysis of variance. 


RESULTS AND Discussion 


With the exception of visual stimulation, 
igh scorers on the SSS demonstrated a 
greater need for stimulation than low scorers. 
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Table 1 shows that high-need Ss had a higher 
rate of total button pressing than the low- 
need Ss. The same was true of the subtotals 
for auditory and kinesthetic stimulation, 
However, the differences were not statistically 
significant. On the other hand, for visual 
stimulation low-need Ss had a significantly 
(p = .05) higher rate of button pressing than 
the high-need Ss. j 

Since a graphical representation of the re- 
sponses of the two groups disclosed the 
greatest difference during the first hour, 
between-group comparisons were made both 
for total button presses and the subtotals 
for each type of stimulation during the first 
60 minutes. Although the differences were in 
the direction noted above, they were not 
significant. 

In terms of group preference for type of 
stimulation, high-need Ss showed a significant 
(x? = 177.38, p < .001) difference in favor 
of kinesthetic stimulation. For low-need Ss 
the difference was significant (x? = 182.70, 
p < .001) in favor of visual stimulation. 

Although high scorers on the SSS had a ; 
greater rate of requesting stimulation than 
low scorers on the SSS, the failure to obtain 
a significant difference combined with the sig- 
nificant reversal on visual stimulation under- 
scores the necessity of further scale validation. 
In further research four points should be 
considered. First, the effect of a longer period 
of sensory restriction, for example, 4-8 hours, 
should be assessed to determine whether the 
obtained direction of group differences 1 


TABLE 1 


Mann-Wurtney U Test or Rate or BUTION 
PRESSING ror HIGH AND Low SCORERS 


ON THE SSS 
High SSS Low SSS 
Response U 
Total Total a 
eee sate | R 
clase 
Visual 559 | 192 | 645 | 274 | 715° 
Auditory 692 | 206 205 | 269 | 86 
Kinesthetic 548 | 208 415 | 258 | 89 
Combined 1,799 | 204 | 1,262 | 261 | 84 
Not N = 15 in each group. 


wash of ranks assigned to each group. 
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maintained. Second, Zubek (1963) has sug- 
gested that exercise during isolation may 
provide enough variation of kinesthetic and 
proprioceptive. stimulation to counteract some 
of the effects of restricted visual and auditory 
stimulation. The fact that the high-SSS 
group requested more kinesthetic stimulation 
than the low-SSS group and had a signifi- 
cantly greater preference for kinesthetic 
stimulation than for visual and auditory 
stimulation might be interpreted as an effort 
to lessen the unpleasant effects of restricted 
visual and auditory stimulation. Thus, a com- 
parison of high and low scorers should per- 
haps be made under conditions in which 
kinesthetic stimulation is held to a minimum. 
Third, the high-SSS person is thought to be 
one who requires a variation in stimulation 
as well as amount of stimulation; conse- 
quently unless stimulation is quite varied and 
interesting, high-SSS persons are likely to 
consider it dull and to show less operant 
motivation.2 Therefore, the stimuli provided 
in the experimental condition should have a 
variation dimension as well as an intensity 
dimension. Finally, additional analysis of the 
SSS items is needed. In this study, an item 
analysis of the SSS scores showed that 9 of 
the 26 items had low discriminatory power, 
that is, 70% or more of the Ss answered these 
items in the sensation seeking direction. On 


the other hand, 15 of the 26 items discrimi- 


nated satisfactorily; that is, close to 50% of 
the Ss answered these items in the sensation 
seeking direction. On 12 of these items, Ss 
who answered in the sensation seeking direc- 
tion button pressed substantially more or sub- 
stantially less than Ss who answered the items 


2M. Zuckerman, personal communication, Janu- 
ary 1968. 
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in the opposite direction. Thus, the relation 
between the SSS and overt behavior is at best 
an unclear one. Perhaps some clarity will be 
forthcoming from Zuckerman’s (1967) efforts 
to replace the composite scale with specific 
factor scales that sample the stimulation 
dimensions separately. 
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FOLLOW-UP OF THE LATER CAREERS AND LIVES OF 
1,000 BOYS WHO DROPPED OUT OF HIGH SCHOOL" 


STARKE R, HATHAWAY,? PHYLLIS C. REYNOLDS, 
anp ELIO D. MONACHESI 


University of Minnesota 


A dropout group and a nondropout group of high school boys were compared 
on several psychometric and demographic variables. The data both predated 
and postdated (by about 10 years) the time of dropout. For further analysis, 
the groups were subdivided by intelligence, additional training, and early versus 
late dropouts for the dropout group. The later lives of dropouts were charac- 
terized in general by relatively low socioeconomic status (SES), a downward 
social mobility, a higher incidence of criminal activity, and bigger families. 
Ninth-grade MMPI tests showed some personality differences between dropouts 
and nondropouts. When dropout and nondropout groups were matched by 
intelligence, most of the differences persisted. Training after dropout is of 
some help in future SES. Few differences were found between early and late 


dropouts. 


During the 1950s somewhere between 15% 
and 30% of boys entering high school in the 
state of Minnesota did not graduate.® Actual 
dropout rates are not known for Minnesota 
since the gathering of this kind of statistic 
presents difficulties such as accounting for 
school mobility, temporary dropouts, etc. In 
Lawrence, Kansas, over 27% of children who 
entered high school in 1950 did not graduate 
(Kansas State University, Bureau of Child 
Research, 1957); in Iowa, almost a quarter 
of adolescents who enter the ninth grade do 
not graduate (National Committee for Chil- 
dren and Youth, 1960). 


During the 1960's, the Department of Labor esti- 
mates, some 7.5 million youngsters will dop out 
before high school graduation. About 2.5 million 
will not go beyond the eighth grade; two out of 
three will go not further than the tenth grade. 
Most will end their education before they have 
been exposed to the advice of the vocational coun- 
selor. Meanwhile, the level of training required for 
jobs is steadily rising. Each year there will be fewer 
openings, proportionately, for the unskilled worker. 
. . . Dismal as the dropout’s immediate prospects are 


1 This research was supported by Research Grant 
5 RO1 MH 12187 from the National Institute of 
Mental Health and grants from ‘the University of 
Minnesota. _ 

2 Requests for reprints should be sent to Starke 
R. Hathaway, Medical School, Division of Clinical 
Psychology, University of Minnesota, Minneapolis, 
Minnesota 55455. 

3 Minnesota Department of Education, personal 
communication, April, 1968. 
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when he leaves school, his long range outlook is even 
gloomier. The Department of Labor predicts that by 
1970 white collar workers will constitute 45 per cent 
of the labor force, unskilled workers only 5 per cent 
[National Committee for Children and Youth, 1961, 
pp. 15 and 20]. 


These data attest to the critical social problem 
presented by the mass of adolescents who do 
not earn a high school diploma. The in- 
creasingly complicated technology in this 
country with its outcomes of automation and 
need for more and more skilled workers and 
persons who deal with theory and planning 
suggests that a high school diploma is be- 
coming a minimum occupational requirement, 

Although there have been many reports 
comparing dropouts to nondropouts (€-8» 
Cervantes, 1965; Hathaway & Monachesi, 
1963; Livingston, 1958; Schreiber, 1967; 
Turchan, 1966) and reports on dropouts from 
the Midwest (e.g, Kansas State University, 
Bureau of Child Research, 1957; Nachman, 
Getson, & Odgers, 1964), surprisingly little 
has been published in the psychological, s010- 
logical, or educational literature about the 
later careers of the high school dropout. A 
recent report on occupational structure m 
America (Blau & Duncan, 1967) includes 
data on the later occupations of male drop- 
outs, data which in many ways are far mote 
extensive than the present; however, because 
of the nature of their study (retrospective): 
it could not provide data on some of 
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variables which are included in the present 
paper, for example, intelligence, trouble with 
the law, personality. The present study will 
be one of the few providing data on what 
happens to a rather large and representative 
sample of male high school dropouts, during 
approximately 10 years after the boys have 
left school. Although many questions are left 
unanswered, some evidence is provided on 
the relationship of dropout to data such as 
intelligence, marriage, size of family (of 
origin and of procreation), present occupa- 
tion, training since high school, trouble with 
the law. The present authors also attempt 
a preliminary description of the personality 
factors associated with dropping out of school 
before completion. 


METHOD 
Samples 


The dropout and control samples for this study 
were drawn from 11,329 Ss who comprised the state- 
wide sample in a longitudinal study of adolescent 
personality and behavior which began in 1953 when 
the Ss were ninth-graders in Minnesota schools. For 
a more detailed description of the longitudinal study 
see Hathaway and Monachesi (1963); the sample 
was a random one from the state exclusive of the 
Minneapolis-St. Paul urban centers. It included the 
city of Duluth. It included 28% of the entire ninth- 
grade public school population of Minnesota in 1953- 
1954 and 35% of all ninth-graders outside the Min- 
neapolis and St. Paul public school systems. It was 
impossible to record race when the data were gath- 
ered. Because of the racial composition of Minnesota, 
however, it is not far wrong to describe the sample 
as representative of predominately white, midwestern 
society. 

The original number of boys in the sample was 
5,701, of whom 1,002 (18%) dropped out of school 
before graduation (14% of the girls dropped out). 
Two follow-up studies were done on the entire 
11,329; the first in 1957, shortly before high school 
completion, and the second over the years from 1962 
to 1966. The mean age of the dropout boys was 
27.9 years. Because two of the dropouts were lost 
in the second follow-up study, the total dropout N 
from which the table samples for this paper are 
drawn is 1,000. Except for the fact of high school 
graduation, the control sample boys were picked 
on a random basis within each school to equal the 
number of dropouts for the school. 


Follow-Up 


In the years 1953-1957, the initial and high school 
data were collected. Pertinent to this report were size 
of home community, socioeconomic status of parental 
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home, intelligence test scores, MMPI test results, 
and date of birth. 

During the period 1962-1966, further follow-up 
data were gathered. The variables used in this report 
are date of marriage, present marital status, region 
of present residence, size of community, number of 
children, present occupation, type of training since 
high school, offenses against the law since 1957, 
emotional problems (as defined by known contact 
with a psychiatrist or psychologist in an in- 
patient or outpatient setting), status with the Armed 
Services. 

Although 94% of the original statewide sample 
was contacted in some way during the 1962-1966 
follow-up period, because of the usual difficulties 
inherent in a longitudinal study, the number of Ss 
that could be classified for each informational item 
used in this study varied from 50% to 89%. 


RESULTS 


This study summary of the dropouts is 
arbitrarily divided into five sections: (a) all 
dropouts compared to all controls; (b) drop- 
outs compared to controls when grouped by 
high, medium, and low intelligence test scores; 
(c) dropouts who had some sort of further 
occupational training since high school com- 
pared to controls with further training; (d) 
dropouts with further occupational training 
since high school compared to dropouts with 
no further training; (e) boys who dropped out 
of school in the ninth and tenth grades com- 
pared to boys who dropped out in the eleventh 
and twelfth grades, Each of these will be con- 
sidered separately. All analyses except one 
were done using the chi-square test. The 
authors felt justified in using only this test 
because of the nature of most of the data, 
which was a frequency count for each vari- 
able. Had we been able to do a follow-up 
in greater depth, we could have been able 
to often classify by interval scales, as did 
Blau and Duncan (1967) for socioeconomic 
status (SES). Unfortunately, this was not pos- 
sible. The data presented here are derived 
from over 100 tables. Those included in the 
paper contain data that might be of more 
interest.* 


All Dropouts Compared to All Controls 


Differences with x? < .001 appeared in the 
following comparisons: age at the end of the 


4Copies of all the tables can be obtained on 
request from S. R. Hathaway. 
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follow-up period—the dropout sample, al- 
though only .4 year older than the control 
sample in mean age, was negatively skewed 
with a median .5 year older; 69% were over 
the age of 27, whereas only 44% of the con- 
trols were over this age. Year of marriage— 
boys who drop out of school marry earlier, 
with the largest difference between them and 
the controls being the years when the con- 
trols were still in high school. Present marital 
status—although dropouts were more often 
divorced than controls, when this variable 
was controlled for date of marriage, there 
was no difference between dropouts and con- 
trols. Number of children—dropouts had 
larger families even when date of marriage 
was controlled. 

Socioeconomic status of the family of ori- 
gin (as measured by occupational level)— 
there was much fewer than the expected 
number of parents of dropouts who were in 
the professional, semiprofessional, or clerical, 
skilled categories (white collar) and much 
more than the expected number in the day 
laborer category. There were no significant 
differences between the number of dropouts 
and controls whose parents were in the semi- 
skilled, slightly skilled (blue collar), farmer, 
or public assistance (unemployed) categories. 
Table 1 gives the percentages in each cate- 
gory. Classification was made in accordance 
with the Minnesota Scale for Paternal Occu- 
pations (Institute of Child Welfare, undated) 
both for this comparison and the one to 
follow. Present occupation of the dropouts 
appeared in far fewer than the expected 
numbers in the professional and semiprofes- 


TABLE 1 


DISTRIBUTION OF PARENT OCCUPATIONAL LEVELS 
or Dropouts COMPARED TO CONTROLS 


Occupational level Dropouts | Controls 
Professional and semiprofessional 2 7 
Clerical, skilled 15 20 
Semiskilled 20 23 
Slightly skilled 21 19 
Day laborer 17 7 
Farmer 25 23 
Unemployed j 1 

Total 101 100 


Note.—For dropouts, N = 927; for co: = 
Distribution is in percentages. x? = 81.39, wee $e “oO 
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OF DROPOUTS COMPARED TO CONTROLS 


Occupational level Dropouts | Controls 
Professional and semiprofessional 7 16 
Clerical, skilled 17 24 
Semiskilled 26 21 
Slightly skilled 30 16 
Day laborer 11 3 
Farmer 7 6 
Unemployed 5 2 
Student 3 1 


Total 101 99 


Note.—For dropouts, N = 699; for controls, N = 816 
Distribution is in percentages. x? = 204.06, df = 7, p <,001 


TABLE 2 
DISTRIBUTION OF SUBJECT OCCUPATIONAL LEVELS 


sional category. This showed up to a lesser 
degree in the clerical-skilled category. In the 
semiskilled category no significant difference 
appeared between dropouts and controls, but 
in the categories of slightly skilled, day 
laborer, and unemployed there were many 
more than the expected number of dropouts, 
Because of the time of the follow-up study, 
a special category called student was pro- 
vided. The proportion of controls who were 
still students was much greater than of drop- 
outs. Table 2 gives the percentages of dropout 
Ss in each SES category compared to controls. 
Social mobility—in an effort to further 
study the relationship of parental compared 
to S SES, the occupation of each S was com- 
pared to that of his family of origin. Drop- 
out boys whose parents were in the pro- 
fessional, semiprofessional, and clerical, skilled 
combined categories (white collar) appeared 
in far less than the expected number in this 
category and in far more than the expected 
number in the semiskilled and slightly skilled 
combined categories. No differences were 
found between dropouts and controls from 
professional to clerical families who became 
day laborers or farmers or welfare recipients. 
Among those dropouts whose parents were in 
the semiskilled and slightly skilled combined 
categories there was less than the expected 
number who rose to the white collar level 
and more than the expected number who fell 
into the day laborer and public assistance 
category. Far fewer than the expected number 
of dropout boys whose parents were farmers 
were found in the white collar categories, 
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put there were no differences between drop- 
outs and controls in the socioeconomic cate- 
gories when the parents were farmers. A 
comparison of dropouts and controls whose 
parents where in the day laborer and un- 
employed combined categories showed no 
meaningful differences. 

Type of training since high school—Table 3 
shows that a comparatively large percentage 
of dropouts did not obtain any further train- 
ing after leaving high school (72%); among 
those who did have additional schooling, there 
were far fewer than the expected number 
who had other than trade school training. 
There was no significant difference between 
the number of dropouts and controls who 
received a trade school training. 

Offenses against the law—more dropouts 
than controls committed misdemeanors, felo- 
nies, and crimes severe enough to warrant a 
prison sentence, although there was no dif- 


“ference between the two groups when those 


committing only traffic offenses were com- 
pared. Emotional problems—although rela- 
tively few in actual number, the frequency of 
dropouts who were known to have received 
some form of psychiatric care since high 
school was significantly highed than the fre- 
quency of controls who had received psychi- 
atric care. 

Ordinal birth position—there was a tend- 
ency for dropouts to be the last born of large 
families (> 7 children in the family) and a 
Slight tendency for them to be firstborn of 
large families. This is in some contrast to the 
findings of Blau and Duncan (1967) that 
oldest and youngest children are better edu- 
cated than their middle siblings. Due to the 
nature of the present analysis, however, we 


TABLE 3 


Percentacrs or Dropouts AND CONTROLS WITH 
FURTHER TRAINING AFTER Hic SCHOOL 


Further training Dropouts| Controls 


Professional, education, 


business, etc. 8 42 
Trade school f 20 16 
No further training 72 41 

Total 100 99 


Note—For dropouts, N = 820; for controls, N = 888. 


X = 270.56, df = 2, p < 001. 
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would concur with Blau and Duncan that 
the dropout tends to come from a large 
family. Although the present data cannot 
show this, other studies suggest that older 
brothers’ and sisters’ dropping out of school 
is an influence on S to drop out (National 
Committee for Children and Youth, 1961), 
as is having a mass of younger siblings to 
support (Blau & Duncan, 1967). 

Personality—more than the expected num- 
ber of dropouts produced invalid (L > 10, 
F > 22, ? > 40) MMPI profiles in the ninth 
grade. Some but not all of this invalidity is 
interpretable as indicating poor reading abil- 
ity or noncooperativeness in the testing situa- 
tion. More than the expected number of drop- 
outs also had MMPI profiles with the highest 
scale being 8 (Schizophrenia), suggesting 
that there was a tendency for dropouts to 
be negative, difficult, odd, apathetic, or lack- 
ing in social graces. Less than the expected 
number had profiles where the highest scale 
was 3 (Hysteria) or 9 (Hypomania), sug- 
gesting a tendency for dropouts to be less 
characterized by such qualities as being 
idealistic, articulate, socially conforming, opti- 
mistic, decisive, expansive. Table 4 gives the 
chi-square computation on the numbers of 
dropouts and controls for each MMPI high 
point. 

When the mean scores on each scale were 
compared, however, a somewhat different pic- 
ture appeared. Table 5 gives the means and 
standard deviations for the validity scales 
and the 10 clinical scales of the MMPI for 
those boys with valid profiles. Because of 
the large numbers in the samples, some small 
differences between the means were highly 
significant; yet the mean scores on scales F 
(validity), 4 (Psychopathic Deviate), and 
8 (Schizophrenia) for the dropouts seem 
meaningfully higher than those of the con- 
trols. A high F scale can signify an unusual 
amount of psychological complaint; a high 
Scale 4 suggests disregard for rules and feel- 
ings of rebelliousness; and a high Scale 8, as 
mentioned, suggests feelings of isolation, 
negativism, and apathy. Whereas the chi- 
square results of Table 4 are based on the 
first high point of each boy’s MMPI profile, 
the mean scale values of Table 5 give an 
idea of the actual average elevation of each 
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TABLE 4 
MMPI Hic Point COMPARISONS 
FOR DROPOUTS AND CONTROLS 
Dropouts Controls 
Scale Total} x? 
o x e o 5 e 
1 12| .29| 14| 16| .29| 14| 28| .58 
2 33| 89| 28| 23| .89| 28| 56| 1.78 
3 7| 427| 15 | 22| 427| 14| 29| 8.84 
4 |160| 142] 152 |138| .44| 146 |298| 86 
5 16 | 1.64| 22 | 28| 1.64] 22| 44| 3.28 
6 38| 82| 44| 48| .86| 42| 86| 1.68 
7 as | 19| 48| 49| .20| 46| 94| .39 
8 |255| 6.65 |217 |171 | 6.91 | 209 | 426 | 13.56 
9 118 | 9.26 | 156 | 189 | 9.56 | 151 | 307 | 18.82 
0 38 | 2.88| 50 | 61| 2.94] 49| 99| 5.82 
Invalid | 110 | 6.70 | 86 | 59| 6.94 | 83| 169 |13.64 
Total| 832 | 34.01 804 | 35.24 


Note.—x' = 69.25, df = 10, p < .001. o = actual number; 


e = estimated number. 


of the scales. Dropouts clearly tended to 
get more elevated MMPI profiles than did 
controls. 

The following comparisons yielded little or 
no statistically significant differences: region 
of present residence—when dropouts are 
compared to controls on remaining in Min- 
nesota versus moving elsewhere, or by more 
extensive breakdown of region of residence 
(e.g, Minnesota, California, Pacific North- 
west, Midwest, New England, South, etc.), 
there was no significant difference between 


TABLE 5 


Means AND STANDARD DEVIATIONS OF THE MMPI 
SCALES For DROPOUTS AND CONTROLS: 
VALD PROFILES ONLY 


Dropouts Controls 
Scale t 
M SD M SD 
L 3.89 | 1.96 | 3.84 | 206| .54 
F. 8.57 | 4.90 | 6.18 | 4.17 | 11,27* 
K 12.93 | 4.45 | 13.69 | 4.67 | 3.58* 
1 (+.5K) | 52.52 | 10.31 | 51.52 | 9.00 | 2.20 
2 54.67 | 10.81 | 51.69 | 10.45 | 5.98* 
3 52.55 | 8.42 | 51.91 | 7.86 | 1.68 
4 (+.4K) | 62.90 | 11.09 | 58.44 | 9.84 | 9.06* 
5 50.21 9.01 | 51.24 8.72 | 2,40 
6 58,33 | 11.31 | 56.00 9.92 | 4.67* 
7 (+4) 60,58 | 10.91 | 58.25 9.95 | 5,18* 
8 (+K) 66,50 | 14.85 | 61.87 | 12.90 | 7.09* 
9 (+.2K) | 61.17 | 11.30 | 58.84 | 11,40 | 4.38* 
0 55.42 | 7.77 | 54.07 | 8.22 | 3.61* 
Note.—For dropouts, N = 884; for controls, N = 941. 


aorewie pen Probie ’849760—'4:9:13, Control mean profile 
“P< “002, 
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the two groups. Community size 

did not tend to congregate in any o 
community, city, large town, small 

farm. Armed Services—branch of ser 
not generally distinguish dropouts fror 
trols. Although the chi-square on statu 
the service was significant beyond p: 
the only cells that meaningfully disting 
dropouts from controls were those 
National Guard. There were fewer 
outs than controls in the Guard. D 
were not discharged from the Armed 
for any reason with higher frequency 
controls. 


Dropouts Compared to Controls when ~ 
Grouped by Intelligence 


Percentile scores from an eleventh or 
grade American College Entrance (Al 
exam or, in a few cases, percentiles d 
from an IQ test were used for these analyse 
Grouping was as follows: high 69-100" 
centile; medium 40-68 percentile; low 1- 
percentile. { 

Comparison on the following variables 
significant at p < .005: present marital 
—although there was no difference amoi 
three dropout groups themselves, when 
intelligence dropouts were compared 
intelligence controls, there was a tenden 
more than the expected number of dr 
to be separated or divorced. Year 0 
riage—again the three dropout groups did 
differ, but all three differed significantly 
their comparable control group, each 
marrying earlier than the controls. 

Present occupation—the overall table 
paring the three dropout groups was 
cant at p< .001; however, it was prim 
the medium intelligence dropout group 
showed significant differences. It was 
group, not those of high intelligence, that h 
more than the expected number in the pi 
fessional, semiprofessional, and clerical, 
categories. Even when the medium intell 
dropouts were compared to the m 
intelligence controls there was no diff 
between the two groups in the clerical, 
category. Both the high and low intellig 
dropout groups differed greatly from 
comparable control groups, and both 
same categories; fewer than the expé 
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number in the white collar category and 
more in the slightly skilled, day laborer, and 
unemployed categories. 

Type of training since high school—chi- 
square values on this variable were all signifi- 
cant at p < .001. Among the dropout groups 
themselves there is a pattern of more than 
expected high intelligence boys who received 
training in the academic or business school 
lines, more than the expected number of 
medium intelligence boys who went to trade 
school, and more low intelligence boys who 
had no further training. The most marked 
difference when these groups were compared 
to their controls appeared in the relatively 
small number who pursued academic or 
business lines among the dropouts, no matter 
what the intelligence. For the boys in the 
dropout or the control groups who went to 
trade school there were no significant differ- 
ences, that is, a high intelligence dropout was 
just as likely to go to trade school as a high 
intelligence boy with a high school diploma, 

Offenses against the law—no differences ap- 
peared when the three dropout groups were 
compared; when each was compared to its 
control, however, it was only between low 
intelligence dropouts and low intelligence con- 
trols that there were no differences. Both 
high and medium intelligence dropouts had 
significantly more offenses ranging from mis- 
demeanors to prison sentences than did their 
control groups. 

MMPI high point—when just those high 
points that distinguished dropouts from con- 
trols were considered (Scales 3, 8, 9), only 
those dropouts who scored high on intelligence 
tests got more than the expected number of 
profiles with 8 as the highest scale. Intel- 
ligence did not differentiate on either Scale 3 
or 9. 

On some of the variables there were no 
significant differences between dropouts and 
controls when grouped by intelligence test 
scores: region of present residence showed 
no differences except between the medium 
intelligence groups. More dropouts had moved 
away from Minnesota. Community size and 
number of children showed no meaningful 
differences, Status with the Armed Services 
developed a difference only between the low 
intelligence groups, which had less than the 
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expected number of dropouts in the National 
Guard. 

When the dropouts were grouped by reason 
for dropping out of school, that is, failing 
or lack of interest, work, institutionalized, 
Armed Service, there were no significant fre- 
quency differences among the high, medium, 
or low intelligence boys. 


Dropouts with Further Training Compared 
to Controls with Further Training 


For the grouping in these analyses, drop- 
outs and controls were included only if they 
were known to have had training of any kind 
after high school. Training ranged from ad- 
vanced degrees to trade school, The purpose 
for study of these groups was to explore the 
discrimination that relates to a high school 
diploma. The following chi-squares were 
significant (p < .001). 

Present marital status—the dropout group 
was less likely to be single and more likely 
to be divorced than the control group. 

Type of training since high school—far 
greater than the expected number of dropouts 
went on to trade school training rather than 
to pursue a more academic avenue; but when 
present occupation was evaluated, there was 
no difference between controls and dropouts 
in the clerical, skilled category. Table 6 shows 
that when compared to controls, dropouts 
congregated among the semiskilled and 
slightly skilled and day laborer categories 
despite their training. 

Offenses against the law—the group of 


TABLE 6 


COMPARISON OF PRESENT OCCUPATION BETWEEN 
Dropouts AND CONTROLS WITH ‘TRAINING 
AFTER HicH SCHOOL 


Occupation Dropouts | Controls 
Professional and semiprofessional 6 27 
Clerical, skilled 28 26 
Semiskilled 29 18 
Slightly skilled 21 6 
Day laborer 5 0 
Farmer 3 2 
Unemployed 2 2 
Student 6 18 

Total 100 99 


„—For dropouts, N =189; for controls, N = 454, 
comparison {s in percentages. x? = 90.51, df =7,p < 001, 
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dropouts who had further training, like the 
total dropout group, got into significantly 
more trouble with the law than did the con- 
trols. More of them than controls committed 
misdemeanors, felonies, and crimes severe 
enough to warrant a prison sentence. 
Comparisons that showed little or no dif- 
ferences were year of marriage, number of 
children, region of present residence, size of 
community, and MMPI high points. 


Dropouts with and without Training after 
Quitting High School 


An attempt was made to find demographic 
signs of future success despite high school 
dropout, that is, even if a boy quits school, 
are there any indications that he makes up 
for this deficit in some way, and can one tell 
which boys are more likely to make up for 
the deficit? Again, training is defined as any- 
thing ranging from return to high school plus 
continuation on to a professional degree to 
trade school. 

Boys who obtained further training showed 
a tendency (p< .05) to come from families 
that were professional, semiprofessional, or 
semiskilled and to not come from farm fami- 
lies. Although less than expected of them 
came from a city or suburb, far greater than 
the expected number grew up in large towns. 
Relatively few came from farms whether or 
not their fathers were actually farmers. While 
or after they had their training, however, they 
tended to move to the city, and even if they 
originally came from a farm, they did not 
stay or return there. Also they moved away 
from Minnesota (p < .001) with much higher 
frequency than dropouts who had no training. 

Intelligence showed some association with 
continued training. Fewer than the expected 
number of low intelligence dropouts went on 
for training, while clearly more than the 
expected number of both medium and high 
intelligence boys got more training. 

Whether the family of origin was a broken 
one by death or divorce seemed to make no 
difference in determining that a dropout went 
back to school. And between the two groups 
themselves there was no difference in rate 
‘of divorce, although those dropouts who got 
no further training tended to marry earlier. 
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There was a distinction between the groups 
on present occupation. As Table 7 shows, the 
percentage of boys with further training who 
were in the white collar category was more 
than twice that of boys without training. As 
would be expected, the nontrained boys con- 
gregated in the low SES categories, and only 
1% of them were students at the time of 
follow-up. 

The ninth-grade MMPI testing distin- 
guished weakly at the p < .10 level between 
dropouts who did and did not get further 
training. The MMPI high point pattern was 
similar to that produced when all dropouts 
were compared to all controls: boys who went 
back to school less often had a high 8, more 
often had a high 9. They also less often 
had invalid profiles, and although it did not 
appear on the overall comparisons, they more 
often had a high point of 5 (Masculinity- 
Femininity), a scale that is frequently associ- 
ated with educational persistence (Dahlstrom 
& Welsh, 1960). 


Early (Ninth and Tenth Grade) Dropouts 
Compared to Later (Eleventh and 
Twelfth Grade) Dropouts 


There were few differences in the later lives 
of early dropouts compared to late dropouts. 
There were no significant differences on year 
of marriage, marital status, present area of 
residence, offenses against the law, later emo- 
tional problems, or status with the Armed 
Services. 


TABLE 7 


Dropouts WITH FURTHER TRAINING AFTER HIGH 
ScHooL Comparep To Dropouts wira No 
TRAINING, ON PRESENT OCCUPATION 


No 


Occupation training Training 
Professional, semiprofessional, 
clerical-skilled 13 34 
Semiskilled 25 29 
Slightly skilled 34 at 
Day laborer 11 5 
Farmer Y 3 
Unemployed 6 2 
Student 1 6 
Total 99 100 


Note.—For dropouts with no training, N = 470; for dropouts 
with training, N = 189. Ci i entages. X 
711d df 16.) <-00L. omparison is in perci 
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Early dropouts tended to live on farms 
more than did later dropouts. Also early 
dropouts originally came from farms with a 
much higher frequency ($ < .001). 

When these groups were compared on type 
of training after leaving high school, less than 
the expected number of early dropouts pur- 
sued academic lines; not surprisingly it 
was the later dropouts who more frequently 
returned to complete high school and 
from that continued in regular academic 
lines or attended some sort of business school 
(p < .005). 


DISCUSSION 


The most marked feature that distinguishes 
the male high school dropout from those boys 
who receive their high school diploma is the 
occupational level achieved by the time these 
men are in their late 20s. To a large degree 
the dropout of the 1950s is the slightly skilled 
or unskilled laborer of the 1960s. Even if a 
bright boy drops out of school, the data show 
that he is much less likely than his bright 
schoolmate who graduates to get later training 
of any kind and that he is much more likely to 
subsequently fall into the ranks of the semi- 
skilled, slightly skilled, day laborer, and un- 
employed. In view of the nationwide search 
for talent, this finding is important: boys of 
_ high ability are being lost. 

Downward social mobility of, dropouts is 
another finding of this study. The data show 
that boys who do not finish high school tend 
to drop in socioeconomic level from their 
family of origin regardless of what level their 
parents were. Social class does determine to 
some extent whether or not a boy finishes 
high school (88% of boys from professional, 
semiprofessional, and clerical, skilled families 
finished high school compared to 78% of boys 
from semiskilled, slightly skilled, and day 
laborer families), but once a boy drops out 
of school, the social class of his parents has 
little relationship to his own social class by 
the time he is in his late 20s. These findings 
do not exactly agree with those of Livingston 
(1958), who found that occupational status 
of the principal wage earner in the family of 
origin was an insignificant factor in whether 
or not a boy dropped out of school, but they 
do agree with the findings of a much larger 
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study (Blau & Duncan, 1967), where it is 
shown that the number of years of schooling 
completed has a much higher relationship 
with later career than does the father’s 
occupational status. 

Until drastic changes are made in primary 
and secondary school curricula in the United 
States (cf. Schreiber, 1967, for discussions of 
possibilities), it can be assumed that there 
will always be a certain proportion of boys 
who will drop out of high school. At age 16 
they can do so legally in Minnesota and many 
other states. If one had to select a group to 
drop out, perhaps the most justifiable selec- 
tion would be made by using intelligence as a 
criterion, that is, boys with a below-average 
intelligence would appear to have the most 
difficulty in keeping up with the “academic” 
types of curricula presented by the usual high 
school of today. A fair question to ask is 
how much of a social problem would boys 
of below-average intelligence who drop out 
of school become. The data in this study show 
that although there is a tendency for low 
intelligence dropouts to be separated and 
divorced more often than high school gradu- 
ates, they are not separated or divorced with 
any higher frequency than dropouts of aver- 
age or above average intelligence. Although 
low intelligence dropouts tend to congregate 
in the low SES categories when compared to 
low intelligence controls, they are not actually 
unemployed or on public assistance to any 
greater extent than the controls, and when 
compared to average and above-average 
intelligence dropouts, there is essentially no 
difference between the ultimate SES level. 
They do not have larger families than either 
the controls with a below-average intelligence 
or the higher intelligence dropouts, thus it 
cannot be said that they are potential higher 
producers of future social problems than are 
other dropouts, Even when trouble with 
the law is considered, the below-average 
intelligence dropout is no different from his 
control S or from the average to above- 
average intelligence dropout. 

This sort of reasoning makes it clearer that 
attempting to limit high school dropouts to 
just those boys who are the most “eligible” 
candidates for not continuing, that is, those 
who are of too limited intelligence to benefit 
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from academic curricula, would do little if 
anything to solve the social problems pre- 
sented by high school dropouts. The culprit 
may be the value placed on the high school 
diploma. Without it a boy, no matter what 
his basic intelligence, tends to end up in low- 
paying, unsteady jobs or perhaps unemploy- 
ment as he grows older. 

As was stated earlier, boys of talent, if talent 
is broadly defined by above-average intel- 
ligence, are being lost. How severe is this 
loss? In the first place, 14% of the dropouts 
have intelligence test scores at or above the 
69th percentile when compared to their age 
group in high school. These boys are no more 
mobile than nondropouts, that is, they remain 
in the state to the same degree and they do 
not congregate in any one community size 
area more than nondropouts. Sixty-nine per- 
cent of them stay in Minnesota, yet 59% 
receive no further training after quitting 
school, and only 15% reach the occupation 
level of clerical, skilled or above (71% fill 
the ranks of semiskilled, slightly skilled, day 
laborer, and unemployed). As adults they get 
into much more serious trouble with the law 
than does the group of boys who graduate from 
high school (37% of high intelligence dropouts 
have committed offenses ranging from mis- 
demeanors to ones severe enough to warrant 
a prison sentence as compared to 9% of the 
high school gradauates). Clearly, the loss 
seems rather severe: these boys tend to be- 
come a burden to the state; their talent 
seems to be spent in fighting the laws rather 
than making them. These boys, of all of the 
dropouts, have the intelligence potential to 
stay in school at least until high school 
graduation. With a high school diploma they 
could have at least qualified for training pro- 
grams for skilled workers that so often now 
require the diploma for entrance. A study by 
Gibson, Higgins, and Mitchell (1967) offers 
a note of pessimism on preserving the talent 
of dropouts. Of 6,008 dropouts in Toledo 
(both male and female), 7% had potential 
for college and were within commuting dis- 
tance to a college. After individual contact, 
only 3% (13 persons) of this group entered 
college, and of these 13, only 11 persons com- 
pleted the freshman year, and only 4 of these 
were making satisfactory progress. 
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When a comparison is made between drop- 
outs who do and do not get further training 
after quitting school, those that do go on 
can be characterized by the following fea- 
tures: they are more likely to come from 
families at a professional or semiprofessional 
SES level and, interestingly, from semi- 
skilled families. Since they are also more 
likely to profit by their training and secure 
white collar or high blue collar jobs, the 
downward social mobility trend does not show 
up here as it does when dropouts in general 
are compared to high school graduates. They 
are more likely to be of average or above- 
average intelligence. Although they do not 
grow up in the cities and suburbs, they tend 
to migrate there as adults, and they tend to 
leave Minnesota. 

Dropouts in general become burdens to the 
state; the state loses those dropouts most 
likely to succeed (36% of them go to the 
West Coast; 23% to California alone). 
Whereas intelligence and SES do to some 
degree seem to be determinants in returning 
to school for training, the factor of coming 
from intact or broken homes is not. It is the 
boy who drops out early (ninth or tenth 
grade), who comes from a farming family 
or an unskilled labor family in the city, and 
who does not have good intellectual potential 
that not only drops out of high school but 
never seeks further training of any kind. 

Reports of personality testing on dropouts 
compared to high school graduates have been 
infrequent. Cervantes (1965) administered the 
TAT to a group of known dropouts and con- 
trols in New Orleans and Boston (the exact 
number in each group is not clear from the 
report, but it is possibly 50 dropouts and 
50 matched controls), and from these proto- 
cols he arrived, in an unreported way, at 
a list of psychological tendencies of the 
dropout. This list includes 36 descriptive 
items, including troubled, hostile, antagonistic, 
instinctoid [sic], unconventional, maladjusted, 
nonroutinized. In contrast to Cervantes’ data 
the present report gives results on personality 
testing before the time of drop out and at- 
tempts to show personality characteristics 
that can be used in a predictive sense. , 

The ninth-grade personality testing with 
the MMPI yields distinctions between late" 
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dropouts and nondropouts. The results in this 
paper differ somewhat from those presented 
earlier (Hathaway & Monachesi, 1963) be- 
cause they include MMPI profiles considered 
as invalid earlier; for this paper an F score 
less than 22 was considered as valid. There 
is some association between MMPI high point 
in the ninth-grade and later school dropout 
($= .20, N = 1,636), but the predictive 
value of knowing a high point is only slight 
(AB = .17). This does not entirely preclude 
a discussion of the dropout personality types, 
however. On the whole, dropouts tend to get 
a more elevated profile over all scales except 
Scale 5. Especially elevated are Scales 8 and 
4. More dropouts also tend to have profiles 
where 8 is the highest point in the profile 
(31% of all the dropouts have 8 as the 
highest point, compared to 21% of the con- 
trols), An 8 as the highest point occurs more 
among high intelligence dropouts compared 
to high intelligence controls and among drop- 
outs who do not get further training com- 
pared to dropouts who do. Whereas the ele- 
vated Scale 4 suggests that future dropouts 
harbor feelings of rebelliousness and lack 
feelings of social conformity in the ninth 
grade, the more frequent occurrence of 8 as 
the highest point suggests early feelings of 
apathy, difficulty in thinking straight, and 
tendencies to withdraw. The high school 
counselor should be on the alert when he 
finds a boy with a high 8 MMPI profile; he 
could be a future dropout, even if he is bright. 
And if he does drop out of school, he is 
likely to become a real social dependent. 
The Scale 9 findings suggest that a little 
elevation of 9 is a good thing but too much 
is detrimental. Dropouts have more elevated 
9s but high school graduates more frequently 
have 9 as the highest point on the profile. An 
elevated 9 is associated with a high energy 
level; perhaps those dropouts with elevated 
9s had too much energy to stick to school. 
Among the statewide sample as a whole, a 
high 9 profile was the most common (Hatha- 
way & Monachesi, 1963); these adolescent 
boys in general are more often characterized 
by adjectives such as sociable, enthusiastic, 
forward, talkative, and verbal; and boys who 
graduate from high school seem to be espe- 
cially characterized by such adjectives. The 
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further finding that there are more than the 
expected number of high point 3 profiles 
among those of the controls is in line with 
the above description. High point 3s are de- 
scribed as socially conforming, persevering, 
energetic, cooperative. 


CONCLUSION 


Although the picture painted by this study 
is dark, it is not quite so black as that from 
studies that cover large urban areas or the 
South. In the late 1950s, at least, Minnesota 
had a comparatively low dropout rate for 
high school boys—18% versus a percentage 
like 59 for a slum area of a large city (Conant, 
1961). Only 5% of the Minnesota dropouts 
were unemployed by their late 20s (which 
is only slightly more than the national figure 
of 3.8% for white males aged 25-34 in 1962, 
Bienstock, 1967), although approximately 
50% of them are likely candidates for future 
unemployment if the Department of Labor 
predictions are correct. The deep problem in 
the country today is concerned with the 
futures of deprived persons, especially mem- 
bers of racial minority groups, where even 
a high school diploma is of little value in 
securing employment. The data of this study 
suggest that in predominately white, mid- 
western America, the high school diploma is 
a passport to occupational security. They 
show, however, that certain background char- 
acteristics of adolescent boys are influential in 
determining whether or not a diploma is 
attained: socioeconomic status of the parents, 
family size, intelligence, and personality. 
These characteristics have long been recog- 
nized and discussed in the literature, and so 
the findings in this report do not come as a 
surprise. What is surprising are the findings 
that if a boy does drop out of high school, 
these same characteristics have little influence 
on his later career. 
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In one of the most sophisticated studies of 
accident-related personality traits, Conger, Gas- 
kill, Glad, Hassel, Rainey, and Sawrey (1964) 
hypothesized 13 variables to be significantly re- 
lated to accident frequency. These variables, for 
example, amount of underlying hostility, reality 
orientation, conventionality, extent of fantasy 
preoccupation, ability to tolerate various tensions, 
etc., as well as the hypothetical composite acci- 
dent type described by Conger et al., contain 
many personality characteristics that one would 
expect to find in emphasized or exaggerated form 
in the hospitalized mental patient. 

To explore the degree of relationship between 
driver records of neuropsychiatric patients (NP) 
and their scores on psychological tests adminis- 
tered as part of the routine admission procedure, 
and to develop criteria for use in the identifica- 
tion of high-risk drivers among neuropsychiatric 
patients, the driving records of 533 NP patients 
admitted to Sepulveda Veterans Administration 
hospital from August 1964 through July 1967 
were compared with published data for a random 
sample (N = 86,717) of California drivers. Their 
accident records did not differ statistically, but 


+ An extended report of this study may be ob- 
tained without charge from Matthew W. Buttiglieri, 
Assistant Chief, Psychology Service, Veterans Admin- 
istration Hospital, Sepulveda, California 91343, or 
for a fee from the American Society for Information 
Science. Order NAPS Document 00364 from ASIS 
National Auxiliary Publications Service, c/o CCM 
z3 ormation Sciences, Inc., 22 West 34th Street, New 
rs New York 10001; remitting $1.00 for micro- 

che or $3.00 for photocopies. Requests for reprints 


should be sent to the first author at the above 
address, 


violation records of the NP sample were higher 
(p < .001) than the compared group. 

A regression analysis approach was used to 
examine the possibility that variables from 
the MMPI, Wonderlic Personnel Test, and 
selected biographical variables would aid in the 
prediction of the patients accident record. Three 
variables which were significantly related to acci- 
dent records were selected using the first 2-year 
sample; Schizophrenic Reaction, other; Psycho- 
neurotic diagnosis; and Ma score. A drink 
(yes/no) variable was added on rational grounds. 
Diagnosis Schizophrenic Reaction, other; Ma 
score; and drinking were validated by the third- 
year sample. The resulting regression equation 
accounted for 9% of the variance in accident 
involvement. 

The possibility of improving prediction of 
accident records by selection of subpopulations 
in which prediction is more effective was also 
explored. Lack of comparative data on the non- 
hospitalized population prevented comparison, but 
the results are suggestive for further studies. 

In general, the results indicated that patients 
with diagnosis Schizophrenic Reaction, other, had 
a lower accident rate, while patients who drank 
or had a high Ma score had a higher accident 
rate. 
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In a time of supposed alienation of youth from 
their elders, an investigation of the relationship 
between parental reinforcement and patterns of 
verbal expression becomes of particular impor- 
tance. Jourard (1964) describes self-disclosing 
behavior and reports females to be more dis- 
closing than males; mothers disclosed to more 
often than fathers or friends; and differences in 
what people reveal about themselves, Heilbrun 
(1964), using the Parent-Child Interaction Rating 
Scale (P-C), found mothers receiving higher 
nurturance ratings than fathers from their chil- 
dren and this difference was greater for male Ss 
than females. 

This study attempts to relate perceived child- 
rearing practices and self-disclosure as well as to 
replicate some of Jourard’s findings. Hypotheses 
are (a) home environments seen as nurturant 
will yield greater disclosure than low-nurturant 
homes and (b) parental nurturance will have 
greater influence on disclosure to parents than 
to friends. College Ss were given the Self- 
Disclosure Questionnaire (SDQ); the P-C; and 
the Marlowe-Crowne Social Desirability Scale 
(M-C SDS—Crowne & Marlowe, 1964). 

A significant positive relationship was found 
between nurturance ratings for each parent 
(r= .61, p< .01, N = 110), suggesting that off- 
spring see similarity in the child-rearing attitudes 
of their parents. Analysis of variance (N = 80) 
revealed a significant main effect of greater over- 
‘all disclosure by Ss who saw parents as highly 


1An extended report of this study may be ob- 
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partment of Psychology, Emory University, Atlanta, 
Georgia 30322, or for a fee from the American Soci- 
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reprints should be sent to Bonnie Strickland, Depart- 
ment of Psychology, Emory University, Atlanta, 
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nurturant than by Ss who saw their family back- 
ground as low in nurturance. A significant inter- 
action of Parental Nurturance X Target Person 
indicates that Ss from low-nurturant homes dis- 
close more to friends than to parents, with the 
reverse being the case for Ss perceiving their 
parents as nurturant. 

Significant main effects for sex differences and 
for differences in disclosure output to target per- 
sons were not found. However, trends are toward 
more disclosure to mother than to father, and 
more disclosing on the part of females than 
males. A significant Sex X Target Person inter- 
action showed that females disclose more to 
mother and female friend than to father and 
male friend. Males are about equally disclosing 
to mother, father, and male friend, but less to 
female friend. The Ss reported revealing signifi- 
cantly more about public aspects of self than 
private or personal aspects and friends are more 
often the recipients of public information, and 
parents the recipients of personal information. 
Intercorrelations of the M-C SDS with SDQ and 
P-C were generally low and nonsignificant. Thus, 
a social desirability response set does not appear 
to be operating in the data. 

Surprisingly, our student sample indicates @ 
turning to parents for personal disclosure and 
perhaps not so surprisingly a relationship between 
stated self-expression to parents and perceived | 
parental nurturance. Focusing attention on the 
nature of messages communicated is probably the | 
next logical step in research. 
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FOLLOW-UP OF 812 GIRLS 10 YEARS AFTER 
HIGH SCHOOL DROPOUT’ 


STARKE R. HATHAWAY,? PHYLLIS C. REYNOLDS, ann ELIO D. MONACHESI 


University. 


of Minnesota 


A dropout group and a nondropout group were compared on several variables. 


The data both predated and postdated 
the groups were subdivided on time 


the time of school dropout. For analysis 
of marriage and intelligence. Dropouts 


have a higher separation and divorce rate, even when time of marriage is con- 


trolled; they have larger families, but 


apparently only because they have been 


married longer. In general they marry into a lower socioeconomic status, but 
only girls from blue collar families show a downward social mobility. When 
grouped by intelligence, high and medium intelligence dropouts do not show 
the upward social mobility that nondropouts do. Dropout for girls seems less 


catastrophic than it does for boys. 


Rich man, poor man, beggar man, thief, 
doctor, lawyer, merchant, chief—little girls 
count on their buttons to predict whom they 
will marry when they grow up, the prediction 
of the day depending on the number of but- 
tons on their clothes. Even if a little girl were 
enterprising enough to search the psychologi- 
cal and sociological literature, she could not 
really improve on her button method; there is 
a paucity of studies on the social mobility of 
women. It is known, for example, that drop- 
ping out of high school has a real effect on 
the socioeconomic status of boys; the social 
mobility is in the downward direction for 
these boys by 10 years after quitting high 
school (Hathaway, Reynolds, & Monachesi, 
1969). It has not been known, however, if 
there is a similar effect on the socioeconomic 
status of dropout girls by the time they 
marry, although Wolfbein (1959) shows that 
more girl dropouts have unskilled first jobs or 
ate unemployed compared to high school 
graduates 1-5 years after quitting school. 

Much has been written recently about the 
dropout (Cervantes, 1965; Greene, 1966; 
National Committee for Children and Youth, 
1961; Schreiber, 1964, 1967); however, in 
Practically all of this literature, female drop- 
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outs are either excluded or are included but 
not distinguished from male dropouts. As 
Deutsch (1964) says, 


it is often in school that another element of the 
dropout problem related to another type of discrimi- 
nation takes place—and it is an element that is often 
ignored. That is, the female dropout. Too often, dis- 
cussion of dropouts deal with all cases as an un- 
differentiated totality or concentrate on males, with- 
out recognizing that many of the elements responsible 
for such a high dropout rate among American In- 
dians, Negro Americans, etc., are similar in the case 
of females [p. 213]. 


The present paper explores some variables 
that distinguish girls who drop out of high 
school from girls who graduate, and compares 
the fate of the female dropout to that of the 
male dropout. 

We were unable to develop the findings 
from the Minnesota Multiphasic Personality 
Inventory (MMPI) and certain other varia- 
bles that were analyzed in the report of the 
careers of dropout boys (Hathaway et al., 
1969). We do, however, have these data on 
girls and expect to publish them in another 
report. 


PROCEDURE 


Samples 


The dropout and control samples for this study 
were drawn from 11,329 Ss who comprised the 
statewide sample in a longitudinal study of ado- 
lescent personality and behavior which began in 1953 
when the Ss were ninth-graders in Minnesota schools. 
The schools were selected in 1953 to represent Min- 
nesota’s diverse economic and geographic areas. The 
statewide sample was a stratified random one except 
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that it did not include either Minneapolis or St. 
Paul. It did include 13 Twin City suburbs of varying 
social status level and the city of Duluth. It was 
impossible to record race when the data were gath- 
ered; because of the racial composition of Minne- 
sota, however, it is safe to say that the sample rep- 
resents a mainly white, midwestern society. For a 
more detailed description of the data collection and 
other generalities on this longitudinal study see 
Hathaway and Monachesi (1963). 

The original number of girls was 5,628, of whom 
816 (14%) dropped out of school before graduation. 
Because four of the dropouts could not be found at 
follow-up time, the basic total dropout N for this 
paper was 812, The control sample was 812 girls 
who graduated from high school. These girls were 
selected on a random basis within each school to 
match in number the dropouts from that school. 


Variables 


During the high school years 1953-1957 data were 
gathered on the following variables pertinent to this 
paper: socioeconomic status (SES), as defined by the 
Minnesota Scale for Paternal Occupations (Institute 
of Child Welfare, undated) ; intelligence test results 
based on ninth- or twelfth-grade ACE or, in a few 
instances, IQ equivalents; reason for drop out; and 
time of drop out. 

During a 1962-1966 follow-up, data were gathered 
on year of marriage, present marital status, number 
of children, occupation and SES of husband, and 
number of school years eventually completed. 

Although 94% of the original sample was contacted 
in some way, because of the usual difficulties inherent 
in a longitudinal study the numbers of lost cases on 
follow-up variables ranged from 29% (SES of hus- 
band) to 11% (present marital status) for the drop- 
outs; the similar attrition of numbers for the con- 
trols was proportionately less because, as a group, 
the controls were easier to locate at follow-up time. 


RESULTS 


Data published earlier (Hathaway & Mona- 
chesi, 1963) showed that girls who drop out 
of school significantly more frequently come 
from blue collar and less frequently from 
white collar families than do girls who com- 
plete high school, that they more frequently 
come from broken families, and that they 
more frequently have low intelligence test 
scores. Among the 82% of dropout girls for 
whom a reason for drop out was known, 38% 
gave marriage as the reason. This was the 
most frequent single reason. Twenty-two per- 
cent professed lack of interest, and 15% 
were reported to be pregnant out of wedlock 
(Pauker, 1968). Nine percent dropped out to 
work. Only 5% dropped out with failure in 
school as the apparent reason, 
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At the end of the follow-up, only 5% of the 
dropouts had remained single, which con- 
trasted to 16% of the controls who were still 
single. Comparison of the married groups 
showed that the rate of separation or divorce 
was significantly higher for dropouts than for 
controls. When dropouts alone were grouped 
by time of marriage (before June 1957 and 
after June 1957), there was no significant 
difference in divorce rate between the two 
groups. But when those dropouts married af- 
ter June 1957 were compared to controls 
married after June 1957, the divorce rate for 
the dropouts was significantly higher. 

The dropouts who married early had sig- 
nificantly larger families than later dropouts: 
37% of the early marrieds had four or mote 
children compared to 9% of the after-June 
1957 group. When all dropouts were com- 
pared to controls, there was also a significant 
difference in family size: 27% of all dropouts 
had four or more children compared to only 
8% of all controls; only 6% of the dropouts 
had no children, compared to 19% of the 
controls. When only those dropouts and con- 
trols who married after June 1957 were com- 
pared, however, there was no significant dif- 
ference in family size. Dropping out of school 
does not seem to be an indication of high 
fecundity; as a whole, dropouts marry earlier 
than graduates and therefore have a greater 
opportunity to produce children. Also, as m- 
dicated above, pregnancy is a determiner of 
dropout so the early marriage group, at least, 
is biased. 

The Hathaway and Monachesi (1963) 
study did not show evidence of any clear-cut 
linear relationship between time of dropout 
and severity of social problems or intelli- 
gence. Similarly, the data of the present study 
did not show significant relationships between 
time of marriage (early vs. later) of dropouts 
and the variables of socioeconomic level of 
the parents, present socioeconomic status, 
parents’ marital status, or intelligence test 
scores. As stated above, number of children 
produced the only significant difference. Simi- 
larly, and in line with the earlier results, "° 
significant relationship was found between 
time of dropout and present socioeconomic 
level. 


" Because intelligence test scores were found 
‘to differentiate dropouts from controls in the 
eatlier study, the dropouts were grouped by 
high (69-100 percentile), medium (40-68 
percentile), and low (1-39 percentile) apti- 
tude scores and compared on several of the 
variables, Among these dropout groups, the 
intelligence levels showed no significant rela- 

“tionship to years of school eventually com- 
‘pleted (less than high school vs. eventual 
high school graduation or higher), early or 
‘ater marriage, present marital status, pres- 
ent socioeconomic level, or number of chil- 
dren. 

When both the dropouts and the controls 
were divided by intelligence and high was 
compared to high, medium to medium, and 
low to low, a different pattern appeared. Both 
high and medium intelligence groups showed 
significant frequency differences in marital 
status categories, but in both cases the largest 
contribution to the significance level came 
from the different percentage of single girls in 
each group. As was reported above, the con- 
trols included a much higher frequency in 
this category. When the unmarried women 
were eliminated from the samples and the 
still-marrieds were compared to the separated 
Or divorced girls, the high intelligence drop- 
outs had a greater separation and divorce rate 
than high intelligence controls; between the 
medium intelligence groups this difference 
Was less reliable, and differences between low 
intelligence dropouts and low intelligence con- 
‘trols fell to an unreliable level. All three apti- 
tude level dropout groups had significantly 
latger families when compared to their con- 
trol groups. This may relate to the selection 
factor mentioned above for family size. 

Dropouts differed significantly from con- 
trols on present socioeconomic level (based on 
husband’s occupational level). Table 1 gives 

percentages in each SES category. The 

tences in the slightly skilled and the un- 
employed categories are not meaningful. Even 
with these relatively large Ws, the SES levels 
had to be grouped into: (a) white collar (pro- 
fessional, semiprofessional, clerical, skilled) ; 
(6) blue collar (semiskilled, slightly skilled) ; 
(c) day laborer and unemployed; and (4) 
F , for other comparisons. With these 
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TABLE 1 


OCCUPATIONAL LeveL OF HusBANDS OF DROPOUTS 
COMPARED TO THOSE OF CONTROLS 


Occupational level Dropouts | Controls 

Professional and semi- 

professional 4 21 
Clerical, skilled 21 23 
Semiskilled 30 26 
Slightly skilled 24 17 
Day laborer 5 1 
In service (only) 6 5 
Farmer 7 5 
Unemployed 3 t1 

Total 100 99 


Note.—All entries are in percentages. For dropouts, N = 
481; for controls, Ñ = 491. x? = 82.61, df = 7, p <,001, 


four classes, significant differences in SES 
level continued to appear when the dropout 
and control samples were grouped by intelli- 
gence. The high intelligence and medium in- 
telligence controls showed significant differ- 
ences. For the high intelligence groups the 
difference was most apparent at the white 
collar level: far fewer than the expected num- 
ber of dropouts married men who had white 
collar jobs. For the medium intelligence groups 
the difference was more at the day laborer and 
unemployed level, with twice the expected 
number of dropout girls who married men at 
this lower occupational level. The comparison 
of low intelligence dropouts to low intelligence 
controls produced no statistically significant 
difference. These findings are shown in Table 
2 which gives the percentages in each SES 
category for each intelligence group. Whereas 
the difference between the high and low in- 
telligence control group in the white collar 
category was 26%, the difference in the same 
category for the dropout group was only 8%. 
Similarly, the control group blue collar cate- 
gory difference from high to low intelligence 
was 21%, and for the comparable dropout 
group was only 5%. The farmer category was 
almost constant across the table. 

Although the data showed that girls who 
drop out of high school tended as a group to 
come from lower SES levels and to continue 
by marrige at lower SES levels, a study of 
social mobility following each girl could show 
if, among dropouts, there was really upward 
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TABLE 2 


PERCENTAGES OF MARRIED Dropouts AND CONTROLS IN EacH SES CATEGORY (BASED ON 
HUSBAND’S OCCUPATION) WHEN GROUPED BY INTELLIGENCE PERCENTILES 


High intelligence* Medium intelligence Low intelligence’ 

SHS caterer Dropout Control Dropout Control Dropout Control 

(N= 103) (N= 180) (N= 128) (N=128) (N= 203) (N=156) 
White collar 31 58 28 48 23 32 
Blue collar 56 36 52 44 61 57 
Day laborer or unemployed 6 1 12 0 9 5 
Farmer 7 5 9 8 7 S 
Total 100 100 101 100 100 99 


a x? = 20,22, df = 3, p < .001. 
bx? = 23.95, df = 3, p < .001. 
e x? = 5.10, df = 3, p < .25. 


or downward mobility. The SES level of the 
family of origin of each married dropout and 
control S was compared to the occupational 
level of S’s husband. When compared to the 
controls by this procedure, only the dropouts 
whose parents were at the blue collar level 
showed a significant downward mobility trend. 
Far fewer than the expected number of girls 
married into the white collar class and far 
more than the expected number married men 
who were day laborers or unemployed. The 
percentages of girls moving from blue collar 
families into each SES category are given in 
Table 3. 

None of the other SES mobility data 
yielded significant differences. Thus, if a girl 
who drops out of school comes from a white 
collar family, she is just as likely to marry a 
man with a white collar job as a girl from the 
same SES level who finishes school. 

Another look at SES mobility was provided 
from grouping by intelligence test percentiles. 
For these analyses coarser SES grouping had 
to be used because of small Ns. The blue col- 
lar, day laborer, and unemployed categories 
were combined. As with other analyses where 
the girls were grouped by intelligence, it was 
the high and medium intelligence dropouts 
who contributed the chief differences. The low 
intelligence dropouts did not differ signifi- 
cantly from the low intelligence controls in 
any of the SES movement categories. The high 
intelligence dropouts from blue collar or below 
SES families did not show a reliable trend to 
upward mobility as did their comparable 


control groups (x°, $ < .001). This same re- 
lationship was found in the mobility trends 
between medium intelligence dropouts and 
medium intelligence controls, but with lesser 
certainty (xê, p< .01). This same relation- 
ship was found in the mobility trends between 
medium intelligence dropouts and medium in- 
telligence controls, but with lesser certainty 
(x*, b < 01). Because of the necessary coarse 
grouping of blue collar with day laborer and 
unemployed, a downward trend, if it existed, 
could not be demonstrated. Table 4, whi 
compares the SES mobility between low and 
high intelligence groups for both the dropouts 
and controls, shows that, for the controls, in- 
telligence seems to have some effect on SES 
level; a high intelligence girl from a white 
collar family tends to marry a man at the 
white collar level, or if she comes from a blue 


TABLE 3 


MOVEMENT oF DROPOUTS AND CONTROLS FROM Bue 
Cortar Famtty or ORIGIN TO FAMILY OF 
PROCREATION LEVEL 


Dropouts | Controls 
Item (N= 193) | (w=207) 
Parent to husband 6 
Blue collar to white collar 26 4 
Blue collar to day laborer/ 
unemployed 60 47 
Blue collar to farmer 6 5 
Total 101 9 
2 = 27.19, df =* 


Note,—All entries are in percentages. x’ 
<.001. 


| 
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‘ TABLE 4 
if í COMPARISON or SES MOBILITY BETWEEN LOW AND HIGH INTELLIGENCE DROPOUTS 
AND LOW AND HIGH INTELLIGENCE CONTROLS 
Dropouts Controls 
SES mobility 
7 Low High Low High 
intelligence | intelligence | intelligence | intelligence 
Parent to husband 
White collar to white collar 39 53 40 67 
White collar to blue collar/day laborer/unemployed 61 47 60 33 
Total 100 100 100 100 
N 31 19 25 61 
a b 
Blue collar/day laborer/unemployed to white collar 23 22 34 56 
Blue collar/day laborer to unemployed 77 78 66 44 
Blue collar/day laborer/unemployed 
| Total 100 100 100 100 
es 105 54 91 80 
| c d 
i l 
r eet A a in percentages. 
l gr = 12 < 1025. 
: dxf = 7,70, df = 1, p <.01. 
collar level, she tends to marry above the Discussion 


ate not apparent for the girl who drops out of 
high school; a high intelligence girl from a 
white collar family is just as likely as a low 
‘intelligence girl from a white collar family to 
_ Marry in the same broad social class. 

: _As a final analysis of social mobility, data 


: level of her family of origin. These tendencies 


were taken from the authors’ report on drop- 
Out boys (Hathaway et al., 1969) for com- 
“Parison to similar data on the girls. Table 5 
Prestnts these comparisons for male and fe- 
! male dropout groups and the control groups. 
Although differences are not marked, the pat- 
tern for male and female controls seems to be 
that of stable to upward social mobility, but 
When male and female dropouts are compared, 

a downward trend for the boys is apparent. 
_ Table 6 gives the actual outcome propor- 
tions of male and female dropouts in each 
i SES category, that is, the level of the S, if a 
Male, or the spouse, if a female. Clear differ- 
ences between males and females appeared at 
both extremes—more than the expected num- 
ber of dropout females married into the pro- 
fessional/; semiprofessional level and more than 
the expected number of dropout males became 


day laborers, 


Girls in Minnesota who drop out of high 
school before graduating can be characterized 
as of average to below-average intelligence 
(only 22% have intelligence test scores above 
the 68th percentile) and as coming from low- 
income families (66% of the parents are at 
the semiskilled level or below). They marry 
readily and early, and, when compared to 
girls who graduate from high school, they 
tend to marry men with low income jobs 
(62% of the husbands are at the semiskilled 
level or below). Among these dropout girls, 
however, being bright or not so bright seems 
to have little relationship to their eventual 
socioeconomic level. Ten years later, the 
dropouts average out with larger families 
than nondropouts mainly because many of 
them have been married longer, but their sep- 
aration and divorce rate is higher even when 
compared to nondropouts who have been mar- 
ried the same length of time. 

Tf socioeconomic level of the family she es- 
tablishes can be used as a criterion of suc- 
cess in life, social mobility data can help to 
answer the question of how much effect 
school dropout has on a girl’s later chances 
of success. A comparison with nondropouts of 


388 


the same parental SES level shows that drop- 
out girls who come from blue collar (semi- 
skilled and slightly skilled) families, and only 
this group, have a tendency to downward mo- 
bility; 60%, however, remain at the same blue 
collar level when they marry. When compared 
to their control group, dropout girls from 
white collar families show no downward 
trend, and dropout girls from day laborer and 
unemployed families show an upward mobil- 
ity as much as does their control group. 


TABLE 5 
SES Mosiury COMPARISONS BETWEEN MALE AND 
FEMALE Dropout GROUPS AND THE 
CONTROL Groups 


Dropouts Controls 
SES mobility 
Female | Male | Female | Male 
Parent to husband 
White collar to white 
collar 40 23 57 61 
White collar to blue 
collar 54 63 42 34 
White collar to day 
laborer/ un- 
employed 6 14 1 5 
Total 100 100 100 100 
N 70 103 112 168 
a b 
Blue collar to white 
collar 27 20 49 44 
Blue collar to blue 
collar 63 63 50 48 
Blue collar to day 
laborer/un- 
employed 9 17 1 8 
Total 100 100 99 100 
N 182 262 196 | 303 
o d 
Day laborer/un- 
employed to 
white collar 24 18 41 35 
Day laborer/un- 
employed to 
blue collar 64 60 51 54 
Day laborer/un- 
employed to day 
laborer/un- 
employed 12 21 8 10 
Total 100 99 100 99 
e t 
X ! | 


Note.—All entries except a are in percentages. 
<05. 


ext = .13, df = 2, p < 30. 
a 2 3.00 i ep E 
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TABLE 6 


COMPARISON OF Outcome SES LEVEL or 
FEMALE AND MALE Dropouts 


Outcome SES level Female Male 

Professional and 
semiprofessional 4 2 

Clerical, skilled 22 17 
Semiskilled 31 27 
Slightly skilled 26 31 
Day laborer 6 11 
Farmer 8 7 
Unemployed 3 5 

Total 100 100 


Note.—All entries are in percentages. For females, N = 453; 
for males, N = 680. x? = 23.93, df = 6, p <.001, 


In general, high school dropout for girls 
does not have the severe ramifications that it 
has for boys. Similar to the girls, boy drop- 
outs tend to come from low-income families, 
but unlike the girls, their social mobility di- 
rection is downward regardless of the parental 
SES level. It is only at the day laborer, un- 
employed level where boy dropouts do not 
differ significantly from girl dropouts in mo- 
bility direction, although the downward trend 
for boys is still apparent. Among male drop- 
outs, 57% have intelligence test scores at of 
below the 39th percentile; among the female 
dropouts, the percentage is 47. When each of 
these groups is compared to its control group, 
that is, boys or girls of the same intelligence 
level who graduated from high school, the 
data show that dropout girls as a gro do 
not suffer the consequences that dropout ys 
do regarding eventual socioeconomic level 
(81% of the dropout boys had jobs at the 
semiskilled level or below, compared to 56% 
of the control boys; 70% of female low-intel- 
ligence dropouts married at the semiskilled 
level or below, compared to 62% of the com 
trol low-intelligence group). These findings 
can be interpreted as an indirect indication of 
the importance of the high school diplom 
for boys and the lesser importance it has for 
girls—the most “justifiable” dropouts woul 
be the ones who lack the intelligence to finish 
the academic curriculum of the average big 
school, and girls whose dropping out is thus 
“justified” fare no worse than girls of co™ 
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parable intelligence who persevere to the end 
of high school. 
Another group of girls who should be con- 
sidered, however, are those of high intelli- 
gence. Do these girls, who have the mental ca- 
pacity to continue on to the high school di- 
ploma and beyond, lose out in the long run? 
‘And does society lose out because of wasted 
talent? Twenty-two percent of the girl drop- 
outs were of high intelligence. When these 
‘girls are compared to girls of high intelligence 
who finish high school, a clear case of handi- 
cap is demonstrated: only 5% of the dropouts 
eventually went back to school to get their 
high school diploma, and of these, one girl had 
1 year of college, one girl finished college, and 
one had 1 year of postgraduate work. Gibson, 
Higgins, and Mitchell (1967) found that 117% 
of high school dropouts (both male and fe- 
male) in Toledo had “potential for college.” 
After contact and invitation to participate in 
a workshop that led to the university, only 
2% of this group completed their freshman 
year, These authors conclude that “more 
fruitful efforts might be directed toward ‘talent 
retention’ rather than ‘talent reclamation.’ ” 
The present data suggest that even without a 
reclamation process, 2% of dropout girls with 
potential go on to college, casting doubt on 
the value of a reclamation program, yet con- 
curring with the Gibson et al. conclusion. As 
a reminder of contrasting careers, 657 of 
the high school graduates went on to college 
or training (e.g., nursing), and of these, 35% 
finished college. 
_ Among the high intelligence dropouts, 97% 
‘had been married by the time of follow-up, 
And of these, 13% were either separated or 
divorced. Again, for contrast, 81% of the 
high school graduates were married, and of 
these, 5% were separated or divorced. Twenty- 
seven percent of the high intelligence drop- 
“outs had four or more children, but only 5% 
of the graduates had as many. Only 31% of 
the dropouts married at the white collar level 
(only 7% at the professional, semiprofessional 
level) compared to 58% of the high school 
Btaduates (31% at the professional, semipro- 
fessional level). Only 20% of the dropout girls 
“came from white collar families (compared to 
35% of the graduates); the dropouts, how- 
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ever, did not marry below themselves in so- 
cial class any more than did the graduates. It 
appeared to be the dropouts from the blue 
collar and lower SES levels whose social mo- 
bility was downward in comparison to the 
graduates. 

Tf talent existed among these high intelli- 
gence dropouts, it now seems to lie dormant 
in typically, a young lady who is married, 
whose husband has a semiskilled job, who has 
several children and who, perhaps because of 
lack of time, has never obtained her high 
school diploma. 


CONCLUSION 


Few studies have been reported on girls 
who drop out of high school before graduation. 
Perhaps researchers have suspected what is in 
part borne out by this paper: the consequences 
of drop out for girls are not so severe as they 
are for boys. Future research on dropouts 
should study each sex separately. In the 
1950s, only 14% of girls in Minnesota quit 
school before graduation. Ten years later 
practically all of these girls had married and 
many had produced sizable families. Their 
social mobility was more stable than for boys 
who dropped out of school. Both boy and girl 
dropouts tended to, in the authors’ data, come 
from and continue at the lower socioeco- 
nomic levels. Very few of the girls returned to 
school to obtain a diploma. As a group they 
had a higher separation and divorce rate than 
the girls who graduated. This was true even 
when a correction was made to equate the 
time of marriage. 


REFERENCES 


Cervantes, L. F. The dropout: Causes and cures. 
Ann Arbor: University of Michigan Press, 1965. 
Deutscx, M. Early social environment: Its influence 
on school adaptation. In D. Schriber (Ed.), The 
school dropout. Washington, D. C.: National Ed- 

ucation Association, 1964. 

Grsson, R. L., HIGGINS, R., & MırcrerL, M. H. The 
high school dropout goes to college. Personnel and 
Guidance Journal, 1967, 45, 824-827. 

Greene, B. I. Preventing student dropouts. Engle- 
wood Cliffs, N. J.: Prentice-Hall, 1966. 

Haraway, S. R., & MONACHESI, E. D. Adolescent 
personality and behavior. Minneapolis: University 
of Minnesota Press, 1963. 


390 S. R. Harnaway, P. C. Reynotps, AnD E. D. MONACHESI 


Harmaway, S. R., Reynotps, P. C., & Monacuest, E. 
D. Follow-up of the later careers and lives of 1,000 
boys who dropped out of high school. Journal of 
Consulting and Clinical Psychology, 1969, 33, 370- 
380. 

InstrurTe or Caup WELFARE, UnrversITY or MIN- 
NESOTA. Minnesota Scale for Paternal Occupations. 
Minneapolis: Author, undated. 

NATIONAL COMMITTEE FOR CHILDREN AND YOUTH. 
Social dynamite: The report of the conference on 
unemployed, out-of-school youth in urban areas. 
Washington, D. C.: Government Printing Office, 
1961. 


Pauxer, J. D. Girls pregnant out of wedlock: Are 
they pregnant because they are different, or are 
they different because they are pregnant? Paper 
presented at the National Conference on Social 
Welfare, San Francisco, May 1968. 

Scurerer, D. (Ed.) The school dropout. Washing- 
ton, D. C.: National Education Association, 1964, 

Scurerer, D. (Ed.) Profile of the school dropout, 
New York: Random House, 1967. 

Worrsem, S. L. Transition from school to work: A 
study of the school leaver. Personnel and Guidance 
Journal, 1959, 38, 98-105. 


(Received July 26, 1968) 


o! T 
> 4969, Vol. 33, No. 4, 
me 


X 


C E O A E 


E 


Consulting and Clinical Psychology 
Maa of 391-395 


4 


g 
+ 
y 


BODILY CONCERNS AND THE WISC OBJECT 
ASSEMBLY SUBTEST 


GERALD MARSDEN? anp NEIL KALTER? 


University of Michigan 


Previous examinations of the hypothesis that bodily concerns are reflected in 
impaired WISC Object Assembly performance have produced contradictory 
results, but have also used different criteria for assignment of Ss to two S 
groups. To investigate whether these differences account for the discrepant 
results, 73 white, latency and early adolescent boys were assigned to three 
groups differing in intensity of bodily concerns. One-way analyses of variance 
with covariance adjustments for relevant variables failed to yield any support 
for the hypothesis. Theoretical limitations of the hypothesis were discussed 
and a more appropriate approach suggested and supported by prima facie evi- 
dence gleaned from analysis of the finding that intensely bodily concerned boys 
performed significantly better (p < 01) than others on the Information subtest. 


Blatt, Allison, and Baker (1965) found 
support for the hypothesis that concerns 
about body intactness and integrity are re- 
flected in the Wechsler intelligence tests by 
a subtest pattern in which the Object Assem- 
bly (OA) subtest score is lower than the 
scores of many of the other subtests. Among 
the varieties of data brought to bear on this 
hypothesis was a comparison of two groups 
of children who differed in concerns about 
body intactness. Records of children previ- 
ously evaluated at a child guidance center 
were reviewed by a clinical psychologist, not 
familiar with the hypothesis, who assigned 
them either to a group clearly expressing 
bodily concerns or to a group for which bodily 
concerns were not believed to be a dominant 
issue. The Wechsler Intelligence Scale for 
Children (WISC) OA subtest scaled scores of 
the children in these groups were then com- 
pared, and found to differ, as expected, to a 
degree falling just short of the .05 level of 
Significance. When the groups were compared 
in terms of the deviation of OA scaled scores 
ftom mean scaled scores computed across all 
11 subtests and across the five Performance 
Scale subtests, the expected difference was 


1 Requests for reprints should be sent to Gerald 
Marsden, Children’s Psychiatric Hospital, University 
tien Medical Center, Ann Arbor, Michigan 


2 The authors gratefully acknowledge the statistical 
Consultation provided by J. E. Keith Smith, and the 
helpful criticism of an earlier draft by Albert C. 
Cain, Martin Mayman, and Paul Meisel. 


again found, this time significant in both 
analyses at the .03 level. Similar analyses of 
the other WISC subtests yielded no significant 
differences. 

Rockwell (1967) criticized this portion of 
the Blatt et al. study on the grounds that 
the V was too small to permit generalization 
(the two groups totaled 13 Ss), that Ss in 
these groups were apparently not matched in 
any way, that the groups were biased with 
respect to sex (three girls were included in 
the two groups), and that the bodily concern 
group contained one S with a very low Full 
Scale IQ (FSIQ) score (when this S was 
removed in Rockwell’s reanalysis of the Blatt 
et al. data the significant relation reported 
above failed to appear). Rockwell then repli- 
cated the study. He selected two groups of 
15 boys each from the files of the University 
of Minnesota Child Development Clinic. They 
were matched within two IQ points on WISC 
FSIQs and all had IQ scores of 90 or above. 
Rockwell found no significant relation be- 
tween the bodily concern groups on OA or 
on any other subtest when he repeated the 
analyses employed by Blatt et al. 

One cannot be certain, however, that 
Rockwell’s effort constituted a replication of 
the Blatt et al. study. The point at issue has 
to do with the criteria for assigning Ss to the 
two groups. Blatt et al. appear to have in- 
cluded in the bodily concern group children 
for whom bodily concerns were a dominant 
issue, and in the control group children who, 
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while they may have had some degree of 
concern about bodily intactness, were more 
concerned about other issues. Rockwell, on 
the other hand, included in his bodily concern 
group children whose records included the 
statement that bodily concerns were present, 
while the records of the children in his control 
group contained no mention of bodily con- 
cerns. Thus Rockwell’s bodily concern group 
may well have contained children who would 
have been assigned to either of the Blatt et al. 
groups. And the Blatt et al. control group 
may have contained some children that 
Rockwell would have assigned to his bodily 
concern group, The question to be examined 
in this report is whether these apparent dif- 
ferences in the composition of the groups can 
account for the discrepancies in the findings 
reported by Blatt et al. and Rockwell. 


METHOD 


Records of children who had been evaluated at 
Children’s Psychiatric Hospital? were checked by a 
secretary to screen out those in which a complete 
WISC record was unavailable. The remaining records, 
including social work, psychiatric, and psychological 
reports (but not psychological test protocols), and 
treatment notes were then drawn alphabetically for 
study by the junior author. He was, at that time, 
not familiar with the hypothesis, having been told 
only that the study involved a possible relation be- 
tween certain WISC subtests and bodily concerns. 
His task was to eliminate all cases in which there 
was question of organicity or in which the WISC 
had been administered previously, and to assign the 
remaining cases to one of three groups. 

Group 1 was comprised of cases in which concern 
about bodily damage, integrity, or intactness repre- 
sented the primary mode of giving expression to the 
concerns, conflicts, or impulses which lay at the 
heart of the psychopathology and presenting prob- 
lems for which the child was being evaluated. In 
each of these cases two or more of the evaluators, 
particularly psychologists and psychiatrists, agreed 
that bodily concerns were of fundamental importance 
in explaining the psychopathology. Moreover, in 
every case the child and/or his parents gave explicit 
verbal expression to their awareness of these concerns 
and their intensity. Thus, two important features of 
the Group 1 cases were that the level of inference 
involved in classifying them was minimal, and that 
conscious recognition of the child’s bodily concerns 
by himself or by members of his family were con- 
firmed by diagnostic evaluations which weighed un- 
conscious factors heavily. Group 2 consisted of cases 
in which bodily concerns were Present, but did not 
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constitute the major mode of impulse ¢ 
expression. Other modes, such as phob 
sions, or intense anxiety, all relatively 
least overtly) to the body concerns, provid 
and important means of expression. Cases 
to this group varied from some in w 
concerns were clearly present and important j 
in which such concerns were mentioned only 
ing. Records of Ss assigned to Group 3 co 
no hint that the child entertained bodily con 

A total of 137 case records were read, 0 
33 were discarded because the WISC had 
ministered previously or because the qi 
organicity was open. Three other records 
impossible to rate and were discarded. 
female and Negro children occurred in our 
with the same relative infrequency as their 
rence in the hospital population. Because the 
too few of either to consider the effects of 
race separately, the sample was restricted to” 
males. This left a total of 73 cases dist 
follows: Group 1 = 10, Group 2 = 33, Group 

Early in the research 22 potential Ss 
gorized* and then later, after grouping g 
had been elaborated and refined, they 
pendently recategorized by the junior author. 
parison of the grouping decisions on these 
vides a rough—and probably minimal—e 
the reliability of S grouping. Interjudge 
was 82%. Three of the four disagreements 
between Groups 1 and 2, and in each of 
stances the junior author assigned the S to | 
thus indicating the stringency with which | 
Ss were selected. 


RESULTS 


A one-way analysis of variance Ti 
that the three groups did not differ 
cantly on any of the following variabl 
(in months) when tested, FSIQ, Verbal t 
IQ, Performance Scale IQ, year tested. 
latter was examined to check the pos 
that the groups might reflect, in part, cha 
in hospital personnel’s diagnostic orien 
and sensitivity over time. The optional 
Span subtest (Verbal Scale) had been 
istered to only about half of the Ss. Thus 
Verbal and Full Scale IQs of all Ss we 
strictly comparable. In order to bring to 
any group differences which might be 
by this factor, the contribution of D 
to IQ scores was prorated out and the 
were compared on the basis of these mi 
scores. Again, no significant differences 
found. This modification is incorporated 
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TABLE 1 
Means, STANDARD DEVIATIONS, AND F Ratios OF SUBJECT GROUPS ON BACKGROUND VARIABLES 
Group 1 Group 2 Group 3 
Variables Fe 
xX SD x SD x SD 

Full Scale IQ 103.80 14.77 102.82 14.92 98,93 12.00 81 
Full Scale IQ* 104.40 14.36 103.82 14.24 99.17 12.50 1.11 
Verbal Scale IQ 104.50 15.15 100.76 14.31 96.27 13.49 152 
Verbal Scale 1Q* 105.70 14.25 101.58 13.95 96.67 14.39 1.84 
Performance Scale IQ 102.10 15.67 104.52 14.23 102.13 11.93 .28 
Age (months) 128.90 23.34 122.55 17.99 124.87 22.81 37 
Year tested 63.60 2.12 62.82 2.19 62.67 2.43 64 


Note.—For Group 1, N = 10; for Group 2, N = 33; for Group 3, N = 30. 
a Modified by omission of Digit Span subtest in approximately 50% of the cases. 


» df = 2/72. None of these ratios reach significance, 


subsequent analyses. Table 1 presents the 
means, standard deviations, and F ratios for 
the three groups on each of these variables. 

The absence of significant differences on 
these background variables cleared the way 
for examination of the bodily concerns-Object 
Assembly hypothesis. There were three ques- 
tions: Did Group 1 have a significantly lower 
mean OA scaled score than Group 2 or 
Group 3? Was Group 1’s mean OA scaled 
score among the lowest of their mean scaled 
scores? And were there significant differences 
between the groups on any other subtests? 
These questions were examined by means of 
a one-way analysis of variance with covari- 
ance adjustment for FSIQ. Age was not used 
as a covariate because an age control is built 
into the method of arriving at scaled scores 
and the IQ score. The correlation between 
age (in months) and FSIQ was .06. Group 
differences were sought on each of the 10 
WISC subtests. In order to avoid equating 
the groups on a variable (i.e., FSIQ) which 
was in part composed of the variable on 
Which group differences were sought (sub- 
tests), the FSIQ scores were first modified 
by removing the contribution of the subtest 
under examination in each instance. This was 
done by the usual method of prorating WISC 
scaled scores in all analyses employing FSIQ 
as a covariate. 

Table 2 presents subtests means and stan- 
dard deviations for each group, and the F 
ratios for differences between groups. These 
data appear first with no covariance adjust- 


ment and then with the adjustment for FSIQ. 
There was no Statistically meaningful dif- 
ference between groups on any of the subtests 
except Information. Not only did Group 1 fail 
to differ from either of the other groups on 
the OA scaled score, but neither Group 1 nor 
Group 2 scored lower on OA than on most 
other subtests as Blatt et al. predicted. In- 
deed, the mean OA scaled score fell above 
the median in the distribution of group mean 
subtest scaled scores for all three groups. 
Of course, scaled scores might not be the 
most appropriate data in terms of which to 
test the hypothesis. Certain items in the OA 
subtest might arouse sufficient anxiety in 
children with intense bodily concerns to inter- 
fere with their performance on that item. Such 
interference could be masked when item raw 
scores are summed and transformed into 
scaled scores. This possibility was explored 
through an analysis of variance of the OA 
item raw scores with covariance adjustments 
for age and FSIQ. While this analysis repre- 
sented a post hoc examination of an insignifi- 
cant finding and is, therefore, of dubious 
respectability, it was undertaken here for 
heuristic reasons with full recognition that 
any significant findings might well be chance 
effects. The analysis revealed that only Item 2 
(horse) differentiated the groups at the .05 
level (adjusted means for Groups 1, 2, and 3 
were, respectively, 5.58, 5.15, and 5.17; 
F = 3.23, df = 2/68). Thus, the high body- 
concern group (Group 1) had a significantly 
higher mean item score than the other two 
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TABLE 2 
WISC SUBTEST MEANS AND STANDARD DEVIATIONS WITH AND WITHOUT ADJUSTMENT FOR FSIQ, 
AND F RATIOS, FoR THREE BODILY CONCERN GROUPS 
Group 1 Group 2 Group 3 
Subtest Covariate Ps 
x SD X: SD x SD 
Information None 12.00 2.72 9.61 2.58 9.00 2.92 4,32" 
FSIQ 11.78 2.72 9.38 2.58 9.32 2.92 6.10 
Comprehension None 10.30 2.90 10.52 3.21 9.03 2.37 2.61 
FSIQ 9.99 2.90 10.42 3.21 9.24 2.37 1.76 
Arithmetic None 10.40 2.50 9.58 2.84 9.30 2.56 61 
FSIQ 10.10 2.50 9.46 2.84 9.53 2.56 28 
Similarities None 10.80 3.40 10.76 3.73 10.43 3.63 07 
FSIQ 10.17 3.40 10.55 3.73 10.87 3.63 24 
Vocabulary None 10.90 2.70 10.36 3.26 9.63 2.66 83 
FSIQ 10.42 2.70 10.16 3.26 10.02 2.66 12 
Picture Completion None 10.40 2.29 11.76 3.17 11.03 2.86 95 
FSIQ 9.94 2.29 11.60 3.17 11.36 2.86 1,84 
Picture Arrangement None: 9.90 2.88 10.12 3.17 9.40 2.30 05 
FSIQ 9.63 2.88 9.99 3.17 10.14 2.30 15 
Block Design None 10.70 3.29 10.88 2.64 10.97 2.37 04 
FSIQ 10.38 3.29 10.72 2.64 11.25 2.37 73 
Object Assembly None 10.50 2.54 10.97 2.82 10.07 2.94 16 
f FSIQ 10.20 2.54 10.84 2.82 10.31 2.94 46 
Coding None 8.60 4.29 9.00 3.12 9.17 2.82 11 
FSIQ 8.41 4.29 8.84 3.12 9.40 2.82 46 


Note.—For Group 1, N = 10; for Group 2, N = 33; for Group 3, N = 30. 


adf maan covariates = 2/72, with 
D> < 01. 


groups, and the hypothesis was not supported. 
Moreover, while the magnitude of this dif- 
ference was statistically significant, it appears 
psychologically trivial. 


Discussion 


Despite the use of more carefully differenti- 
ated S groups in this study, no evidence could 
be found supporting the hypothesis that 
bodily concerns are reflected in impaired per- 
formance on the WISC OA subtest. And this 
was true whether the analysis was conducted 
on subtest scaled scores or on the level of 
individual items. 

Several factors might account for this nega- 
tive finding. These include the accuracy and 
adequacy with which bodily concerns were 
reflected in the closed case files used to define 
the criterion groups, and the possibility that 
the relation between bodily concerns and OA 
performance may be obscured in latency and 
early adolescence by rapidly changing organi- 
zation of cognitive capacities or by heightened 


IQ covaried df = 2/71. 


drive-defense constellations in the face of 
puberty. Most of the evidence Blatt et al. 
adduced in support of the hypothesis was 
based on work with adults. It is important to 
note that the OA subtest itself poses partially 
different psychological problems for children 
and adults. While all four items of the 
Wechsler Adult Intelligence Scale OA sub- 
test require the adult S to discover what 
object the cut up pieces form when assembled, 
the child S is informed what the object 15 
before he begins work on the first two WIS 
OA items. l 
But far more significant are theoretical 
limitations of the hypothesis. One of Blatt 
et al.’s purposes in testing the bodily concerns 
hypothesis was to extend the understanding 
of the ego-psychological functions underlying 
the OA subtest. Working from Rapaport, Gill, 
and Schafer’s (1968) rationale for the sub- 
test as primarily involving visual-motor © 
ordination, Blatt et al: asked what could be 
expected to disrupt this psychological funt- 
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tion. The theoretical inadequacy of the hy- 
pothesis lies in this one-sided emphasis on 
disruption and the research strategy implied 
by that emphasis which led them to seek 
directly disrupting factors. A more adequate 
theoretical approach, one more fully consistent 
with psychoanalytic ego-psychology theory, 
would suggest that the presence of bodily 
concerns interacting with mediating variables 
could produce either enhanced or impaired 
performance on OA, or might not be limited 
in their effects to this subtest. 

Examples of these mediating variables are 
patterns of defense or cognitive control 
principles, and stimulus thresholds required 
to evoke their use. There is now considerable 
experimental evidence of interindividual and 
intraindividual differences in the use of cogni- 
tive controls, and of relations between certain 
of these controls and both intellectual abili- 
ties and defensive styles (Gardner, Holzman, 
Klein, Linton, & Spence, 1959; Gardner, 
Jackson, & Messick, 1960; Gardner & Long, 
1962; Guthrie, 1967; Holzman & Gardner, 
1959), That individuals use quite different 
defense mechanisms in managing anxiety and 
other affects generated by roughly similar 
conflicts is a clinical commonplace. While 
many persons may deal with concerns about 
bodily integrity through primarily repressive 
and avoidant defenses, resulting in temporary 
disorganization of visual-motor functioning 
on the OA subtest, others might use isolation 
or counterphobic defenses, for example, pro- 
ducing temporarily heightened visual-motor 
performance. 

_A case in point may be our finding of a 
significant difference in performance on the 
Information subtest between Group 1 and 
Groups 2 and 3 (Table 2). That boys with 
intense bodily concerns should out-perform 
other boys on this subtest was unexpected 
and did not occur in either the Blatt et al. 
or Rockwell studies. It may be, therefore, 
a chance finding, but it may also reveal the 
effect of mediating variables. In an attempt 
to uncover the source of the Group 1 S’s 
superior performance, one-way analyses of 
variance with covariance adjustment for age 
and FSIQ were computed for each subtest 
item. Only three items significantly differen- 
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tiated the groups: Number 16, “Who wrote 
‘Romeo and Juliet’?” (F = 10.96, p < .01); 
Number 20, “Where is Chile?” (F = 22.65, 
p< 01); Number 27, “What is a barom- 
eter?” (F = 3.20, p < .05) (df = 2/68). None 
of these is obviously related to the issue of 
bodily concerns. Notably, all three appear in 
the last half of the subtest and two occur in 
the last third. Group 1 Ss appear to differ 
from other Ss in that they succeed with cer- 
tain more difficult, esoteric items. These Ss 
seem to have been either more attentive to 
information available in their environment or 
to have such information more available to 
memory or both. Those subtest items which 
do appear relevant to bodily concerns were 
among the least differentiating. If the relation 
between intense bodily concerns and high 
Information subtest scores is meaningful, it 
is not as a direct function of items in the 
subtest which pertain to the bodily concerns 
issue, but must reflect more indirect, more 
complex psychological processes. 
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SITUATIONAL APPRAISAL INVENTORY: 


DEVELOPMENT AND VALIDATION OF A MEASURE 
OF EVALUATIVE ATTITUDES 


STEPHEN M. PITTEL+ AND 


The Situational Appraisal Inventory (SAI) was designed to meet the need for 
a measure of subjective moral (evaluative) attitudes. Previous instruments used 
for this purpose were found inadequate because of their psychometric proper- 
ties or psychological assumptions. Two 100-item forms (SAI-A and SAI-B) 
were developed and standardized for college populations and a 30-item form 
(SAI-J) was derived from SAI-A to represent the cluster dimensions and total 
score of the longer form. Internal consistency and test-retest reliabilities of all 
forms exceeded .90 and all forms of the test were shown to be comparable. 
SAI scores related as expected to measures of parental identification and 
religiosity, and were consistently higher for females than for males. SAI scores 
did not relate to measures of social desirability response set. The SAI was 
considered to be a reliable and valid measure of subjective evaluative attitudes. 


GERALD A. MENDELSOHN 
University of California, Berkeley 


A review and critique of measures designed 
to assess the strength of moral values, con- 
science, or superego (Pittel & Mendelsohn, 
1966) led to the conclusion that the instru- 
ments which have been used for this purpose 
are subject to one or more methodological or 
conceptual criticisms which detract from their 
usefulness in psychological research. Although 
the authors pointed to the failure of these 
tests to predict morally relevant behavior in 
a direct way, the major concern is not with 
their predictive validity, but rather with their 
construction and meaningfulness as measures 
of moral values (cf. Ebel, 1961). 

The critique of previous instruments cen- 
tered about the following issues which, in 
addition to the need for standardization data, 
may be used as guidelines for the construction 
of more meaningful devices. 

1. Test items should describe morally rele- 
vant behavior occurring within specified social 
and motivational frames of reference rather 
than mere statements of the acts themselves. 

2. The range of moral behavior covered 
should be extensive, and should include acts 
considered to be morally relevant by the 
population to be studied in addition to those 


1 Requests for reprints should be sent to Stephen 
M, Pittel, Department of Psychiatry, Mount Zion 
Hospital and Medical Center, 1600 Divisadero 
Street, San Francisco, California 94115. 


aspects of conventional morality usually 
included in previous instruments. 

3. Tests should require the evaluation of 
acts on the basis of subjective attitudes. 
They should not assess knowledge of societal 
standards or awareness of conventional behav- 
ior; that is, they should not be contaminated 
by social desirability response set. 

4. Scoring of such tests should not be 
based on assumptions about the correctness 
or psychological significance of responses, nor 
should they require qualitative interpretation 
by the examiner. 

5. Moral values should be assessed directly 
rather than by inference from assumed indi- 
cators such as guilt responses, resistance w 
temptation, or self-reports of characteristic 
behavior. 

6. Parallel forms should be available for 
evaluation of changes in values over time O! 
as a function of experimental manipulations. 


METHOD 


Development of the Situational Appraisal 
Inventory 


Considerations of these guidelines led the present 
authors to construct a new instrument called the 
Situational Appraisal Inventory (SAI). It was de- 
signed to serve as a measure of moral or evaluativé 
attitudes in a variety of content areas and to provid? 
an overall index of severity of conscience or SUP 
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ego as these functions are defined in attitudinal 
terms (cf. Beres, 1958; Sears, 1960). 

In the body of this paper the term evaluative 
attitudes rather than the traditional one, moral atti- 
tudes, will be used in order to avoid some of the 
connotative meanings and content limitations which 
are associated with the latter term. Evaluative atti- 
tudes are inferred from judgments of human conduct 
along a dimension of “rightness—wrongness.” Such 
judgments may be made about any behaviors, re- 
gardless of whether or not they fall into the cate- 
gories of conventional moral concern. 

The SAI consists of items describing the behavior 
of a fictitious character in a number of specific situa- 
tions, each of which is to be evaluated by S in the 
light of the framework in which the act takes place. 
Implicit in each item is the assumption that mean- 
ingful judgments cannot be made about behaviors 
in the abstract. For example, there is high agree- 
ment that taking of human life is wrong, but in 
time of war this value is easily altered; likewise, to 
steal is generally considered wrong, but the legendary 
virtue of stealing from the rich and giving to the 
poor has been taught to generations of children. 

The items for the inventory were derived from 
sample items suggested by a large number of Uni- 
versity of California undergraduates who were asked 
to describe situations in which a person violated 
some moral code under extenuating or justifying 
circumstances, The use of this procedure for item 
construction provided us with acts-in-context which 
were not necessarily restricted to socially defined 
morality, but which were, nevertheless, morally salient 
to college students. Furthermore, the provision that 
the act be justified within the situational context 
allowed a sampling of both the content and the 
motivational aspects of morally relevant situations. 

From the initial pool of items thus collected, two 
100-item forms of the inventory were compiled 
(SAI-A and SAI-B). A number of items were ran- 
domly chosen to be included in both forms, and 
other items were repeated within a single form to 
five a preliminary estimate of response consistency. 
The following are typical items, chosen to indicate 
the diversity of situations and behaviors included: 

1. John finds a wallet containing a few dollars 
lying on the sidewalk. Since no one is in sight he 
Pockets the money and leaves the wallet where he 
found it. 

2. John is walking past the door to his sister's 
room when he notices her lying naked on the bed. 
This sight arouses him and he stays hy her door 
and secretly watches her. 

3. When John goes to the Smorgasbord for dinner 
ie always eats more food than he would ordinarily 
Since he doesn’t have to pay extra for it. 

4. One day John discovers his father giving a 
merciless beating to his younger sister. He tries 
to pull his father away and when he is unable to 
stop him he hits his father over the head with a 
Paperweight. 

5. John has heard that Mary has a reputation for 

g sexually promiscuous. He ask her out on a 
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date just for the purpose of having sexual relations 
with her. i 

6. John is taking an examination for which he 
is well prepared, He glances around him and notices 
that the other students are copying directly from 
their notes. He realizes that he will not be able 
to do as well as those who are cheating and so he 
opens his own notebook and begins to copy from 
his notes, 

The range of content is extensive, but given the 
source of the items, certain themes predominate, 
notably sexual behavior, academic honesty, prejudice, 
authority relationships, and interpersonal honesty. 
However, an effort was made to transcend these 
limitations by including items removed from the 
immediate concerns of a college population, for 
example, stealing, aggression, job behavior, marital 
behavior. 

For each item, S is asked to rate the degree of 
“wrongness” on an 8-point scale ranging from “Not 
Wrong” to “Extremely Wrong.” The S’s ratings are 
considered a measure of the intensity of evaluative 
attitudes with respect to the particular situation 
described. A total score for the SAI is obtained for 
each S by summing the responses to each of the 
items in a test form. This score may be thought of 
as a measure of the overall strength of evaluative 
attitudes independent of the limitations of specific 
content. As such, it represents a tendency to respond 
with a characteristic intensity in evaluating human 
conduct, 

In the initial standardization SAI-A was adminis- 
tered to 143 undergraduates (72 males, 71 females), 
50 of whom (24 males, 26 females) were retested 
on the same instrument following a 2-week interval. 
SAI-B was administered to a similar sample of 209 
undergraduates (88 males, 121 females), 79 of whom 
(35 males, 44 females) were retested on the same 
form. The Ss were volunteers from undergraduate 
psychology courses and included both psychology and 
nonpsychology majors of all class standings. The 
mean age for the first sample (UC 1) was 20.46 
(SD = 2.65) and for the second sample (UC 2) was 
19.51 (SD=2.87), a difference significant beyond 
the .01 level. This difference is probably accounted 
for by the selection of Ss in UC 1 largely from a 
sophomore psychology class, while many UC 2 Ss 
were taken from the introductory course. 


RESULTS 
Reliability of the SAT 


Internal consistency. The split-half (odd- 
even) reliabilities, corrected for attenuation 
by the Spearman-Brown prophecy formula, 
are .96 for SAI-A and .97 for SAI-B, Eleven 
items on Form A and eight items on Form B 
appeared twice in the same form. For Form A 
the correlations between first and second 
ratings of the same item range from .76 to 
92 with a median correlation of .87. Since 
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TABLE 1 


Two-WeEx TEST-RETEST CORRELATIONS, MEANS, 
AND STANDARD DEVIATIONS FOR 
SAI-A anp SAI-B 


Test Retest 
Form N rie 
X | sp| xX SD 

SAI-A 

Males |24 | 422.08 | 78.67 | 438.21 | 82.94 | .939 

Females | 26 | 465.06 | 58.94 | 444.27 | 59.67 | .909 

Total 50 | 443.42 | 72.61 | 441.36 | 71.10 | .920 
SAL-B 

Males |35 | 376.50 | 92.04 | 368.29 | 102.62 | .951 

Females | 44 | 434.91 | 91.13 | 407.86 | 113.69 | .955 

Total 79 | 415.11 | 95,18 | 390.33 | 110.04 | 952 


the instructions on both forms cautioned Ss 
against turning back or skipping pages, these 
figures suggest the high stability of item re- 
sponses to identical situations within a single 
administration. 

Test-retest. Test-retest correlations, means, 
and standard deviations for total score are 
given for each sex and for the total sample 
in Table 1. Reliability coefficients for both 
sexes and the combined sample all exceed .90. 

Frequency distributions of 2-week item 
reliabilities for SAI-A for each sex and for 
the combined sample show individual item 
reliabilities to be only slightly lower than total 
score reliability. The median reliabilities of 
.77 for males, .71 for females, and .73 for 
the combined sample all indicate the high 
stability of item responses over time. Since 


Ss were not prepared for a retest in advance 
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and were specifically instructed “to answer 
each item as you feel about it now. Do not be 
concerned with the responses you gave last 
time,” it would appear that the SAT taps a 
particularly stable and consistent dimension 
of moral evaluation. 


Comparability of SAI-A and SAI-B 


The mean total score for SAI-A is 436.90 
(SD = 77.50), and for SAI-B is 418.43 
(SD = 94.15) a difference significant at the 
.05 level (¢ = 2.01). This represents an aver- 
age item difference between the two forms of 
only .20, which, considering the 8-point scale 
used, is quite small. Forty-three females were 
given Form B followed, after 2 weeks, by 
Form A. The total score correlation for the 
two forms is .89. This correlation is only 
slightly lower than the test-retest correlations 
for the same form over the same time interval. 


Dimensions of Evaluative Attitudes 


While the SAI was designed to measure 
evaluative attitudes in a variety of content 
areas, no categorization of items according to 
any preconceived scheme was attempted. 
Rather, the problem of content scoring was 
approached through the technique of cluster 
analysis. 

Intercorrelation matrices were obtained for 
ratings of the 89 nonrepeated items of SAI-A, 
for the total sample and for the sexes, Sepa 
rately. Tryon key cluster analyses of these 
three matrices yielded seven cluster dimen- 
sions for the combined sample and five fot 
each sex group. The proportion of the total 


TABLE 2 
DESCRIPTION oF CLUSTERS or SAI-A: COMBINED ANALYSIS 
Proportion | 47) of No. of Mean rat- be oa 
of commu- `g items with | ing of items| relation 
ORES gonen nality seus loadings loading ites ie 
y: .50 | tota! 
exhausted above .50 | above eee 
1 General 45 9 18 5.20 54 
2 Sex-Private 14 4 3 2.53 35 
3 Sex-Hlicit 07 2 3 4.26 32 
4 Prejudice 12 3 3 4.94 30 
5 Coercion 06 2 2 6.61 48 
6 Aggression 08 2 2 2.86 32 
7 Personal Well-Being 08 2 2 3.31 39 


Note.—N = 72 males and 71 females. 


wait a, eee eee eee 
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initial communality exhausted by the ob- 
tained clusters was .78 for the combined 
sample, .72 for males, and .77 for females. 
Table 2 presents data descriptive of the 
clusters obtained for the combined sample. 

The first cluster, which accounts for a large 
part of the communality, is defined by nine 
items with a wide diversity of content, 
ranging from academic honesty to petty 
larceny to sibling hostility. While few of these 
items deal with legal violations, all of them 
are rated as being quite wrong. Further, the 
18 items with loadings above .50 on this 
cluster are characterized by having high cor- 
relations with total score on SAI-A. Thus, it 
would seem that this may best be described 
as a general cluster; an assumption which is 
supported by the uniform positive loadings of 
all items on the cluster and by the finding that 
58% of the items have their highest loading 
on this cluster. 

Although the remaining clusters each 
account for only small proportions of the 
communality they are each marked by the 
homogeneity of their content and by their 
psychological meaningfulness. Clusters 2 and 
3 deal with sexual behavior. The first of these 
contains items dealing with masturbation and 
voyeurism while the latter deals with pre- 
marital and extramarital sexual relations. 
Clusters 3 through 7 are defined by only two 
or three items and deal, respectively, with 
anti-Negro prejudice, coercion by force or 
blackmail, physical aggression, and behav- 
lors which are either directly or indirectly 
injurious to one’s personal well-being. 

As shown in Table 3, the intercorrelations 
among the oblique cluster dimensions are, on 
the whole, relatively low. As might be 
expected, the highest correlations are those 
with the general cluster and those between 
the two sexual clusters. However, the inter- 
correlations of items within each cluster 
(shown in the diagonal) are generally higher 
than the average intercluster correlation of .29. 

Findings similar to the above were ob- 
tained in the separate male and female sex 
analyses, Notable exceptions to these marked 
Similarities include the appearance of a cluster 
for males centering on petty violations which 
are almost universally condoned (such as 
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TABLE 3 


INTERCORRELATIONS OF OBLIQUE CLUSTERS 
or SAI-A: COMBINED SEX ANAYLSIS 


Cluster iesnas KA esaat 


4 
39 yk 
48 48 43 


620. —.05° «11-40 

52 28 40 37 38 

31 40 16 .02 .29 29 
A8 AZ .25. 01 31 34S 


NAW WNH 


Note.—Diagonal values are the mean intercorrelations among 
key cluster variables for each of the seven clusters, 


overeating at a Smorgasbord), and the ab- 
sence of a prejudice cluster in the female 
analysis, The female analysis also revealed 
a cluster dealing primarily with interpersonal 
honesty and a less marked differentiation be- 
tween the two sexual clusters obtained in both 
the combined and the male analysis. 

On the basis of these cluster analyses 30 
SAI-A items were selected for a short form 
of the inventory (SAI-J). Twenty-three items 
were selected for their ability to represent 
a given cluster in a pure fashion, that is, 
each item had to have a high cluster loading 
on a single cluster and low loadings on all 
others. In most cases the items selected by 
this criterion were the defining, or key cluster, 
variables for a given cluster. An additional 
seven items were selected on the basis of 
their correlation with total score on SAI-A. 
Items which correlated highly with total score 
but did not load heavily on any particular 
cluster were thus chosen for inclusion in 
Form J. The 30-item SAI-J was constructed 
in such a way as to ensure that the informa- 
tion available in Form A, in terms of both 
cluster structure and total score, could be 
derived more quickly and more efficiently. 
Comparability of SAI-A and SAI-J 

Although data are not available for a direct 
comparison of these two forms there is evi- 
dence to indicate their comparability. For the 
combined UC 1 sample (N = 143) total 
score on SAI-A correlated .96 with the 
score obtained by summing the 30 items 
which comprise SAI-J. Similarly, the mean 
ratings for the 30 selected items taken from 
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the original standardization data correlate .94 
with mean ratings for SAI-J items obtained 
from a sample of 88 male and female Ss who 
were given only the short form. 

Despite these high correlations the ratings 
of items in SAI-J tend to be somewhat lower 
than ratings for the same items included in 
SAI-A. While few of these differences reach 
statistical significance it is likely that intra- 
test context does affect item ratings in a 
consistent fashion. However, the magnitude 
of these differences is slight and the correla- 
tions of item means and total score are suf- 
ficiently high to justify the use of the two 
measures as equivalent. 


Validity of the SAI 


There are serious difficulties in discussing 
the validation of an instrument, like the SAI, 
which is designed to assess subjectively held 
attitudes. Previous research findings indicate 
that, with few exceptions, attitude measures 
are, by themselves, poor predictors of overt 
behavior. Nevertheless a test which purports 
to measure evaluative attitudes should have 
face and construct validity and should differ- 
entiate between groups with known value 
orientations. 


Face Validity 


Since the SAI was designed as a measure 
of the strength of evaluative attitudes (i.e., 
the attitudinal component of conscience or 
superego), its face validity may be estimated 
by considering the representativeness of its 
item content, As indicated above, the items 
do cover an extensive range of behaviors con- 
sidered to be morally relevant by college stu- 
dents and the dimensions obtained in the 
cluster analysis of SAI-A correspond to many 
of the rubrics of conventional morality. Cer- 
tainly, the cluster dimensions include the 
types of behaviors tapped by other measures 
of this sort in addition to those which reflect 
the special concerns of college students. Thus, 
in terms of face validity, the SAT does provide 
a measure of individual differences in the 
domain it purports to assess. 


Sex Differences in SAI Ratings 


Previous research (e.g., Bronfenbrenner, 
1961; Rettig & Pasamanick, 1959) indicates 
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that females show stronger evaluative atti- 
tudes than males. The expectation that this 
sex difference would obtain on the SAI was 
supported for all forms of the test. 

The tendency for females to be more severe 
in evaluative ratings than males is reflected 
by their generally higher mean ratings on 
most items of the inventory although only 
one-third of these differences reach statistical 
significance. Those items which do signifi- 
cantly discriminate between the sexes are fre- 
quently of a sexual nature or are at least 
concerned in some fashion with male-female 
relationships. On Form A of the SAI females 
have significantly higher ratings on 33 of the 
89 nonrepeated items, 19 of which deal with 
male-female relationships. Females, for ex- 
ample, consider it more wrong for John to 
marry a girl for her parents’ money, to have 
intercourse with a prostitute prior to his 
wedding, to neck with his sister, to buy 
clothes for himself while depriving his wife, 


or to date a girl specifically with the inten- 


tion of having sexual relations with her. 
Females also consider certain nonsexual be 
haviors to be more wrong than do males. 
Included in these are items dealing with not 
donating money to charity, cheating on a0 
examination, use of physical aggression, and 
attempting to evade the draft. y 

Although no differences in the opposite di- 
rection were statistically significant, males 
gave higher ratings of wrongness to items 
dealing with lying to parents, stealing office 
supplies, arguing with mother, and habitu- 
ally smoking despite the potential danger t0 
health. 

Table 4 presents mean total scores for 
males and females on the three SAI forms, 
While females in all samples score signifi- 
cantly higher on total score for all forms of 
the test, the mean differences across items are 
sufficiently consistent to result in high agte 
ment about the relative wrongness of items. 
For SAI-A the correlation between male am 
female ratings for the 89 nonrepeated items 
is .93. For SAI-B the two sets of rating 
correlate .91, and for SAI-J the correlation 
is .94. 
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TABLE 4 


Comparison oF SAI TOTAL Scores 
or MALES AND FEMALES 
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TABLE 6 


Comparison or SAI-A Scores IN TERMS OF RELIGIOUS 
BELIEFS AND OBSERVANCE 


Males Females 
Form £ 
N R SD N bi SD. 

SAI-A| 72 | 422.08 | 78.67 71 | 465.06 | 58.94 | 3.69** 
SAI-B| 88 | 395.05 | 91.88 | 121 | 435.44 | 92.87 | 3.12** 
SAI-J | 100 | 119.26 | 26.96 | 100 | 128.00 | 28.60 | 2.10* 

*p <.05. 

"b <01. 


Religious Differences in SAI Ratings 


While the strength of moral (evaluative) 
attitudes should not necessarily differ as a 
function of specific religious affiliations, Ss 
who are strongly religious (regardless of 
denomination) should have higher SAI scores 
than those who do not believe or participate 
in religious matters (cf. Scott, 1965). A com- 
parison of Catholic, Protestant, and Jewish Ss 
yielded no significant differences in total score 
for any of the SAI forms (all Fs < 1.00). 
However, with all forms, Ss who claimed any 
religious affiliation obtained higher SAI scores 
than those who had no religious affiliation. 
Table 5 presents the mean SAI scores for 
religious and nonreligious affiliated Ss. The 
telatively small number of Ss in the non- 
religious group for SAI-A precluded finding 
a significant difference, but the data are 
clearly indicative of the trend demonstrated 
ìn each of the other analyses. 

The 59 Ss used in the above analysis of 
religious differences on SAI-A also responded 
to a questionnaire dealing with the nature 
and strength of belief systems pertaining to 


TABLE 5 


Comparison or SAI Scores: ComBinep RELIGIOUS 
AFFILIATIONS AND NONAFFILIATED SUBJECTS 


Combined religious 


groups No stated religion 


SAT 


N| & 


A | 50 | 463.10 | 60.83 | 9 | 409.88 | 80.51 | 1.79* 
191 | 431.44 | 83.79 | 64 | 371.94 | 106.10 | 4.06** 
69 | 121.51 | 28.74 | 20 | 101.75 | 28.03 | 2.70** 


SD|N| X SD 


Religious belief and SAEN 
observance 
Ne) Re OSD ee 
Church attendance 
High 18 | 479.94 | 48.00], .,, 
Low 41 | 444,02 | 70.95 | 2-22 
Strength of belief in God 
High 43 | 472.51 | 50.08 | > cu 
Low 16 | 407.88 | 82.41 | 285 
Frequency of thoughts of 
God 
High 38 | 477.18 | 47.95 
Low 21 | 418.50 | 77.36 | 9:02" 
Thoughts of God influence 
Ss behavior 
High 27 | 478.88 | 48.18 | gyu 
Low 32 | 437.84 | 75,22 | “ 
* 
#3 S21 


religion. The four questionnaire items which 
tapped strength of religious belief and observ- 
ance are as follows: (a) I attend church: 
(every week, regularly, only on special occa- 
sions, never); (b) I believe that God: (defi- 
nitely exists, probably exists, probaby does not 
exist, definitely does not exist); (c) Thoughts 
of God enter my mind: (very often, fre- 
quently, occasionally, never); (d) Thoughts 
of God influence my behavior: (very often, 
frequently, occasionally, never). : 

Since relatively few Ss gave extreme an- 
swers to any of these items, the data from 
the first two and second two response cate- 
gories for each item were combined for analy- 
sis. Table 6 presents the mean SAI-A scores 
for Ss falling into these response categories. 

For each of the four items which tap 
degree of religious belief and observance, 
those Ss who indicated stronger religiosity 
obtained significantly higher SAI-A scores 
than those who were low on religiosity. 

In conjunction with the finding that non- 
religious affiliated Ss score lower on SAT than 
those who do state a religious affiliation, it 
would appear that the strength of religious 
beliefs is definitely related to the severity of 
moral evaluation. Thus, the SAI clearly taps 
a dimension of subjective evaluation which 
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relates strongly to socially valued belief 
systems. 


Parental Identification and SAI Ratings 


Following most other writers in the field 
(eg., Sears, 1960), it is assumed that the 
development of conscience is accomplished 
through the process of identification. Since it 
is generally believed that the parents of 
typical college students represent and transmit 
to their children the conventional values of 
their society, it follows that strength of pa- 
rental identification should be related to the 
strength of conscience. To the extent that it 
taps conventional moral (evaluative) atti- 
tudes, the SAI should provide an indication 
of the degree to which Ss have adopted pa- 
rental values. The results of three studies 
with the SAI bear on its validity in terms 
of this hypothesis. 

In evaluating these results, it must be kept 
in mind that each of the measures of identi- 
fication is crude and open to methodological 
criticism. Consequently, it is the convergence 
of the results of these studies that provides 
evidence of the validity of the SAT. 

The Ss who took SAI-A were asked 
to estimate the number of items on which 
the judgments of their parents would agree 
with their own. The estimates of parental 
agreement was taken as a rough indication of 
the strength of parental identification. Table 7 
presents the mean SAI-A scores of Ss who 
estimated high agreement with parents and 
those who indicate low agreement with parents 
(75% or more vs. 50% or less of the items). 

The results are in accord with theoretical 
expectations. For both sexes, those Ss who 
estimate high agreement with parents obtain 
higher SAI scores than those who do not. The 
difference for males is small and insignificant 
but the effect is a powerful one for females 
(t = 4.95, p < 001). 

In a second study Ss were asked to re- 
spond to each item on SAI-J in terms of their 
own judgment, and as they anticipated that 
their parents would respond. A measure of 
assumed similarity to each parent based on 
profile congruence (Bronfenbrenner, 1958) 
was derived by summing the absolute differ- 
ences between each S’s self- and father ratings 
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TABLE 7 


Comparison or SAI-A Scores or Groups Du 
IN STRENGTH OF PARENTAL IDENTIFCATION 
CENTAGE OF AGREEMENT) 


Identifi- Males Females 
cation 
with 
both 
parents Ni z SD. N; 
High | 18 | 438.89 | 58.90 | 18 | 509.05 
Low 15 | 422.07 | 88.82 | 18 | 439.50 


and self- and mother ratings across 
items. The Ss were classified as being 
high or low on identification with each pat 
by dichotomizing the distribution of ass 
similarity scores for each parent sē] 
for each sex, High difference scores in 
weak identification while low difference 
indicate strong identification. Table 8 pr 
mean scores on SAI-J for groups of Ss ¥ 
show strong identification with both pare 
and for those who show weak indentificat 
with both parents. 

As in the previous study, higher SAI 
are associated with strong identification 
both parents. Comparisons of SAI scores) 
Ss who are not identified with either 
and those who are identified with both p 
again yield a significant difference only f 
females (for males, £ = 1.19, ns; for fem 

= 2.70, p < .02). While the data for mi 
are in the expected direction, the differe 
was not sufficient to achieve statistical 
nificance. It should be noted that the res 
of the two studies are remarkably simi 
even though two different estimates 


TABLE 8 


Comparison or SAI-J Scores or Groups DIFEBE 
IN STRENGTH OF PARENTAL IpENTIFICAT 
(ASSUMED SIMILARITY) 


Identifi- 
cation 
with 


both a 
parents N x SD N z 


Males Females 


High |19 
Low 15 


138.45 | 29.14 | 16 
126.13 | 28.97 
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strength of identification and two different 
forms of the SAI were used. 

In a third study Distler (1965) derived 
parental identification scores from a number 
of questionnaire items dealing with cognitive, 
affective, and behavioral similarity to parents. 
Distler was concerned with comparisons be- 
tween Ss who were primarily mother identi- 
fied and those who were primarily father 
identified. The Ss who were not identified 
with either parent or who were equally iden- 
tified with both parents were excluded from 
his analyses. For both sexes, higher SAI-J 
scores were found for mother identified than 
for father identified Ss. The ordering of SAT 
scores in terms of parental identification was 
females-mother identified (142.04), females- 
father identified (130.68), males-mother iden- 
tified (126.30), males-father identified 
(119.86). For females only, the differences 
between mother and father identified Ss was 
significant at the .05 level (¢ = 2.09). 

It is clear that SAI scores are related to 
measures of parental identification in a man- 
ner consistent with expectations. High SAI 
scores are related to strong parental identifi- 
cation for Ss of both sexes and, consistent 
with the notion that females have more severe 
evaluative standards, higher SAT scores are 
associated more with maternal than with 
paternal identification. 


Social Desirability Response Set 


SAI total score has been correlated with a 
number of measures of social desirability 
tesponse set. For a sample of 100 male and 
100 female Ss, the correlations of SAI-J with 
the Edwards Social Desirability scale (Ed- 
wards, 1957) were, respectively, .06 and .15. 
For these same Ss the correlations of SAI-J 
and the Marlowe-Crowne measure of social 
desirability (Crowne & Marlowe, 1960) were 
34 for males and .17 for females. Only the 
Correlation of SAT-J and the Marlowe-Crowne 
Scale for males is statistically significant ($ 
< .001). Form B of the SAT was adminis- 
tered to a sample of 40 male and female 
College students who also took the California 
Psychological Inventory (CPI; Gough, 1957). 

correlation of SAI-B with the Com- 
Munality scale (Cm) of the CPI was .04 for 
the combined sample. The correlation of 
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SAI-B with the CPI Good Impression (Gi) 
scale was .03, These scales are described as 
measures of “the degree to which an indi- 
vidual’s reactions and responses correspond 
to the modal (‘common’) pattern established 
for the inventory” (Cm) and the degree to 
which a person is “capable of creating a 
favorable impression . . .” and is “concerned 
about how others react to them” (Gi) (Gough, 
1957, pp. 12-13). 

With the exception of the one significant 
correlation, for males, between SAI-J and the 
Marlowe-Crowne scale, the results point to the 
conclusion that SAT scores are not related to 
any general tendency to respond to personal- 
ity tests in a socially desirable or typical 
fashion, Even the one significant correlation 
coefficient is quite moderate and considerably 
lower than those reported for most other per- 
sonality inventories. It is possible, of course, 
that Ss could, if motivated to do so, “fake 
good,” but in normal research settings this 
does not appear to affect test scores to an 
appreciable degree. 


Discussion 


The SAI provides a very stable measure of 
evaluative attitudes. While few studies using 
other measures of moral attitudes report data 
bearing on either internal consistency or sta- 
bility of scores, those which do also report 
high coefficients. For example, Lincoln and 
Shields (1931), using the Shields Moral 
Judgments Examination report odd-even re- 
liabilities in four samples of children and 
young adults of .62, .81, .83, and .93, Fried- 
enberg and Havighurst (1948) report internal 
consistency reliabilities in the .90s for young 
adolescents, and Rettig and Pasamanick 
(1959, 1960, 1961, 1962), using both the 
Crissman (1942) inventory and a measure of 
their own, report Kuder-Richardson reliabili- 
ties of .93, .95, .96, and .91 for samples of 
college students, college alumni, blue collar 
workers and white collar workers. Woodrow 
(1926) reports a 3-week test-retest correlation 
of .79. One year test-retest correlations of 
50, .65, .66, and .68 are also reported by 
Friedenberg and Havighurst (1948). Although 
reliability data are not available for such 
techniques as story completion, interviews, 
and a variety of paper-and-pencil and behav- 
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ioral tests, it still appears that moral attitudes, 
even in Ss younger than those in the present 
study, are extremely stable and consistent. 

Though the items of the SAT are unusually 
complex for a psychological test, the relia- 
bilities are in general somewhat higher than 
those of other instruments in this area. It 
does not appear necessary to reduce item con- 
tent to simple abstractions removed from 
more realistic behavioral contexts to obtain 
stable judgments. It is possible that the great 
inter-S variability of ratings for almost all 
SAT items is attributable to this complexity. 
Seventy of the 89 unrepeated items in Form 
A have a 7-point score range, and 89 of the 92 
unrepeated items in Form B also have this 
great a range. This considerably enhances 
the utility of the test for studies concerned 
with individual differences in evaluative atti- 
tudes. 

Another conclusion to be drawn from the 
data is that Ss respond to the SAI in large 
part with a generalized evaluative set. De- 
spite the variety of situations included, a 
number of results support this argument. The 
high split-half reliabilities indicate that a 
single dimension accounts for a great propor- 
tion of the test variance. Further, for all 
forms of the test all items correlate positively 
with total score. The high interform correla- 
tion, likewise, indicates a general evaluative 
set relatively independent of content as does 
the large general cluster obtained in the 
cluster analysis of SAI-A. Thus, the interpre- 
tation of SAI total score as a measure of the 
“overall strength of evaluative attitudes inde- 
pendent of the limitations of specific content” 
seems justified. 

This discussion is not meant to imply that 
the general evaluative set is the only determi- 
nant of response to the items. The cluster 
analysis also yielded clusters of items which 
were defined by common content character- 
istics, While total score represents a good 
estimate of overall strength of evaluative at- 
titudes, Ss do discriminate among items on 
the basis of content specific judgments. The 
content clusters are not, however, independent 
either of each other or of total score. 

The objective in the development of the 
SAT was to provide an instrument suitable for 
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the systematic study of evaluative attitudes as 
characteristics of individuals and of social 
groups. As has been indicated, it seems to 
fulfill this function both from a psychometric 
and psychological standpoint. It has been 
shown that SAI total score is systematically 
and predictably related to other relevant 
variables. Differences in SAI scores as a func- 
tion of sex, religiosity, and strength of pa- 
rental identification all support the notion that 
the SAI measures individual differences re- 
lated to the attitudinal domain of conscience 
or superego, These results in conjunction with | 
the high reliabilities lead to the conclusion 
that SAI measures a generalized evaluative 
set which may be thought of as a consistent 
personality trait. 

To summarize, the SAI is an extremely 
stable and internally consistent measure of 
evaluative attitudes. It is largely unaffected by 
social desirability response set but does relate 
to sex, religiosity, and strength of parental 
identification. The great inter-S variability of 
the SAI in terms of items, clusters, and total 
score makes it suitable for the study of indi- 
vidual differences related to conscience as well 
as for the normative study of contemporary 
evaluative attitudes. The SAT is also suitable 
as a dependent variable in studies of attitude 
change, effects of child-rearing practices, pat- 
terns of identification, and of reference group 
affiliation. 
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EXTREME RESPONSE TENDENCY AS A FUNCTION 


OF EMOTIONAL ADJUSTMENT AND 
STIMULUS AMBIGUITY 


RUSSELL P. NORMAN? 
College of William and Mary 


This study attempted to explore relationships between the tendency to check 
extreme categories on the semantic differential, emotional adjustment, and 
ambiguity of stimuli to be rated. One hundred and fifteen general psychology 
students were administered the Welsh A and R scales of the MMPI and asked 
to rate nine stimuli on 21 Semantic Differential scales. Stimuli were chosen 
to reflect three levels of ambiguity (verbal concepts, TAT pictures, and Ror- 
schach inkblots). Three groups of 10 Ss each were formed representing high 
and low combinations of the A and R scales. Results suggested that anxiety 
(high A, low R) is related to extreme response tendencies and repression (low 
A, high R) is related to use of the neutral category. “Adjusted” Ss (low 4, 
low R) tend to make more use of the intermediate categories. The relative 
tendency of high-A Ss to make extreme responses was most pronounced when 
stimuli were most ambiguous, Discussion focused on the relationships between 
scale-checking behavior, “psychopathology,” and meaningfulness of material 


to be rated. 


Considerable attention has been paid re- 
cently to the study of response sets or styles 
and the influence these might have on the 
validity of various psychological test instru- 
ments. As a natural extension of these in- 
terests, investigators have come to view the 
problem of response sets in terms of general 
personality theory. What was formerly 
thought of as basically “nuisance variance” in 
a test instrument is now viewed more in terms 
of motivational variables, the most frequently 
discussed being social desirability (Edwards, 
1957), need for social approval (Crowne & 
Marlowe, 1960), and acquiescence (Couch & 
Keniston, 1960). 

Most studies which have attempted to view 
response sets in the broader context of gen- 
eral personality theory have stressed the use 
of standard ‘objective personality inventories, 
particularly the MMPI. The true-false or 
agree-disagree format quite naturally lends 
itself to ready demonstration of and con- 
trolled experimentation with a variety of re- 
sponse styles, It is possible, however, that this 
approach to the study of personality could 
make profitable use of additional test instru- 
ments, both standardized and experimental, 


1 Requests for reprints should be sent to Russell 
P. Norman, Department of Psychology, College of 
William and Mary, Williamsburg, Virginia 23185. 
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and that broader and more comprehensive 
conclusions about the nature and meaning of 
response sets could be developed. 

In their now famous monograph, Osgood, 
Suci, and Tannenbaum (1957) discuss the 
issue of scale-checking styles on the Semantic 
Differential. The authors cite a number of 
interesting studies suggesting that various 
scale-checking styles can differentiate normals 
and schizophrenics (Bopp, 1955) and high- 
and low-IQ high school students (Kerrick, 
1954). They also discuss some possible 
relationships between generalized anxiety, 1- 
telligence, and scale-checking style. 

In a recent publication, Zax, Gardiner, and 
Lowry (1964) discovered that “maladjusted 
Ss tended to use extreme categories more than 
intermediate categories when rating projet- 
tive material (Rorschach inkblots and Blacky 
pictures) on the Semantic Differential, while 
“adjusted” Ss used more intermediate a” 
fewer extreme categories. They argued thal 
the immature or poorly adjusted individua 
was more likely to organize his cognitive fie! 
in terms or extreme categories, and ample 
theoretical basis for such a formulation ws 
derived from such diverse areas as behavio i 
ism (Spence, 1958), psychoanalysis (Fenichel, 
1945), and Gestalt psychology (Werne, 
1948). The authors suggested that not © y 


E 


EXTREME RESPONSE TENDENCY 


psychopathology but probably also develop- 
mental level influences this tendency to en- 
dorse extreme rather than intermediate cate- 
gories, and that research in this area could 
add relevant contributions to a growing body 
of information on the relationship between 
response style and personality variables. 

Tt occurred to the present writer that ad- 
ditional variables were probably quite rele- 
vant to the issue of extreme response ten- 
dency, those first coming to mind being the 
nature of the emotional maladjustment and 
the type of stimulus being rated on the Se- 
mantic Differential. Zax et al. (1964) used the 
Rorschach (a relatively ambiguous stimulus) 
and the Blacky (a less ambiguous stimulus). 
Since the results with the Blacky did not 
confirm their hypothesis and the Blacky was 
used only with the children’s groups, one 
might be as easily drawn to a level of am- 
biguity interpretation as a developmental one. 

By the same token if one equates emotional 
maladjustment with the presence of unde- 
fended anxiety, then response extremity on 
the Semantic Differential might be an under- 
standable occurrence. There is a considerable 
body of evidence which attests to the fact 
that anxiety narrows the cognitive field and 
tigidifies behavior (see a recent review of 
literature by Korchin, 1964). But what of the 
maladjusted individual without significant 
overt or manifest anxiety? Would such a re- 
lationship hold for the overcontrolled and re- 
Pressed person as well? Since these two dimen- 
sions of maladjustment have consistently 
emerged in factor-analytic studies (Adams, 
1964; Welsh, 1956), it would seem meaning- 
We study response extremity in relation to 
each, 

The writer? attempted to follow up the 
findings of Zax et al. by using a more precise 
Measure of maladjustment (Welsh A and R 
scales of the MMPI) and having Ss rate four 
verbal concepts (College Professor, President 
Johnson, Negro, Myself) rather than the more 
ambiguous Rorschach stimuli. When high-A— 
low-R and high-R-low-A Ss were compared 
With Ss low on both A and R, no significant 

?R. P. Norman, Emotional adjustment and re- 


pone extremity: The rating of verbal concepts. 
Upublished manuscript, 1965. 
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stylistic differences of any kind could be de- 
tected on the Semantic Differential. It is pos- 
sible that the A and R scales were not ade- 
quately measuring emotional maladjustment, 
or more likely, that the extreme response 
tendency might not appear in maladjusted Ss 
when more structured verbal concepts are 
being rated. 

In deference to the second possibility, the 
present experiment was devised to study the 
relationships between emotional adjustment, 
stimulus ambiguity, and response style on the 
Semantic Differential. It was hypothesized 
that in all Ss there would be a tendency to 
make greater use of the extreme scale posi- 
tions as the stimulus material increased in 
ambiguity, and that this tendency would be 
significantly more pronounced in the malad- 
justed Ss. No specific predictions were made 
concerning the anxiety-repression dimension, 
but as a working hypothesis, it was assumed 
to be more likely that Ss high on R would fall 
between Ss high on A and “adjusted” Ss in 
terms of frequency of extreme responses. 


METHOD 


The Ss were 115 general psychology students at 
the College of William and Mary. The 79 items of 
the Welsh A and R scales were randomly ordered 
and administered to all Ss in a single group session. 
The questionnaires were scored and three groups 
were formed based on high and low scores on the 
two scales. The three groups and their A and R 
means were as follows: High A (24.3), Low R 
(12.2); Low A (7.5), High R (21.1); and Low A 
(78), Low R (13.2). A fourth possible group (High 
A, High R) was excluded since only three indi- 
viduals would have qualified for inclusion, Total V 
was 30, with 10 in each group. 

In a second group session, Ss were asked to rate 
nine stimuli on 21 Semantic Differential scales, which 
were the same as those used by Zax et al? The stim- 
uli represented three groupings along the dimension 
of stimulus ambiguity. Three verbally stated concepts 
(College Professor, President Johnson, Negro) were 
assumed to represent the least degree of stimulus 
ambiguity, Three pictures from TAT cards (3BM, 
8GF, and 17BM) were chosen to represent mod- 
erately ambiguous stimuli, and three Rorschach 


3 Tense-Relaxed; Ugly-Beautiful; Hot-Cold; Cau- 
tious-Reckless; Large-Small; Wise-Foolish; Happy- 


Sad; Rounded-Angular; Brave-Cowardly; Sharp- 
Dull; Good-Bad; Fair-Unfair; Hard-Soft; Rough- 
Smooth; Kind-Cruel; Clean-Dirty; Fast-Slow; 


Strong-Weak; ‘Active-Passive; Masculine-Feminine; 
Heavy-Light. 
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cards (I, V, VII) were selected as highly ambiguous 
stimuli. The nine stimuli were presented to the en- 
tire group in balanced sequence (ABCABCABC) by 
means of an opaque projector. No special criteria 
were used to select the stimuli within each am- 
biguity grouping. It was felt, however, that the 
verbal concepts should be salient to college students, 
and that the TAT stimulus material should be clear 
enough in detail to be noticeably less ambiguous than 
the Rorschach cards. 

The mimeographed scales of the Semantic Differ- 
ential were stapled together in such a way that three 
random orderings would occur in equal numbers for 
each stimulus, and no S would be exposed to the 
same ordering for consecutive stimuli. Adjective 
position was also varied so that no one adjective 
appeared on the same side of the scale on all three 
orderings. Semantic Differential scoring consisted of 
a simple frequency count of the use of the three pri- 
mary positions for each S; extreme responses (two 
scale positions), neutral responses (one scale posi- 
tion), and intermediate responses (four scale posi- 
tions) . 


RESULTS 


The mean frequencies for extreme, neutral, 
and intermediate responses in these three 
groups are shown in Tables 1-3. A two-way 
analysis of variance was carried out for each 
response category. 


Extreme Responses 


The three groups are significantly differen- 
tiated (F = 3.75, df = 2/81, p< .05) with 
the “adjusted” Ss making extreme responses 
less frequently in each stimulus condition. In 
addition, the anxious Ss make extreme re- 
sponses most frequently, and consistent with 
the working hypothesis, the “repressed” Ss 
are intermediate between the other two groups. 
In terms of the stimulus dimension, the TAT 
seems to elicit more extreme responses than do 
the verbal concepts or the Rorschach. This is 
true to some extent in each experimental 


TABLE 1 


MEAN NUMBER OF EXTREME RESPONSES IN THREE 
EXPERIMENTAL GROUPS FOR THREE 
DIFFERENT STIMULI 
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TABLE 2 


MEAN NUMBER OF NEUTRAL RESPONSES IN Ture 
EXPERIMENTAL GROUPS FOR THREE 
DIFFERENT STIMULI 


Ror- 


Group Verbal | TAT at M 
High A, Low R| 18.2 14.3 19.3 17.3 
High R, Low A | 20.8 19.4 27.6 22.6 
Low A, Low R | 13.4 11.5 211 15.3 
M 17.5 15.1 22.7 


group, although the F ratio (F = 3.05, df= 
2/81) barely fails to attain significance at the 
.05 level. Simple analyses of variance (Stim- 
uli X Groups) were run, and only the Ror- 
schach significantly differentiated the three 
groups (F = 3.35, df = 2/27, p = .05). With 
respect to extreme responses, then, the am- 
biguity hypothesis receives some support since 
the most ambiguous stimuli are the only ones 
to differentiate significantly between the 
groups. The Rorschach findings are also con- 
sistent with those of Zax et al., although a 
somewhat different picture emerges when the 
stimulus dimension is changed in the direction 
of lessened ambiguity. 


Neutral Responses 


Analysis of neutral responses reveals that 
both groups (F = 3.84, df = 2/81) and stim- 
uli (F = 4.10, df = 2/81) are differentiated 
at the .05 level, Apparently, the ambiguity of 
the Rorschach blots selected for this study 
resulted in more neutral responses than either 
of the other two stimulus categories. Regard- 
less of condition, however, the repressed 5s 
demonstrate a greater tendency to make neu- 
tral responses. On the verbal concepts and the 


TABLE 3 


MEAN NUMBER or INTERMEDIATE Responses IN THREE 
EXPERIMENTAL GROUPS FOR THREE 
DIFFERENT STIMULI 


pa 

~ Ror- M 
Group Verbal | TAT M Group Verbal | TAT schach 

ae ee bere e poet foes 

High A, Low R| 8.0 12.7 10.5 High A, Low R| 37.8 | 36.0 32.9 oR 

High R, Low A| 9.1 | 11.3 8.7  HighR,LowA| 341 | 323 29.6 | 32) 5 

Low A, Low R | 5.2 8.5 5.5  LowA,LowR | 44.5 | 43.0 38.0 | 4) 

M 74 | 108 M 38.8 | 37.1 33.5 
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4 TAT, the anxious Ss are intermediate between 
the other two groups, although this does not 
occur on the Rorschach. 
Me 
Intermediate Responses 
“Yntermediate responses significantly dis- 
tinguish between the three groups beyond the 
(Ol level (F = 7.02, df = 2/81). Regardless 
of stimuli, the “adjusted” group makes a 
" greater use of this type of response. There is 
a slight and statistically insignificant ten- 
dency (F = 2.07, df = 2/81) for intermediate 
| responses to decrease with increasing am- 
‘Diguity of the stimulus. 


e DISCUSSION 


The results of the present study seem to 
“suggest that anxiety is related to extreme re- 
sponse tendencies, repression to neutrality of 
response, and that “normal” Ss seem to avoid 
both of these tendencies and stress the use of 
intermediate categories. The data also suggest 
that the nature of the stimulus is related to 
hese response tendencies but not necessarily 
7 in terms of ambiguity. Verbal stimuli may be 
less ambiguous than projective test material, 
Dut it is obvious that they differ in other ways 
as well. If the verbal concepts are more sali- 
ent, it may well negate the relevance of cer- 
fain Semantic Differential scales, thus forcing 
More responses into the neutral category. For 
similar reasons, scales that are more relevant 
{0 4 particular concept may result in a greater 
Use of extreme categories. 
Of the three stimulus conditions, the TAT 
elicited more extreme responses, while the 
Rorschach was associated with more neutral 
Tesponses. It could be that the human content 
‘depicted by the TAT pictures was personally 
More meaningful than the relatively vague 
and amorphous inkblots. If this were true, it 
Would lend support to the theory which states 
Tesponse extremity can be viewed not 
“only as a function of psychopathology but 
"also as a function of the meaningfulness of the 
Material to be rated and the rating categories 
_ themselves (see a recent review of the litera- 
tute on response extremity by O’Donovan, 
1965). The fact that the Rorschach elicited 
More neutral responses might be a result of 
‘the particular blots selected, but could also be 
| ‘plained by the fact that less ambiguous 
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stimuli were present in the same rating task. 
In other words, after viewing and rating 
material that was more salient and/or more 
meaningful, it is possible that Ss perceived 
many of the 21 scales as irrelevant to the 
Rorschach blots. In fact, several Ss placed 
nearly all of their Rorschach ratings in the 
neutral category. It is clear that in future 
research in this area, it will be necessary to 
define stimulus categories more precisely and 
take into account the dimension of meaning- 
fulness in terms of the stimuli and rating 
scales. 

In discussing the findings in relation to the 
Welsh A and R scales, certain issues must be 
resolved in this regard as well. The writer 
does not feel it is relevant to use the term 
“psychopathology” when studying an essen- 
tially normal population. Since the A and R 
scales have been developed from self-descrip- 
tive statements, one may infer that high-A 
sensitizers describe themselves in somewhat 
deviant, socially objectionable terms, while 
high-R repressors describe themselves in a 
conventionally acceptable manner. This means 
that the status of these individuals in terms 
of psychopathology must be assessed by means 
of additional criteria. Clinicians working with 
college students are familiar with those indi- 
viduals who show very deviant MMPI pro- 
files, but who fail behaviorally to show a 
degree of disturbance commensurate with their 
self-descriptions. Technically, these might be 
called “false positives,” but their existence 
demonstrates the futility of attempting to 
determine degree of psychopathology purely 
in terms of the symptoms a person is willing 
to admit or deny. 

Other evidence reported by Ourth (1963; 
cited in O'Donovan 1965) suggests that indi- 
viduals who use extreme rating categories are 
more capable of showing improvement in psy- ` 
chotherapy than those who tend toward the 
use of neutral categories. It is generally as- 
sumed that the best candidates for psycho- 
therapy are subjectively uncomfortable (admit 
symptoms), personally sensitive to a wide 
range of stimuli, and willing to view them- 
selves and their difficulties from an internal 
frame of reference. These might well be the 
same individuals who have been referred to as 
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“sensitizers.” Conversely, those who are per- 
haps least likely to profit from psychotherapy 
tend to deny subjective discomfort, are per- 
sonally sensitive to only a narrow range of 
stimuli, and tend to view their problems from 
an external frame of reference, a description 
which closely resembles that of the “repres- 
sor” category. 

The results of this study suggest that 
stimulus ambiguity might tend to maximize 
the basic tendencies of these two groups. One 
might expect the sensitizers to see the stim- 
uli as more meaningful and give them subjec- 
tive interpretations, while the repressors will 
tend to move in the opposite direction and 
see the stimuli as less meaningful or irrele- 
vant. This, of course, is related to the ration- 
ale of most projective tests, and would be 
consistent with the observation that anxious 
individuals give more varied and dynamically 
significant responses, while repressed indi- 
viduals are typically unproductive and de- 
fensive. 

Thus, the writer does not find it difficult to 
reconcile the psychopathology and meaning- 
fulness hypotheses at least in relation to a 
nonpsychiatric population. It must also be 
remembered that group averages tend to ob- 
scure individual data and exaggerate the dif- 
ferences that do exist. All Ss in the present 
study were drawn from a relatively homo- 
geneous population in terms of age, ability, 
socioeconomic background, and psychiatric 
status. Correlational analysis of these data for 
all Ss would no doubt reveal positive but uni- 
formly low coefficients. One is forced to agree 
with O’Donovan that future research in this 
area should focus on the interaction between 
stimulus and subject characteristics. In addi- 
tion, the writer feels that it will be manditory 
to move beyond such overly simplified con- 
cepts as psychopathology, as well as exclusive 
reliance on paper-and-pencil instruments. 


RUSSELL P. NORMAN 
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PERSONALITY FACTORS RELATED TO RESPONSE 


TO SOCIAL ISOLATION AND CONFINEMENT* 


DALMAS A. TAYLOR, IRWIN ALTMAN, LADD WHEELER ê 
AnD ESTELLE N. KUSHNER 


Naval Medical Research Institute, Bethesda, Maryland 


The present study examines (a) personality factors relating to or predictive 
of endurance and adjustment to social isolation, and (b) personality and/or 
perceptual changes occurring under varying conditions of social isolation. Pairs 
of men were confined for 8 days in small rooms without recreational materials 
of any kind and a minimal work schedule. Multiple conditions of isolation were 
effected by a factorial combination of mission-length expectation, stimulation, 
and privacy. Eight Holtzman Inkblot Technique variables showed significant 
changes from before to after confinement. Most of these changes were among 
variables included in Factors I and II (Holtzman et al., 1961). Movement, 
Color, Penetration, and Barrier changes were associated with varying conditions 
of confinement. Interpretation of Factor I changes was confounded by corre- 
sponding changes in verbal productivity. Other instruments used provided only 
marginal findings as regards prediction or adjustment. A post hoc analysis of 
disruptive groups yielded a syndrome of adjustment possibly related to extreme 
or severe conditions of social isolation. Suggestions for future research leading 
to selection and training were discussed. 


with psychological adjustment to isolated en- 
vironments (Cole, Machir, Altman, Hay- 
thorn, & Wagner, 1967; Fulkerson, Freud, & 
Raynor, 1958; Sells, 1961; Wright, Chylinski, 
Sisler, & Quarrington, 1967; Wright, Sisler, 
& Chylinski, 1963) have achieved only lim- 
ited success in discriminating between well- 
adjusted and poorly adjusted individuals, 

In a review of research, Myers (1969) 
concluded that personality patterns relating 
to endurance of individuals in prolonged 
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Investigations of personality factors associ- 
ated with endurance in prolonged sensory and 
social deprivation (Hull & Zubek, 1962; 
Myers, 1969; Wright & Zubek, 1966) and 


sensory deprivation (SD) situations can be 
distinguished only through “combinational 
equations predictive of endurance or rough di- 
rectional trends from many massive N experi- 
ments [p. 24].” Myers also stated that un- 
successful Ss are higher on the Psychopathic 
Deviate (Pd) and Hypomania (Ma) scales 
of the MMPI, and on the n-Aggression scale 
of the EPPS. Successful Ss, on the other 
hand, are higher on the Consistency, n-Defer- 
ence, and n-Affiliation scales of the EPPS, 
and higher on the Social Responsibility (Re) 
and Ego Strength (Es) scales of the MMPI. 
Wright et al. (1967) investigated adjustment 
in isolated, semiisolated, and nonisolated en- 
vironments and found poor adjustment asso- 
ciated with high Pd and Ma scores, and low 
Es scores primarily in semiisolated settings. 
Additionally, poorly adjusted groups in the 
isolated and semiisolated settings were lower 
on n-Exhibitionism and Heterosexuality and 
higher on n-Intraception on the EPPS than 
well-adjusted groups. 

Projective inkblot tests have also been used 
with meager results. Wright and Zubek 
(1966) used the Rorschach and the Buhler- 
Lefever Sign List in a multiple discriminant 
function analysis of successful and nonsuc- 
cessful perceptually deprived Ss, but failed to 


411 


412 


replicate an earlier study which distinguished 
successful and nonsuccessful Ss. Cole et al. 
(1967) found perceptual changes, for isolates 
only, as measured by the Holtzman Inkblot 
Technique (HIT), on Human form following 
10 days of social isolation, as well as decre- 
ments in Location, Form Definiteness, and 
Movement and an increase in Color for both 
isolation and control groups. They failed to 
find any relationship between HIT changes 
and personality composition factors related to 
compatibility and incompatibility. Subsequent 
analyses of this data by the present authors, 
combining Color and Movement into an M/C 
ratio, showed incompatible dyads significantly 
decreasing in M/C ratio from preisolation to 
postisolation. Cole et al. (1967) interpreted 
declines in Movement and increases in Color 
as indicative of Ss being “more sensitive to 
external stimuli while utilizing internal re- 
sources less as the stress of experimental 
conditions continued [p. 332].” This inter- 
pretation is also supported by the subsequent 
finding of a decrease in the M/C ratio for 
incompatible groups. Furthermore, postexperi- 
mental questionnaires from the same Ss indi- 
cated that incompatible groups experienced 
higher levels of stress than compatible groups 
(Haythorn, Altman, & Myers, 1966). 

The purposes of the present study were to 
examine further (a) personality factors re- 
lating to or predictive of endurance and ad- 
justment to social isolation and (b) per- 
sonality and/or perceptual changes occurring 
under varying conditions of social isolation. 
Specifically, social isolation conditions were 
manipulated according to (a) privacy, deter- 
mined by whether two men lived in one room 
or in two adjoining rooms with free access be- 
tween the rooms, (b) stimulation, determined 
by whether or not groups had various forms of 
enrichment and contact with the outside 
world, and (c) mission-length expectation, or 
whether groups expected to be isolated for 
relatively short periods (4 days) or relatively 
long periods (20 days). Data reported else- 
where (Taylor, Wheeler, & Altman, 1968) 
indicated that privacy, stimulation, and short 
missions yielded least stress reactions, whereas 
groups expecting long missions, having no out- 
side stimulation and in a privacy condition 
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were most stressed and debilitated. A key 
question posed here is whether personality 
and/or perceptual changes will vary in ac. 
cordance with these differentially stressful 
situations. 


METHOD 


Pairs of men were confined for 8 days in small 
rooms without recreational materials of any kind and 
a minimal work schedule. Diet consisted of liquid 
food, survival crackers, and water. Lights remained 
on at all times to allow continuous monitoring by 
closed-circuit television and ceiling microphones. 

The Ss were 18- to 20-year-old volunteers who had 
just completed boot training at the Great Lakes 
Naval Training Center. Ninety-five percent of 168 of 
the men requested to volunteer on arrival at the 
Naval Medical Research Institute did so.t Pairs were 
selected such that members were not only strangers 
but roughly equated for age, religion, IQ, education, 
and other demographic variables. 

In half of the groups, Ss had separate rooms at 
their disposal (privacy condition) ; the other half 
lived in one room (no-privacy condition). In all 
cases, the rooms were similar in size and were 
furnished identically. 

A stimulation manipulation permitted an examina- 
tion of the impact of loss of contact and separation 
from cultural experiences. The Ss in the no-stimula~ 
tion condition did not hear another human voice. 
Communications were given by tone and buzzer code 
system. By contrast, Ss in the stimulation condition 
received verbal instructions to perform all tasks, were 
asked to report room temperature and food con- 
sumption, and received three 5-minute broadcasts 
per day. 

After the isolation period had begun, half of the 
groups were told that the mission would Jast 4 days, 
and the remaining half were informed that the mis- 
sion would last 20 days. The mission actually lasted 
8 days for both groups. Both groups were told that 
the mission might be extended “due to 
operational requirements.” At the end of 4 days, n0 
announcement was made to the 4-day groups regard- 
ing the extension of their mission. This combination 
of experimental variables yielded a 2X2X2 fac 
torial design. 

The dependent variable measures used 
personality and perceptual factors were 
nesota Multiphasic Personality Inventory (Hathaway 
& McKinley, 1951), the Edwards Personal Preferon 
Schedule (Edwards, 1959), the Holtzman Inkble 
Technique (Holtzman, Thorpe, Swartz, & nee 
1961), and a sociopathy scale (Lykken, 1951). 


to assess 
the Min- 


4The authors owe a special debt of thanks t0 
Orren R. Royal, United States Navy, for 56 all 
as Medical Officer on the project, administering ad 
preexperimental and postexperimental physical a 
psychiatric examinations, and making himself # 
able 24 hours a day for almost a year. 
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TABLE 1 TABLE 2 
Pre-Post MEAN SCORES FOR ISOLATION MEAN NUMBER OF COLOR RESPONSES 
Groups on HIT BEFORE AND AFTER ISOLATION 
Scales Pre Post F Days Isolation Pre Post 
Form Definiteness (FD) 7741 | 71.72 | 9.71% 4 Private 8.36 10.28 
Color (C) 9.23 12.60 | 20.69** No Private 7.44 11.25 
Shading (Si) 3.45 | 5.06 | 11.43* 
Movement (M) 29.10 | 23.38 | 13.41* 20 Private 10.67 17.08" 
Integration (7) 5.33 3.60 |43.16** No Private 10.56 12.61 
Human (H) 22.38 | 15.72 | 48.00** 
Penetration (Pn) 4.58 | 3.42 | 10.56* ® Significantly larger than all other means (p <.01) by 
Location (L) 40.20 | 48.42 | 41.33%" Newman-Keuls test. 
Total Words Used (TWU) | 168.43 | 139.92 29.62* 
: x a tases {lee Human, Integration, Movement, and Form 
#3 S001. Definiteness constituted one factor considered 


MMPI and Lykken scales were administered 1 or 
2 days before confinement and as soon after con- 
finement as possible; the EPPS was given precon- 
finement only. The HIT was group administered 
with Forms A and B counterbalanced from pre- 
confinement to postconfinement.® It was given the 
evening before isolation began and, in all cases, 
within an hour or two following termination of 
the experimental run. 


RESULTS 
Holtzman. Inkblot Technique 


Eight of the HIT scales showed significant 
changes from before to after social isolation 
(see Table 1), Significant decreases occurred 
for Form Definiteness (FD), Movement (M), 
Integration (7), Human (H), and Penetra- 
tion (Pn). Significant increases occurred 
for Location (L), indicating use of smaller 
Portions of the blot, Color (C), and 
Shading (Sk). Additionally, Total Words 
Used (TWU) decreased from preexperimen- 
tal to postexperimental testing. These findings 
replicate and extend those of Cole et al. 
(1967), the only difference being that the 
earlier study did not find pre-post changes 
m I, Pn, and Sh. 

Six of the eight HIT determinants showed 
Pre-post changes are included in two of 
the empirically derived factors identified by 
Gorham (1967) and Holtzman et al. (1961). 


Forms A and B of the HIT were projected on 
ae from 35-mm. color slides produced at the 
aval Medical Research Institute with the kind 
an of the Psychological Corporation, New 
Ee. The, authors seo tke. fp thank Donald R. 
ore | f 
protocols, his computer scoring of the ink! lot 


to “be indicative of well organized, ideational 
activity and good imaginative capacity 
[Holtzman et al., 1961, p. 171].” Postconfine- 
ment decrement occurred on all scales in this 
factor. The other factor included Color and 
Shading, thought to depict oversensitivity to 
the stimulus properties of the inkblots. In the 
present study, Color and Shading increased 
from preisolation to postisolation. Thus, in 
agreement with research on sensory depriva- 
tion (Bexton, Heron, & Scott, 1954), social 
isolation seems to have produced an impov- 
erishment of cognitive-perceptual activity. 
One interest of the present study was to 
examine the possibility that these changes 
might vary according to different conditions 
of social isolation. Specifically, Movement 
and Color changes were expected to relate 
to stress such that simultaneous increases in 
Color and decreases in Movement would be 
greater under the most stressful isolation 
conditions. A significant Mission Length x 
Privacy X Pre-Post interaction occurred for 
Color (F = 4.59, p < .05). Examination of 
means associated with this interaction (see 
Table 2) indicates that all groups showed 
increments in Color from preisolation to post- 
isolation, although the increase was significant 
only for the 20-day privacy group, whose 
posttest Color scores were also significantly 
higher than those of all other groups. It is 
interesting thet. this condition also yielded 
the highest subjective reports of stress and 
anxiety (Taylor et al., 1968). Aborters (i.e., 
Ss who could not complete the 8 days of con- 
finement) in the privacy condition also 
showed significant increases in Color (F = 
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10.29, p < .01) from preisolation to postisola- 
tion and had higher postconfinement means 
for Color than all other groups. 

Further, there were significant Mission 
Length x Stimulation X Privacy X Pre-Post 
interactions for Shading (F = 5.33, p < .05) 
and Movement (F = 5.17, p < .05). Newman- 
Keuls analyses on the means for Movement 
showed significant decrements for the 4-day 
stimulation no-privacy groups only. Means 
for Shading revealed no differential pre-post 
changes. Perhaps short missions affording en- 
richment and interpersonal contacts mini- 
mized perceptual experiences related to Move- 
ment, Similarly, 4-day groups decreased 
significantly on Penetration and Barrier. No 
interpretation of these latter two changes 
is offered. Pre-post decreases on Human 
and Integration were parallel for aborters 
and completers, with no postconfinement 
differences. 

In summary, pre-post increases in Color 
were associated with long mission expecta- 
tions (20 days), privacy, and unsuccessful 
mission completion, whereas postconfinement 
decrements for Movement, Barrier, and Pene- 
tration occurred for various combinations of 
short mission expectations (4 days). The 
finding regarding Color confirms other data 
relating Color change to heightened affect 
(see Cerbus & Nichols, 1963). Lowered 
Barrier and Penetration scores may relate to 
better adjustment under less stressful condi- 
tions of confinement. 

A rather unexpected finding was a decre- 
ment in total verbal output for all groups, 
although they were least for aborter groups 
and for groups under the most stressful con- 
ditions—20-day privacy (see Taylor et al., 
1968). 

There were no preexperimental abort-com- 
pleter differences in HIT determinants, indi- 
cating the lack of overall utility of the test 
in predicting adjustment to isolation. How- 
ever, as will be discussed, there were several 
preconfinement HIT differences for groups 
showing extreme responses to the isolation 
situation. 


Minnesota Multiphasic Personality Inventory 


There were six MMPI scales which pro- 
duced pre-post isolation changes, although the 
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TABLE 3 


Pre-Post MEAN SCORES FoR ISOLATION 
Groups oN MMPI Scares 


Scales Pre Post F 
Depression (2) 53.10 | 49.58 |635 | 
Hysteria (3) 52.10 | 50.48 | 4.98% 
Psychopathic Deviate (4) | 57.30 | 54.43 | 841m 
Hypomania (9) 60,08 | 63.27 | 8.726 
Introversion (0) 50.18 | 48.35 | 7.32% 
Anxiety (MA) 12.05 | 13.21 | 3.10* 


Note.—T scores with K corrections, 

*p <.10, 

** p <.05. 
+p < 01. 
absolute magnitiude of change was small (see 
Table 3). Depression, Hysteria, Psychopathic 
Deviate, and Social Inversion decreased and 
Hypomania and Manifest Anxiety increased 
from preconfinement to postconfinement, Pre- 
sumably, social isolation had the effect of 
abating elevations on some of the clinical 
scales of the MMPI, but did produce elevated 
states of anxiety and hyperactivity. 

Higher order interactions occurred but did 
not yield any systematic relationship to & 
perimental treatments. For example, several 
pre-post changes were associated with the 
stimulation manipulation. However, these 
were difficult to interpret since Newman- 
Keuls analyses revealed that stimulation 
groups had higher preconfinement means fot 
Hysteria, Psychopathic Deviancy, and Psych- 
asthenia and did not differ from the 10 
stimulation groups on postconfinement assess: 
ment. Hence, these differences might be due 
to sampling problems. 

A Privacy X Pre-Post interaction for Psy: 
chopathic Deviancy (F = 3.98, p< 05) an¢ 
a significant Stimulation X Privacy X Pre 
Post interaction for Hysteria (F= 5.85 
p< .05) indicated that no-privacy Ss de 
creased from preisolation to postisolation or 
the Psychopathic Deviate scale, and th 
stimulation-privacy Ss also decreased OA th 
Hysteria scale. However, the preexperimenta 
mean in the latter case was signi 
higher than all other preconfinement mean 
associated with the Stimulation x Priva? 
interaction. fo 

Analyses of changes in ‘MMPI scores i 
aborters and completers in relationship to 
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: TABLE 4 
One-Way ANALYSES OF VARIANCE (KRUSKAL-WALLIS) ror Rrot-ABoRTERS, 
ABORTERS, AND COMPLETERS ON SELECTED PERSONALITY TRAITS 
Pre Post 
i Variable i 
Riot- Com- Riot- Com- 
abort ae plete H abort ai plete H 
M M M M 
MMPI 
F (validity) 4.0. 1,8, 6.0, | 4.30 21.0. 6.0, 3.2» | 6.30** 
Hypochondriasis (Hs) 49.0. 41,2, 49.8, | 4.41 Th 2a: 40.0, 44,5, | 8.09** 
Depression (D) 44.5, | 50.2. 49.5, | 1.24 63.2, | 47.8 50.2, | 4.84* 
Paranoia (Pa) 50.0, 43.2, 52.2, | 1.29 72.5 44.0, 46.2, | 7.48"** 
Psychasthenia (Pt) 54.55 48.55 48.5, | 2.44 65.84 46.5, 50.0, | 7.57*** 
Schizophrenia (Se) 56.5, | 51.8. 54.8, | 1.64 91.0, | 44.0, 48.2, | 7.90*** 
Repression-Sensitization (R-S) 34.25, 25.2a» 17.5» | 4.29 46.50 28.0» | 24.2, | 6.11*** 
Control (Cn) 59.5, | 530s, | 45.2» | 5.76** | 58.0, | 49.20 54.2, | 171 
ty (Re) 42.2, | 46.8.» | 58.5p 5.86** | 43.0, 49.8, 53.0, | 4.84* 
Heterosexual 26.0. 23/25 19.0, | 7.72*** — = — — 
Aggression 15.8. 9.8» 9.8, | 4.24 T — T 5 
Primary sociopathy 
Lykken 26.2a | 19.5a 20.5, | 3.90 29.05 17.0» 22.5, | 5.83* 
Affect /stress 
Subjective Stress Scale 10.8, 46.05 8.8, | 7.70*** | 55.2ab | 72.20 15.5p | 4.67* 
Fer (PAS) 12.9, 3.7 6.6p | 4.86% | 12.5, | 18.40 3.7% | 8,00*** 
Animal (A) 31.8, | 25.9 | 21-5» | 3.46 25.5, | 31.2, 17.0, | 6.32** 
Hostility (Hs) 7.0» | 3.80 9.0, | 5.07 3:8. 5.2, Soa, |e coe, 
Barrier (Br) 7.5ab | 7-00 11.0, | 4.65* 7.29 8.0. 9.2, | 1.25 
Popular (HP) 5.05 5.85 5.8, | 5.07** 3.55 3.0. 4.5, | 52 


att Note.—Items within a row having a single subscript 
er in common are significant at the .05 or .01 level. These 
Dostconfinement means. 


experimental conditions revealed no meaning- 
ful relationships, nor did they indicate any 
preexperimental differences among Ss. 

„In general, there were few significant rela- 
tionships between MMPI preexperimental 
scores or changes in scores and success in 
completing the mission or as a function of 
experimental treatments. However, pre-post 
changes indicated better adjustment (post- 
confinement decreases on D, Hy, Pd, and Si 
scales) and increased activity and anxiety 
(Postconfinement increases on the Ma and 
MA scales), The absence of preexperimental 
differences between aborters and completers 
Indicates that MMPI scales had little power 
for predicting adjustment to conditions of 
Social isolation. 

There were also no significant pre-post or 


in common do 


not differ significantly from each other; items not sharing 


comparisons are not appropriate between preconfinement an 


preisolation findings associated with the 
Lykken Sociopathy Scale or the Edwards 
Personal Preference Schedule. 


Desocialization and Social Isolation 


During the last experimental run, two 
dyads under similar experimental conditions 
completely deteriorated by the third day. In 
violation of mission procedures, these groups 
established verbal communication and jointly 
agreed to abort the mission. The disruptive 
behaviors accompanying this act occurred for 
a period of several hours. Their behavior 
included shouting obscenities through the 
ventilating system into other rooms, beat- 
ing and kicking the outside door to their 
compartments, tampering with experimental 
equipment, refusing to execute assigned tasks 
properly, and generally ignoring instructions 
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from the mission control center. Because of 
the extreme nature of these violations, these 
groups were not included in the regular data 
analyses. 

In order to investigate personality variables 
possibly related to disruption and maladapta- 
tion to social isolation, an equal number of 
so-called “riot-abort,” abort, and completer 
groups under similar experimental conditions 
were selected for comparison. Table 4 presents 
premeans and postmeans and Kruskal-Wallis 
test (see Siegel, 1965, p. 184) results for 
these groups on a number of personality 
traits. There were seven traits on which 
these groups differed prior to confinement. 
The direction of these differences, confirmed 
by Mann-Whitney U tests, indicated that the 
riot-abort groups were significantly higher 
than completers and/or aborters on Re- 
pression-Sensitization (R-S), Control (Cn), 
Heterosexual, Aggression, Subjective Stress 
States (SSS), and Anger as measured by the 
Primary Affect Scale (Johnson & Myers, 
1967). They were lower than completers 
and aborters on Responsibility (Re). Riot- 
aborters were also higher on several clinical 
scales, although differences were not sta- 
tistically significant. Thus, the clinical instru- 
ments used in this study seemed to have 
good predictive ability in the assessment of 
maladjustment to social isolation, when the 
maladjustment is extreme. 

Preisolation and _ postisolation analyses 
showed the riot-abort groups having elevated 
scores on a majority of the MMPI clinical 
scales whereas the completer and abort groups 
showed no such elevations. Riot-aborters had 
significantly higher postisolation scores on the 
MMPI Validity scale (F) (p< .05), Hypo- 
chondriasis ( < .01), Depression (p < .10), 
Paranoia (p < .01), Psychasthenia (p < .01), 
Schizophrenia (p< .01), and Repression- 
Sensitization (p < .05) scales. T scores on 
three of these scales (Hypochondriasis, Para- 
noia, and Schizophrenia) exceeded 70. Com- 
pleter groups remained higher than the other 
two groups on Responsibility (p < .10). The 
Lykken Sociopathy Scale failed to distinguish 
between the groups prior to confinement; 
however, the means were in the expected 
direction. This trend continued following iso- 
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lation, with the riot-abort groups having a 
significantly greater mean on primary soci- 
opathy (p< .10). Finally, the riot-abort and 
the abort groups were significantly higher 
than completer groups on _postconfinement 
Subjective Stress States (p< .10), Anger 
(p< .01), and Animal (p< .05) on the 
HIT. Thus, these unusual groups not only 
had severe adjustment problems, but showed 
marked changes on most of the personality 
scales. 


Discussion 


The present study was concerned with (4) 
the relationship between personality changes 
and various types of confinement and (b) the 
prediction of adjustment to isolation and 
confinement. 

The major Holtzman Inkblot findings in- 
volved pre-post changes occurring for all 
groups, that is, decreases in FD, M, I, H, Pn, 
and Response Length and increases in C, Sh, 
and L. Similar findings for FD, M, L, and C 
were found by Cole et al. (1967) for both 
control and isolation groups, while a decrease 
in H occurred for isolated groups only. 
Studies reporting factor analyses of HIT 
variables (Gorham, 1967; Holtzman et al, 
1961) found that M, J, H, and FD define 4 
factor which Holtzman et al. (1961, P. 171) 
describe as dealing with ideational activity 
and imaginative capacity. That is, response 
to inkblots in this cluster are felt to hav 
little psychophysical correspondence in th 
stimuli but rather reflect the fantasy activity 
of the respondent. This factor accounted fo 
the greatest proportion of the variance m oul 
data. All of the variables in this factor sho 
postconfinement decreases, suggesting tha 
social isolation produces an impoverishmer 
of ideational and imaginative output. *™ 
interpretation is consistent with studies 0 
sensory deprivation reporting postisolatio! 
breakdowns in thought processes, problem 
solving, and intelligence test performant 
(Bexton et al., 1954). Decreases in a 
also confirm the Phillips and Smith (1955 
hypothesis that the number of Human 
sponses varies inversely with social ee 
although Fernald and Linden (1966) fai 
to obtain such decrements. 
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A possible problem in interpreting the 
present data revolves around the question of 
pre-post HIT changes simply as a function 
of repeated testing rather than experimental 
conditions. Holtzman et al. (1961, p. 141), 
using a somewhat comparable sample, re- 
ported slight decreases in Sh, M, and J, simply 
as a function of repeated testing. The same 
authors found significant decreases on M, I, 
and Pn, and a significant increase in Location 
on two repeated testings (p. 133). However, 
the mean changes found in the present study 


were substantially larger than those other 
studies and occurred on many more determi- 
nants. Furthermore, Cole et al. (1967) found 
no changes on these variables in an ambula- 


tory control group. Finally, recent data from 
several studies conducted separately by Gor- 
ham and Holtzman on test-retest reliability, 
counterbalancing for form and length, indicate 

quite stable means for all variables. Thus, 
the evidence suggests that the magnitude and 
extent of HIT changes in the present study 
can be attributed to other than simple test- 
retest changes. 

On a somewhat related issue, Megargee 
(1966) found that Response Length is highly 
loaded on the determinants in Factor I, and 
suggested that response length may be medi- 
ating changes in these variables. Support for 
this position can be seen in a study by 
Conners (1965) who found a highly signifi- 
cant increase in Factor I for an experimental 
(drug) group, while a placebo group showed 

| to such change. One interpretation for change 
in this experimental group suggested that 
| Response Length may have mediated this 
lġ change. This notion was strengthened by the 
knowledge that one effect of the drug 

(dextroamphetamine) seems to be an increase 
in verbalization. In the present study a de- 

Crease in Factor I was accompanied by a 

decrease in total words used. : 

A second cluster contained two variables— 
Color and Shading, both of which reflect a 
Sensitivity or concern with the stimulus quali- 
ties of the inkblots. Shading and Color re- 

| Sponses increased from preisolation to post- 
isolation, especially for aborters. Increases in 


; Wayne H. Holtzman and Donald Gorham, per- 
onal communications, June, 1968. 
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Color have been interpreted as impulsiveness 
(Murray & Jackson, 1964). Cole et al. (1967) 
found increases in Color accompanied by de- 
creases in Movement which they interpreted 
to mean that Ss were being more senstive to 
external stimuli, while using fewer internal 
resources as the stress of isolation escalated. 
Data from the present study provide support 
for the finding and interpretation made by 
Cole et al. (1967). This interpretation is also 
strengthened in that similar changes occurred 
among the most stressful experimental condi- 
tions. That is, a significant increase in Color 
occurred for 20-day privacy groups, who also 
reported increasingly greater levels of subjec- 
tive stress and anxiety over time and who 
had the highest abort rate of all experimental 
conditions (Taylor et al., 1968). The rela- 
tionship between these changes and stress and 
anxiety results provide further support for a 
linkage between Color and Shading and 
certain affective states, such as anxiety and 
impulsiveness (Ainsworth, Klopfer, & Holt, 
1954; Rapaport, Gill, & Schafer, 1946; 
Rorschach, 1949; Schachtel, 1943). How- 
ever, these findings are not wholly in accord 
with the experimental literature which seems 
to indicate little relationship between Color 
on the Rorschach and anxiety (Cerbus & 
Nichols, 1963). The discrepancy might be 
due to the fact that the present study dealt 
with state anxiety as opposed to trait anx- 
iety (see Taylor et al., 1968), while many 
studies in the literature have failed to make 
this distinction (Spielberger, 1966). 

The attempt to find personality traits that 
would predict adjustment or success to social 
isolation met with only limited success. None 
of the instruments used in the present study 
detected preexperimental differences between 
aborters and completers. However, compari- 
sons between extremely disruptive groups 
who aborted and groups confined under simi- 
larly stressful conditions revealed several 
personality traits which may be predictive 
of adjustment. 

During the course of the experimental runs, 
two dyads became disruptive to the point of 
mutually aborting the mission. These dis- 
ruptive groups differed from matched com- 
parisons (completers and aborters) on several 


418 


traits. MMPI scores on Control and Respon- 
sibility for riot-aborters agreed with findings 
by Shipman (1963) indicating that tendencies 
to tempestuous and combative behavior ap- 
peared in people with low Responsibility and 
high Control scores. According to the EPPS, 
riot-aborters tended to be more aggressive and 
were significantly higher than other groups on 
expressions of heterosexual activities. The 
disruptive groups also had high Anger scores 
and were significantly lower than other groups 
on the HIT Barrier score. Megargee (1965) 
found that juvenile delinquents had Barrier 
scores significantly lower than nondelinquent 
adolescents, Furthermore, the more seriously 
delinquent Ss had Barrier scores significantly 
lower than the less seriously delinquent Ss. 
His interpretation and finding are consistent 
with events that led to the riotlike behavior 
described above. 

A markedly noticeable difference between 
the disruptive groups and the controls was 
the extreme postconfinement elevation of the 
MMPI F (validity) scale, which seems to 
agree with other work on aggressive criminals 
(Gynther, 1961). However, the extremely 
elevated postisolation F score, which affects 
other MMPI scales and is usually interpreted 
as a “faked bad” record, leads us to view the 
MMPI changes as a deliberate effort by these 
groups to justify their disruptive behavior 
during confinement. Other changes following 
confinement indicated that the riot-abort 
groups tended to have elevated sociopathy 
scores and were more stressed and angered 
than other groups. Completers were sensitizers 
who admitted, prior to isolation, their con- 
cerns about being confined. Additionally, they 
were able to express aggression and anger and 
yet were not hostile. 

Data reported elsewhere (see Taylor et al., 
1968) showed that long missions with privacy 
and no stimulation from the outside consti- 
tuted the most stressful conditions. A major 
finding in the present study was a significant 
change in perceptual responses from groups 
under these most stressful conditions. Specifi- 
cally, the predicted changes in Color and 
Movement responses on the HIT occurred for 
groups under long mission expectation and 
privacy. This finding replicates a similar 
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change among incompatible dyads from the 
Cole et al. (1967) study, and relates to other 
findings in the literature suggesting a relation. 
ship between Color change and heightened 
affect. Pre-post changes reported here also 
replicated those reported by Cole et al. How- 
ever, those changes associated with Factor | 
are confounded with decreased verbal produc- 
tivity. Hence, decreases in TWU may have 
mediated decreases in Factor I scores, 

The present study demonstrates that com- 
binations of social isolation yielding varying 
degrees of stress, produce perceptual changes 
related to personality functioning. In addition 
to replicating an earlier finding relating in- 
creased stress to changes in Color and Move- 
ment, these data conform to a developing 
pattern of perception, personality, and overt 
behavioral response to the stressful effects of 
social isolation and confinement (Taylor et 
al., 1968). Post-hoc analyses of selected 
groups under the most extreme conditions of 
isolation revealed a syndrome of adjustment. 
Good adjustment was associated with flexibil- 
ity, low sociopathy, and ability to be aggres- 
sive and angry yet not hostile. The completers 
were sensitizers who admitted freely to con 
cern about being isolated. Further research 
is needed to define more definitely this 
adjustment syndrome. Results of such an 
effort will likely lead to accurate diagnosis 
of criteria related to adjustment in isolated 
situations, with implications for selection and 
training. 
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PSYCHOLOGICAL CHARACTERISTICS OF NEIGHBORHOOD 
YOUTH CORPS ENROLLEES* 


SHERMAN EISENTHAL 2 anp LEWIS J. SHERMAN * 


Veterans Administration Hospital, Brockton, Massachusetts 


The entire enrollment of an area Neighborhood Youth Corps (NYC) was 
administered the California Psychological Inventory (CPI) to facilitate pro- 
gram planning and evaluation. Compared to high school norms, all the enrollees 
showed significant achievement problems and social immaturity; the boys, 
however, were more extreme than the girls. They also showed a greater poten- 
tial for dropping out of school and for acting-out delinquent behavior. The 
NYC results were also compared with norms developed on high school disci- 


plinary problems and school dropouts. 


The Neighborhood Youth Corps (NYC) is 
the work training program established by the 
Economic Opportunity Act as part of the war 
on poverty. A major objective of the program 
is to enhance the educational and vocational 
potential of its enrollees. These enrollees are 
of high school age and comprise a self-selected 
group from a disadvantaged economic back- 
ground. The eligibility criteria, set by the 
Department of Labor, limit the maximum 
family income according to family size. In 
order to facilitate program planning and 
evaluation, it seems desirable to assess the 
psychological characteristics of this popu- 
lation. 

Previous research on the lower social classes 
suggests an association with certain psycho- 
logical variables: impulse expression, authori- 
tarianism, low need achievement, high rate of 
school dropouts, etc. (Havighurst, 1949; 
Hollingshead, 1949; Leshan, 1952). Although 
enrollees are all “disadvantaged,” they are 
also all self-selected. Hence it is not clear 
how representative they are of the low-income 
population from which they derive. There is 
at present no large body of data comparing 


1 Another version of this paper was presented at 
the meeting of the Eastern Psychological Associa- 
tion, Washington, D. C., April 1968. The authors 
wish to thank Ray T. Mentzer, Executive Director 
of Self Help, Inc., and Vernon K. Sport, Adminis- 
trator of the Brockton, Massachusetts, Neighbor- 
hood Youth Corps for their assistance. 

2 Requests for reprints should be sent to Sherman 
Eisenthal who is now at Boston University School 
of Medicine, 80 East Concord Street, Boston, Massa- 
chusetts 02118. 

3 Now at the University of Missouri at St. Louis. 


NYC enrollees and high school students along 
basic psychological dimensions aside from 
relatively nonrevealing demographic studies. 
This report attempts to describe the charac- 
teristics of the NYC enrollees with respect 
to two linked areas: (a) education and (b) 
social-emotional adjustment. 


METHOD 


The Ss were the entire enrollment in the Brockton, 
Massachusetts, NYC area during August 1967. There 
were 111 males and 150 females, ranging from the 
seventh to the twelfth grade, with most of the males 
in the ninth and tenth grades, and most of the 
females in the tenth and eleventh grades. They 
ranged in age from 14 to 21; most of the males 
were between 15 and 16 (55%) and most of the 
females were between 16 and 17 (63%). The enroll- 
ment was predominantly white (89%). The time in 
the program ranged from less than 1 month to over 
a year, the median for boys was 2 months and for 
girls was 3 months. Thirty-one of the enrollees, 12 
male and 19 female, were high school dropouts 
Since the distinction between “in school” and “out 
of school” is not a functionally sharp one, as à 
number of enrollees change their school status during 
the school year, the data from both groups Wete 
pooled for ease of presentation. ane 

The enrollees were administered the ee 
Psychological Inventory (CPI; Gough, 1964) an 
a 45-item questionnaire about the program. aa 
were told that the purpose of the testing was a 
research directed toward better understanding of 
enrollees and toward improving the effectiveness Py 
the program. The CPI was selected because it | 
designed to measure many of the variables of spe ‘al 
interest in this research, that is, achievement orien! 
tion, social adjustment, and intellectual ecien 
Furthermore, it is an instrument which has ra 
widely used in counseling settings and has a 
standardized on populations possessing problems © f 
countered in the NYC program, for example, 
cent delinquents and school dropouts. 
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TABLE 1 
Comparison OF NYC EnroLLEES AND HicH SCHOOL STUDENTS ON CPI 
Male Female 
Scale NYC High school NYC High school 
Du Du 
M SD M SD M SD M SD 

Do 22.76 4.97 23.2 6.0 —:44 22.69 5.25 23.7 6.1 —1.01* 
Cs 14.36 3.93 15.3 44 —.94* 13.95 3.79 16.0 4.9 —2,05** 
Sy 20.65 4.01 21.5 5.4 —.85* 21.32 4.81 21.4 5.7 —.08 
Sp 31.63 4.41 32.7 5.7 —1.07* 31.00 5.56 HY 5.8 —.10 
Sa 18.02 3.61 18.7 4.1 —.68 18.56 3.77 18.9 4.4 — 34 
Wb 26.04 6.94 33.5 5.6 —7.46** 30.88 6.87 34.6 5.7 —3.72" 
Re 23.06 5.00 26.7 5.7 —3.64** 26,92 5.32 30.0 5.2 —3.08** 
So 29,30 5.74 36.3 6.0 —7.00** 35.42 6.55 39.4 5.6 —3,98** 
Sc 22.36 7.26 25.3 8.0 —2.94** 26.40 7.10 27.6 8.5 —1.20* 
To 15.21 5.22 17.8 5.3 —2.59** 17.03 5.55 18.7 5.5 =1.67** 
Gi 14.84 5.66 15.1 6.2 —.26 15.48 5.54 15.7 6.2 —.22 
Cm 19,12 5.40 25.2 28 —6.08** 24.10 3.62 26.1 1,9 —2.00** 
Ac 20.36 4.85 22.3 5.3 —1,94** 22.60 5.13 24.1 5.3 —1.50** 
Ai 14.34 4.45 14.6 4.1 —.26 14.90 4.34 15.5 4.2 —.60 
Te 29.20 6.32 33.6 6.3 —4.40** 32.23 6.19 34.4 6.5 =2.17"" 
Py 10.19 2.70 9.2 2.6 .99** 9.60 2.53 8.7 2.6 90" 
Fx 9.47 3.92 9.1 3.4 37 8.73 3.62 8.9 3.2 —.17 
Fe 16.74 3.28 15.4 3.6 1.34* 22.91 3.46 24.1 3.5 —1.19%* 


Massachusetts, during August 1967; for males, N = 111, for females, N = 150. 


Note.—NYC enrollees were from Brockton, 
by Gough (1964) ; for males, N = 3,572; for females, N = 4,056, 


High school standardization sample was reported 
*p <.05. 


> <.01, 


elaborate norms are available on a variety of relevant 
samples for comparison with the NYC groups. 


RESULTS AND DISCUSSION 


Table 1 compares NYC enrollees with the 
national norms for high school students pub- 
lished by Gough (1964). It is evident that 
the NYC males scored below the high school 
norms on 15 of 18 scales, 11 of which reached 
statistical significance. The male enrollees 
thus performed consistently and significantly 
below the high school norms. Three aspects 
of the comparative profiles are particularly 
salient. First, the extremely low scores On 
Well-Being and Communality, below a T 
value of 25, suggest a large number of care- 
less, atypical, and possible faked-bad proto- 
cols. Second, the NYC enrollees deviated from 
high school students most strongly on two 
of the four groups of CPI scales—social 
maturity and achievement.* On the social 

4 Gough divides CPI scales into four broad psycho- 


logical categories: I. Measures of Poise, Ascendance, 
and Self-Assurance; II. Measures of Socialization, 


maturity scales, the NYC scored significantly 
below the high school norms on Responsibil- 
ity, Socialization, Self-Control, Tolerance, and 
Communality. On the achievement scales, the 
NYC males scored significantly below the 
high school norms on Achievement via Con- 
formance and Intellectual Efficiency. The 
Socialization scale could be included in this 
achievement group of CPI scales because it is 
a good “dropout” predictor (Gough, 1966). 
The NYC performance thus strongly suggests 
their potential for school problems and their 
difficulty in adjusting to conventional “middle- 
class” school environments. 

To evaluate better the nature of their 
potential school problems, the NYC scores 
were compared with norms published by 
Gough (1964) on high school students with 
disciplinary problems (Table 2), most of 
whom finish school, and with norms on high 


Maturity, and Responsibility ; III. Measures of 
‘Achievement Potential and Intellectual Efficiency; 
IV. Measures of Intellectual and Interest Modes. 
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school dropouts (Gough, 1966) (Table 3). 
The enrollees appeared considerably more simi- 
lar to the disciplinary problem group than to 
the high school students. The NYC pattern 
of scoring consistently below the high school 
norms on social maturity and achievement 
scales thus does not apply to the disciplinary 
problem group. Table 2 shows that the NYC 
enrollees scored below the disciplinary prob- 
lem group on 10 of the 18 scales, 5 of which 
reached statistical significance. On the achieve- 
ment scales, the NYC scored significantly 
higher on Achievement via Conformance, but 
significantly lower on Intellectual Efficiency. 
On the social maturity scales, the enrollees 
scored significantly lower on Socialization and 
Communality. This pattern suggests that the 
students with disciplinary problems have fo- 
cused their rebellion in settings where achieve- 
ment requires conformity, but are somewhat 
better organized and relatively more socially 
mature than the male NYC enrollees. 
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The scores of the NYC male enrollees ap. 
peared more similar to the dropout norms 
than to the high school norms. The enrollees 
scored below the dropouts on 10 of the 18 
CPI scales, 5 of the 10 attaining statistical 
significance (Table 3). The NYC enrollees 
seemed to share the same difficulties with 
the dropouts in self-direction and achievement 
in a structured situation. The NYC enrollees, 
however, retained achievement values in un- 
structured situations to a greater degree than 
dropouts. Despite this, the enrollees tended 
to be more socially immature than dropouts, 
The fact that the male NYC enrollees scored 
significantly lower on the Socialization scale 
than the dropouts, in addition to the Respon- 
sibility and Self-Control scales, suggests that 
for a number of NYC enrollees their social 
immaturity may manifest itself in a more 
extreme manner than dropping out of school, 
perhaps in more unconventional, delinquent, 
and acting-out behavior. 


TABLE 2 


Comparison OF NYC ENROLLEES AND HIGH SCHOOL DISCIPLINARY 
PROBLEM STUDENTS ON THE CPI 


Male Female 
High school High school 
Scale NYC disciplinary NYC disciplinary 
problem Du problem Du 
M SD M SD M SD M SD 
e a aaa 
Do | 22.76 | 4.97 | 222 | 58 56 22.69 | 5.25 | 228 | 53 =e 
Cs 1436 | 3.93 | 15.0 | 4.2 —.64 13.95 | 3.79 15.4 | 4.6 | —1.45* 
Sy 20.65 | 4.01 | 21.4 | 51 =.75 21.32 | 4.81 20.7 4.5 62 
Sp 31.63 | 4.41 | 340 | 56 | —2.37 | 31.00 | 5.56 | 31.8 | 5.3 — 80 
Sa 18.02 | 3.61 | 189 | 37 —.88 18.56 | 3.77 19.2 | 44 —.64 
Wb | 26.04 | 694 | 312 | 58 | —5.16* | 30.88 | 6.87 31.6 5.2 2172 
Re 23.06 | 5.00 | 21.7 | 47 1.36 26.92 | 5.32 | 26.0 | 5.6 92 
So 29.30 | 5.74 | 31.3 | 54 | —2.00* 35.42 | 655 | 348 | 7.1 62 
Sc 22.36 | 7.26 | 21.7 | 69 66 26.40 | 7.10 | 23.9 | 84 2.50" 
To 15.21 | 5.22 | 158 | 48 —.59 17.03 | 5.55 16.2 | 5.1 83 
Gi 1484 | 5.66 | 13.1 | 56 1.74* 15.48 | 5.54 | 140 | 5.4 1.48" 
Cm | 1912 | 540 | 238 | 32 | —4a6s* | 2410 | 3.62 | 25.5 | 21 | 140" 
Ac 20.36 | 4.85 | 18.7 | 43 1.66 | 22.60 | 513 | 208 | 53 1.80* 
Ai 1434 | 4.45 | 13.2 | 39 1.14 14.90 | 434 | 143 | 40 60 
Te 29.20 | 6.32 | 31.0 | 62 | —1.80* 32.23 | 619 | 32.4 | 64 -11 
Py 10.19 | 2.70 8.7 | 23 1.49** 9.60 | 2.53 84 | 24 1.20" 
Fx 9.47 | 3.92 97 | 27 c 8.73 | 3.62 90 | 32 | -2 
Fe 16.74 | 3.28 | 13.7 | 33 3.04" | 2201 | 3.46 | 23.5 | 3.5 = 59 


Note—NYC enrollees were from Brockton, Massachusetts, durin, 7 ; ya 
Í izati ' a g August 1967; for males, N = 111; for females, 
High go iay standardization sample was reported by Gough (1957) ; for males, N = 91; for females, N = 87- 


ep <01. 
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To determine the degree of similarity 
among the groups, score differences were 
calculated between enrollee and high school 
student norms, enrollee and disciplinary prob- 
Jem norms, and enrollee and dropout norms, 
and the significance of these differences tested 
by the Wilcoxon matched-pairs signed-ranks 
test. The difference scores between the en- 
rollees and the high school students differed 
significantly from those with the high school 
dropouts but not from those with the high 
school disciplinary problems. This finding 
serves to emphasize the relative similarity be- 
tween the NYC male enrollees and high 
school dropouts. 

The female NYC enrollees differed from 
the male enrollees more than they resembled 
them, Like the males, the females scored be- 
low the high school norms (Gough, 1964) 
(Table 1) on 17 of 18 scales, 11 of the 17 
comparisons attaining statistical significance. 
Also, the female enrollees scored significantly 
below the norms in scales measuring social 
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maturity and achievement orientation. How- 
ever, unlike the boys, the females did not 
obtain the extremely low scale scores, The 
females appeared much less deviant and dis- 
turbed than the males for most psychological 
dimensions. 

The likelihood of the enrollees developing 
school problems was explored by comparing 
their scores with norms for disciplinary 
(Gough, 1964) (Table 2) and dropout groups 
(Gough, 1966) (Table 3). The results from 
the female enrollees resembled those of the 
disciplinary problem students more closely 
than those of the high school students. Fur- 
thermore, the female enrollees did not score 
consistently above or below the disciplinary 
students, Thus, the enrollees scored below 
the disciplinary group on only 9 of the 18 
scales with just 2 of the 9 lower scores reach- 
ing statistical significance. Four of the 9 
scales on which the enrollees scored above the 
disciplinary group reached statistical signifi- 
cance. A comparison of the two profile pat- 


TABLE 3 
Comparison or NYC in ScHoor Ann HicH SCHOOL Dropouts ON THE CPI 
Male Female 
S High school High school 
= Shae dropouts Du nye dropouts Du 
M SD M SD M SD M SD 

Do 22.64 5.12 22.54 | 5.77 10 22.44 5.37 24.10 6.12 —1.66* 
Cs 14.20 3.94 13.58 | 3.77 -60 14.21 3.80 14.99 | 4.43 —.78 
Sy 20.58 4.00 20.84 | 5.24 —.26 21.38 4.89 20.91 5,62 7 
Sp 31.33 4.77 31.10 | 5.53 .23 30.59 5.46 30.38 5.98 21 
Sa 18.51 4.24 18.02 | 3.54 49 18.51 3.76 18.42 | 4.24 09 
Wb 26.30 6.12 32.09 | 5.59 —5.79* 30.71 6.68 31.64 | 6.03 —.93 
Re 23.23 4.17 24.73 | 4.90 —1,50* 27.20 5.36 28.86 | 4.91 —1,66"* 
So 29.41 5.77 34,62 | 5.61 —5.21* 35.27 6.31 37.61 5.89 —2.34** 
Sc 21.90 7.10 24.28 | 8.18 —2.38* 26.28 7.28 25.88 8.04 .40 
To 14.95 5.16 16.28 | 5.45 —1.33 17.24 5.68 16.55 5.07 69 
Gi 14.55 5.67 15.53 | 6.42 —.98 15.41 5.54 15.73 6.30 —.32 
Cm 19.81 4.42 24.79 | 2.56 —4.98** 23.86 3.62 25.56 2.58 —1.80** 
Ac 20.26 4.94 20.85 4.80 —.59 22.68 5.10 22.79 4.98 —.09 
Ai 14.32 4.45 12.41 | 3.82 1.91** 14.99 4.35 14.00 | 3.89 .99* 
Te 29.40 6.53 30.86 6.15 —1.46 31.89 6.99 32.15 5.94 —.26 
Py 10.03 2.54 8.80 | 2.54 Ey K yad 9.82 2.52 8.48 2.49 1.34" 
Fx 9.43 | 3.79 8.34 | 3.23 1,09* 8.90 | 3.47 8.48 | 3.09 A2 
Fe 16.73 3.15 16.08 | 3.45 65 22.92 3.39 24.44 | 3.67 —1.52** 


__ Note——NYC enrollees were from Brockton, Massachusetts, during 
High school dropout sample was reported by Gough (1966) ;for males, N = 


*p <.05, 
Dp <.01. 


August 1967; for males, N = 99; for females, N = 131, 
124;forfemales,N = 160. 
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terns suggests (4) the NYC females are some- 
what more socially mature though less confi- 
dent of their social status than the discipli- 
nary students and (b) the NYC females are 
relatively less conflicted and rebellious in 
school settings. 

As was true of the comparison with the 
disciplinary students, the scores of female 
enrollees were closer to those of the dropouts 
than to the general high school student. The 
enrollees scored below the dropouts on 10 of 
the 18 CPI scales with 5 of the 10 reaching 
statistical significance (Table 3). On the 
social maturity scales, the enrollees scored 
significantly below the dropouts on Responsi- 
bility and Socialization. On the achievement 
scales, the enrollees scored significantly above 
the dropouts in Achievement via Indepen- 
dence but not Achievement via Conformance 
or Intellectual Efficiency. This is the same 
pattern found in the comparison of the NYC 
males and the dropout norms. These findings 
suggest that the female enrollees are relatively 
less mature than the female dropouts but 
place more intrinsic value on achievement 
than do the dropouts. 

It is of interest to compare the NYC en- 
rollees and the three other groups in terms of 
the CPI factor analysis reported by Nichols 
and Schnell (1963). They obtained two pri- 
mary factors in their study which they labeled 
Value Orientation (Factor I) and Person 
Orientation (Factor II). Factor I measures 
“responsibility, self-control, and maturity as 
well as a tendency to minimize complaints and 
a concern over the reaction of others [pp. 
230-231].” The enrollees differed from high 
school students primarily on this Factor I and 
consistently demonstrated greater social im- 
maturity than the other groups. 

Tn conclusion, then, there is an ironical 
contrast with regard to the sexes in terms of 
participation in the NYC. The boys, who show 
the greater need for help, are underrepre- 
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sented as compared with the girls. Attracting 
and keeping the male enrollees, who most need 
direction and assistance, remains a challeng- 
ing but unresolved problem for program plan- 
ners. The NYC boys show significant achieve. 
ment problems and a high potential for drop. 
ping out of school. They are also socially im- 
mature and have considerable potential for un- 
conventional and delinquent behavior. Their 
sense of intellectual inadequacy is striking, 
Their style is to act out their problems and 
behave in an impulsive, nonreflective and 
intellectually rigid pattern. The NYC girls 
also manifest a potential for achievement 
problems but would seem to be less likely to 
drop out of school than the boys. In addi- 
tion, the girls are also more mature and less 
likely to engage in unconventional and de- 
linquent behavior. The pattern of scores on 
achievement related scales suggests that the 
male NYC enrollees do have achievement 
values but that they lack the social maturity 
and intellectual efficiency to effectively per- 


form in the school setting. The essential issue 


is how to cope with the male enrollee’s im- 
patience and lack of direction and recognize 
that he is likely to rebel in response to aca- 
demic social pressure. 


REFERENCES 


Goucm, H. G. Manual for the California Psycho- 
logical Inventory (Rev. ed.) Palo Alto, Calif.: 
Consulting Psychologists Press, 1964. 

Govucu, H. G. Graduation from high school as pre 
dicted from the California Psychological Inven- 
tory. Psychology in the Schools, 1966, 3, 208-216. 


Havicnrst, R. J, & Tasa, H. Adolescent character 


and personality. New York: Wiley, 1949. 

Hortincsneap, A. B. Elmstown’s youth. New York: 
Wiley, 1949, 

Lesman, L, L. Time orientation and social class. 
Journal of Abnormal and Social Psychology, 1952 
48, 589-592. 

Nicuors, R. C, & Scunext, R. R. Factor scales for 
the California Psychological Inventory. Journal of 
Consulting Psychology, 1963, 27, 228-235. 


(Received July 18, 1968) 


al of Consulting and Clinical Psychology 
iso Ht Sar No. A, 425-429 


EFFECTS OF PRESESSION INFORMATION ON PERCEPTION 
OF THE THERAPIST AND RECEPTIVITY TO INFLUENCE 
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In order to investigate the idea that presession information can change Ss? 
evaluations and perceptions of a therapy session, 112 Ss were randomly assigned 
to one of four groups and given different information with regard to a 
therapist’s warmth and experience. Following structuring, all Ss listened to 
the same tape of a therapy session. Measures obtained following the tape 
revealed that Ss structured warm or experienced, as opposed to cold or inex- 
perienced, were more attracted to the therapist, more receptive to his influence, 
and evaluated his work more positively. Further, Ss structured warm, rather 
than cold, were more willing to meet with the therapist and more persuaded 
by his communications. The findings thus support the notion that premeeting 


information can affect the therapy relationship. 


As the study of psychotherapy proceeds, 
more time and effort are being turned toward 
an examination of the therapy relationship in 
an attempt to clarify, understand, and even- 
tually control those variables which would 
promote the practice of more efficient therapy. 
The psychotherapeutic relationship was the 
object of study in the present investigation. 
More specifically, interest was focused on the 
class of pretherapy or early therapy events 
which might advantageously affect therapeutic 
process and outcome. The present analogue 
study endeavored to show that information 
given prior to an S’s exposure to a therapy 
session can change his perception of that ses- 
sion and render him more open and receptive 
to the therapist’s influence attempts. 

The fact that “structuring” can influence 
an §’s perception of a person has been amply 
demonstrated by the impression-formation 
literature. A basic finding of this area of study 
seems to be that we react to a person as 4 
unity or whole and not as a series of specific 
traits (Allport, 1937; Asch, 1946; Gollin, 


1This article is based on a doctoral dissertation 
submitted in partial fulfillment of the requirements 
for the PhD degree at Syracuse University. The 
author wishes to express his gratitude to Arnold P. 
Goldstein for his aid and encouragement. 

2 Requests for reprints should be sent to Roger P. 
Greenberg, Department of Psychiatry, State Uni- 
versity of New York at Syracuse, 750 East Adams 
Street, Syracuse, New York 13210. 
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1954). Further, what a person perceives will 
be determined by his total impression of 
what he feels he should see. That is, he will 
distort an “objective” encounter along lines 
that agree with his prior expectations and 
theories (e.g, Asch, 1946; Farina & Ring, 
1965; Kelley, 1950; Secord, 1958). In an 
analogous fashion, the therapy literature 
seems to show that pretherapy expectancies 
or manipulations can affect the therapy rela- 
tionship (e.g., Frank, 1959; Goldstein, 1962). 
Thus, since structuring or expectations do 
play an important role in interactions be- 
tween people, it seems reasonable to ask 
which variables may act to enhance or dimin- 
ish the quality of the initial therapy relation- 
ship. 

Theorizing and common sense support the 
notion that therapist-patient interaction can 
proceed more efficiently in an atmosphere of 
cooperation than in one of hostility and re- 
sistance. Further, a number of studies have 
shown that a person is more susceptible to 
influence attempts made by an individual he 
is attracted to or likes (Back, 1951; Burdick 
& Burnes, 1958; Sapolsky, 1960). Therefore, 
the creation of an agreeable therapy relation- 
ship may be viewed as intimately related to 
variables which enhance patient attraction and 
foster receptivity to influence. 

Two of the variables which have received 
considerable attention in discussions of ther- 
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apy are therapist “warmth” and “experience.” 
Both of these variables have been hypothe- 
sized to be conducive to successful therapy 
outcome. Truax and Carkhuff (1967) have 
suggested in a host of studies that warmth is 
an essential element of successful therapy 
and quote from every major psychotherapy 
theorist in presenting their argument, The 
fact that individuals perceived as behaving in 
a “warm” manner can affect the behavior and 
attitudes of others has been experimentally 
demonstrated many times (e.g., Heller, Mey- 
ers, & Kline, 1963; Kelley, 1950; Masling, 
1957). In like manner, it has been reported 
that the experience of the therapist may be an 
important indicator of potential therapeutic 
gain or adequacy (e.g., Barrett-Lennard, 
1962; Cartwright & Vogel, 1960; Strupp, 
1962). 

The present study attempted to find out if 
just labeling a therapist “warm” or “experi- 
enced” can change the nature of the therapy 
relationship. Do Ss perceive a therapy session 
differently if they feel they are listening to a 
warm or experienced therapist? Does a thera- 
pist have an advantage in influencing patients 
merely by virtue of being labeled warm or 
experienced? Specifically, it was hypothesized 
that Ss structured “warm” would be more 
attracted to the therapist and more receptive 
to his influence attempts than Ss structured 
“cold.” Similarly, it was hypothesized that 
Ss structured “experienced” would be more 
attracted to the therapist and receptive to his 
influence attempts than those structured “in- 
experienced.” Above all, it was expected that 
Ss structured both “warm” and “experienced” 
would be more attracted to the therapist and 
receptive to his influence attempts than Ss 
structured both “cold” and “inexperienced.” 


METHOD 
Subjects 


The Ss were 112 undergraduates obtained from 
introductory psychology courses at Syracuse Uni- 
versity. Each S was randomly assigned to one of 
four equal sized groups: experienced-warm, experi- 
enced-cold, inexperienced-warm, or inexperienced- 
cold. Each group contained 14 men and 14 women. 
The Ss were run in groups of 12-16 with each 
condition being run on two separate occasions. 
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Materials 


After initial structuring all Ss listened to a 15- 
minute tape of a simulated therapy session, This 
tape was judged by 10 advanced clinical psychology 
graduate students and three PhD clinical psycholo- 
gists. All agreed that the tape was a credible repre- 
sentation of a psychotherapy session and relatively 
ambiguous with regard to the variables of “warmth” 
and “experience.” 

Following the tape presentation Ss were asked to 
fill out two questionnaires. Questionnaire 1 consisted 
of a scale to measure each S’s attraction toward the 
taped therapist and a scale to measure each S's re- 
ceptivity to the taped therapist’s influence. The 
attraction scale was a modification of the Client’s 
Personal Reaction Questionnaire developed by Ashby, 
Ford, Guerney, and Guerney (1957), and meaning- 
fully used in studies of the therapy relationship by 
Goldstein (1962) and Snyder (1961). The scale 
consists of a series of 28 statements describing posi- 
tive and negative aspects of the taped therapist- 
patient interactions, to which each S responded on a 
7-point scale from “strongly agree” to “strongly dis- 
agree.” A constant of 84 was added to each S's score 
so that the possible range of attraction scores was 
0-168. Ten statements concerning Ss’ own attitudes 
toward the therapist made up the receptivity scale 
(e.g, I would probably feel uncomfortable talking 
with this therapist), Again Ss rated their degree of 
agreement with each statement on a 7-point scale. 
Scores were taken to reflect each S’s willingness to 
interact and be open with the taped therapist. A 
constant of 30 was added to each S’s score so that 
the possible range of receptivity to influence scores 
was 0-60, 

Questionnaire 2 was a persuasibility measure and 
attempted to measure the therapist’s ability to influ- 
ence Ss’ perceptions and opinions of the taped ses- 
sion. The Ss were presented with a series of 
“psychologically based” statements about the taped 
patient (e.g., the patient is probably absentminded) 
and asked to rate these statements on @ 1-point 
scale from “strongly agree” to “strongly disagree. 
All answers were matters of opinion since there was 
no factual information in the tape on which to base 
answers to these questions. The Ss were also in- 
structed that the taped therapist had filled out this 
questionnaire and were given his rating after each 
item so that they could “compare” their views WI 
his. The therapist ratings presented were randomly 
distributed (i.e, slightly agree through slightly dis- 
agree) except for 20 items which were randomly 
rated at the extremes (i.e., strongly agree oF strongly 
disagree). The Ss were scored for each of the 5 
selected extreme items on a scale ranging from Ll, 
with 7 representing perfect correspondence WI 
“therapist’s opinion”; thus the possible range o 
persuasibility scores was 20-140. The higher the score 
for any S the more his ratings coincided with m 
of the therapist and thus, presumably the more y 
views were influenced by the therapist. 


EFFECTS OF PRESESSION INFORMATION 


Finally, Ss were presented with a printed sheet 
stating that the psychology department, after many 
inquiries concerning student problems, was thinking 
of asking the taped therapist to the university to 
discuss problems. The Ss were then asked in writing 
if they were willing or not willing to meet the taped 
therapist to discuss problems. 

Two questions in Questionnaire 1 served as a check 
on the independent warmth and experience manipu- 
lations (ie., the therapist is a warm person; the 
therapist sounded as though he had dealt with the 
problems of many people in the past). 


Procedure 


After each group of Ss was seated in the experi- 
mental room, they were presented with a printed 
page headed “Biographical Information.” The E then 
read the printed page to the Ss. The information 
given was as follows: 


In recent years more and more people experi- 
encing emotional and psychological problems have 
been seeking help from psychotherapists. We are 
interested in learning more about psychotherapy 
and the reactions of people to this treatment. 
Therefore, we are going to play a tape of part of 
an actual psychotherapy session that took place 
in a large city other than Syracuse and ask you 
for your reactions to it. As you listen to the tape 
we would like you to put yourself in the place 
of the patient and imagine how you, as the 
patient, would react to this particular session. 

The patient in this session is a college student 
at a large university who sought help after find- 
ing, among other things, that he was having in- 
creasing difficulty making decisions. 


The rest of the information given to each S varied 
depending on which of the four groups he had been 
assigned to. 


Experienced: warm or cold. The therapist has 
been engaged in the practice of therapy for over 
20 years and has lectured and taught at some of 
the country’s leading universities and medical 
schools, Questionnaires submitted to the therapist’s 
colleagues seem to reveal that he is a rather (warm 
or cold) person, industrious, critical, practical, and 
determined. 

Remember to try and put yourself in the pa- 
tient’s place as you listen to the tape. 

Inexperienced: warm or cold. The therapist is a 
student engaged in a required part of his training 
and this session is one of his first experiences as 
a therapist, Questionnaires submitted to the 
therapist’s colleagues seem to reveal that he is a 
rather (warm or cold) person, industrious, critical, 
Practical, and determined. 

Remember to try and put yourself in the pa- 
tient’s place as you listen to the tape. 
Following the playing of the tape, all Ss were 


asked to fill out the questionnaires previously de- 
scribed. 
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RESULTS 


Before the main hypotheses were tested, 
the data for the questions serving as a check 
on the independent warmth and experience 
manipulations were examined by two-tailed ¢ 
tests for mean differences. Analysis revealed 
that Ss structured warm rated the taped 
therapist as significantly warmer than Ss 
structured cold (#= 10.51, df=110, p< 
.001). Similarly, Ss structured experienced, as 
opposed to inexperienced, rated the taped 
therapist as sounding significantly more ex- 
perienced (f= 5.98, df= 110, p<.001). 
Thus, both the warmth and experience manip- 
ulations were successful. 

The mean attraction, receptivity, and per- 
suasibility scores are presented in Table 1. 
Separate analyses of variance applied to these 
three measures showed, as predicted, that Ss 
structured warm, rather than cold, were more 
attracted to the therapist (F = 50.68, df = 
1/104, p < .001), more receptive to his in- 
fluence (F= 28.15, df = 1/104, p< .001), 
and more persuaded by his communications 
(F=7.06, df=1/104, p< 01). Also as 
predicted, Ss structured experienced were more 
attracted (F= 8.00, df= 1/104, p< .01) 
and receptive (F = 8.21, df= 1/104, p< 
.01) to the therapist than Ss structured inex- 
perienced. Yet, Ss structured experienced were 
not more easily persuaded by the therapist’s 
statements. Scheffé’s tests for mean differ- 
ences applied to the three measures revealed, 
as predicted, the warm-experienced group was 
more attracted (p< .001), receptive (p< 
.001), and persuaded ( < .05) by the taped 
therapist than the cold-inexperienced group. 
In addition, Ss given the information that 
the therapist was warm, rather than cold, 
were more attracted to him regardless of 


TABLE 1 


Mean AttracTION, RECEPTIVITY, AND 
PERSUASIBILITY SCORES 


Attraction Receptivity | Persuasibility 
Information 


given 


Warm | Cold 


103,54 | 99.43 
101,96 | 93.86 


Warm | Cold | Warm} Cold 


118.25 | 83.18 | 39.46 | 27.21 
105.43 | 66.68 | 32.86 | 20.57 


Experienced 
Inexperienced 
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whether the “warm” and “cold” were paired 
with “experience” or “inexperience” (p’s < 
.001). The Ss were also more receptive to 
the warm therapist regardless of whether the 
“warm” and “cold” were paired with “expe- 
rience” or “inexperience” (p’s< .01). Fi- 
nally, Ss were more attracted to the therapist 
presented as warm-inexperienced rather than 
cold-experienced (p < .05). 

Three chi-square analyses were done on the 
gross measure of whether or not each § was 
willing to meet with the taped therapist to 
discuss student problems. As predicted, a sig- 
nificantly larger number of Ss structured warm 
(42), as compared to cold (30), stated they 
would be willing to meet with the taped thera- 
pist (y?=4.71, df=1, p< .05). Also, as 
predicted, a significantly larger number of Ss 
structured warm-experienced (23), as opposed 
to cold-inexperienced (14), stated they would 
be willing to meet with the taped therapist 
(x? = 5.10, df = 1, p < .05). No significant 
difference in willingness to meet with the 
therapist was obtained between Ss structured 
experienced and those structured inexperi- 
enced. 


Discussion 


In general then, both the trait (i.e., warm- 
cold) and the role (i.e., experience-inexperi- 
ence) information presented seemed to affect 
Ss’ perceptions and evaluations of the session. 
However, the trait information was apparently 
more powerful in biasing perceptions, in this 
sample, than the role information. In evalu- 
ating the results, it should be kept in mind 
that the present study is a psychotherapy 
analogue and as such did not necessarily use 
individuals with psychological problems as Ss. 
Therefore, it seems tenable that the experi- 
ence dimension may not have been as im- 
portant a dimension for this group as for a 
group of actual patients seeking help. The 
warmth dimension, on the other hand, seems 
important as a facilitator of rapport for peo- 
ple involved in any kind of relationship. Also, 
since both the experienced and novice thera- 
pist were presented to Ss as trained individu- 
als the effects of the “experience” information 
may have been attenuated within the present 
study. In like manner, these effects may be 
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attenuated within the practice of psychother. 
apy at large where patients are seen by thera. 
pists known to have varying degrees of ex. 
perience. Further, since most people do not 
know how a therapist behaves or what goes on 
in a psychotherapy session, it may be more 
difficult for Ss to bias their perceptions given 
the information that a therapist is experienced 
than given the information that he is warm, 
The impression-formation literature makes 
the point that negative information about a 
person has more influence on Ss’ perceptions 
of that person than positive information (eg, 
Briscoe, Woodyard, & Shaw, 1967; Pastore, 
1960). Here it is probable that Ss would feel 
more resentful and negative toward a cold 
therapist than an inexperienced one since it 
is assumed that individuals have control over 
acting warm or cold but not over being 
experienced or inexperienced. 

Finally, and perhaps most significant, the 
traits of the therapist (i.e., warmth) may be 
much more important than the number of 
hours of his experience or his role title in 
promoting therapeutic change. This idea seems 
to be gaining support as studies point out 
that lay people, with certain attributes, can 
be trained in a relatively short period of time 
to do effective therapy (e.g., Carkhuff & 
Truax, 1965; Rioch, Elkes, Flint, Udansky, 
Newman, & Silber, 1963). i 

The present study suggests that premeeting 
information may significantly affect the proc- 
ess of psychotherapy. It highlights the impact 
which referral sources, rumor, or precon- 
ceived notions may have on psychotherapy. 
Further, it underscores the importance of the 
patient’s perception of the therapist for fa- 
cilitation of attraction and influence. It is 
planned that future research will move further 
along the analogical bridge toward the actual 
therapy room with actual therapy patients. 
The findings of the present study strongly 
suggest that “structuring” can enhance the 
initial psychotherapeutic relationship. 


REFERENCES 


ArLrorT, G. W. Personality. Boston: Houghton Mif- 
flin, 1937. "i 
Asc, S. E. Forming impressions of personality: 
Journal of Abnormal and Social Psychology» 1946, 

41, 258-290. 


| 


EFFECTS OF PRESESSION INFORMATION 


Asusy, J. D., Forp, D. H, Guerne, B. G., & 
Guerney, L. F. Effects on clients of a reflective 
and a leading type of psychotherapy. Psychological 
Monographs, 1957, 71(24, Whole No. 453). 

Back, K. W. Influence through social communica- 
tion. Journal of Abnormal and Social Psychology, 
1951, 46, 9-23. 

BARRETT-LENNARD, G. T. Dimensions of the client’s 
experience of his therapist associated with person- 
ality change. Genetic Psychology Monographs, 
1962, 76, No. 43. 

Briscor, M. E., Woopyarp, H. D., & SHaw, M. E. 
Personality impression change as a function of the 
favorableness of first impressions, Journal of Per- 
sonality, 1967, 35, 343-357. 

Burvickx, H. A., & Burnes, A. J. A test of “strain 
toward symmetry” theories. Journal of Abnormal 
and Social Psychology, 1958, 57, 367-369. 

Carxuurr, R, R., & Truax, C. B, Training in coun- 
seling and psychotherapy: An evaluation of an 
integrated didactic and experiential approach. 
Journal of Consulting Psychology, 1965, 29, 333- 
336. 

Cartwricut, R. D., & VoceL, J. L. A. A comparison 
of changes in psychoneurotic patients during 
matched periods of therapy and no therapy. Jour- 
nal of Consulting Psychology, 1960, 24, 121-127. 

Farma, A., & Rinc, K. The influence of perceived 
mental illness on interpersonal relations. Journal 
of Abnormal Psychology, 1965, 70, 47-51. 

Frank, J. D. The dynamics of the psychotherapeutic 
relationship. Psychiatry, 1959, 22, 17-39. 

Goxpstern, A. P. Therapist-patient expectancies in 
psychotherapy. New York: Pergamon Press, 1962. 

Gort, E, Forming impressions of personality. 
Journal of Personality, 1954, 23, 65-76. 

Herter, K, Mvers, R. A, & Kime, L. V. Inter- 


429 


viewer behavior as a function of standardized 
client roles. Journal of Consulting Psychology, 
1963, 27, 117-122. 

Kerrey, H. H, Warm-cold variable in first impres- 
sions, Journal of Personality, 1950, 18, 431-439. 
Mastinc, J. M. The effects of warm and cold inter- 
action on the interpretation of a projective proto- 
col. Journal of Projective Techniques, 1957, 21, 

377-383. 

Pastore, N. Attributed characteristics of liked and 
disliked persons. Journal of Social Psychology, 
1960, 52, 157-163. 

Rrocu, M. J., Evxes, C., FLINT, A. A., Upansky, B. 
S., Newman, R. G, & Super, E. National Insti- 
tute of Mental Health pilot study in training 
mental health counselors. American Journal of 
Orthopsychiatry, 1963, 33, 678-689. 

Saporsxy, A. Effect of interpersonal relationships 
upon verbal conditioning. Journal of Abnormal 
and Social Psychology, 1960, 60, 241-246. 

Secorp, P. F. Facial features and inference processes 
in interpersonal perception. In R. Taguiri & L. 
Petrillo (Eds.), Person perception and interper- 
sonal behavior. Stanford: Stanford University 
Press, 1958. 

Snyper, W. U. The psychotherapy relationship, New 
York: Macmillan, 1961, 

Strupp, H. H. The therapists’ contribution to the 
treatment process. In H. H. Strupp & L. Luborsky 
(Eds.), Research in psychotherapy. Vol. 2. Wash- 
ington, D. C.: American Psychological Associa- 
tion, 1962. 

Truax, C. B, & Carkanurr, R. B. Toward effective 
counseling and psychotherapy: Training and prac- 
tice. Chicago: Aldine, 1967. 


(Received July 19, 1968) 


Journal of Consulting and Clinical Psychology 


1969, Vol. 33, No. 


4, 430-434 
DIGIT SPAN: 


AN INDICANT OF TRAIT OR STATE ANXIETY? € 


WILLIAM F. HODGES? AND 


University of Colorado Florida State University 


Clinical psychologists generally interpret lower scores on Digit Span (DS) rela- 
tive to other Wechsler subtests as indicating anxiety. However, research findings 
on this topic are inconsistent and difficult to interpret because of ambiguity with 
regard to the concept of anxiety. To clarify this ambiguity, Spielberger has 
proposed a conception of anxiety that elaborates on the distinction, initially 
proposed by Cattell and Scheirer, between anxiety as a transitory state 
(A-state) and as a relatively stable personality trait (A-trait). The present 
study evaluated the relationship between DS performance and measures of 
A-trait (Taylor Manifest Anxiety scale) and A-state (Zuckerman Adjective 
Checklist) for 72 male undergraduates. The Ss reporting high levels of A-state 
showed significant decrements in DS performance. There was no difference in 
the DS performance of high- and low-A-trait Ss, 
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For many years clinical psychologists have 
interpreted the pattern of scores obtained on 
subtests of the Wechsler Intelligence Scale as 
reflecting important personality characteristics 
of individuals taking the test. One of the most 
frequent inferences is that a low score on the 
Digit Span (DS) test, as contrasted with 
scores on other subscales, indicates the pres- 
ence of anxiety (Rapaport, Gill, & Schafer, 
1968; Rapaport, Schafer, & Gill, 1944; Wech- 
sler, 1958). 

While clinicians tend to agree that DS per- 
formance is lowered by “anxiety,” the mean- 
ing and conceptual status of anxiety is ambig- 
uous. Cattell and Scheier (1961) have noted 
that the term “anxiety” has been used to 
refer both to a transitory state or condition 
of the organism and to a relatively stable per- 
sonality trait. Although variables that loaded 
differentially on their state and trait anxiety 
factors have been identified by Cattell and 
Scheier, they have not specified the nature of 


1 The research summarized in this report was based 
on data obtained while the authors were associated 
with the Department of Psychology at Vanderbilt 
University. It was supported in part by a United 
States Public Health Service Fellowship (5-F1-MH- 
28,537) from the National Institute of Mental 
Health to the first author and by a research grant 
(HD 947) to the second author from the National 
Institute of Child Health and Human Development, 
United States Public Health Service. 

2 Requests for reprints should be sent to William 
F. Hodges, Department of Psychology, University 
of Colorado, Boulder, Colorado 80302. 
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the relationship between these two anxiety 
concepts. 

Spielberger has recently proposed a theo- 
retical conception of anxiety that specifies the 
relationship between the concepts of state and 
trait anxiety (Spielberger, 1966; Spielberger 
& Lushene, in press). Building on the work of 
Freud (1933, 1936), Cattell and Scheier 
(1961), and Spence (1964), Spielberger de- 
fines state anxiety (A-state) as consisting of 
subjective feelings of apprehension and con- 
cern and heightened autonomic nervous sys- 
tem activity; whereas trait anxiety (A-trait) 
refers to anxiety proneness, that is, to indi- 
vidual differences in the disposition to re- 
spond with high levels of A-state under stress- 
ful circumstances. 

The failure in most investigations of DS 
and anxiety to distinguish between Acstate 
and A-trait has made the literature in this 
area difficult to interpret. Anxiety (presum 
ably A-state) was induced in two studies by 
telling Ss they were doing poorly on a test of 
ability (Griffiths, 1958; Moldawsky & Mol 
dawsky, 1952). For Ss exposed to this stress- 
ful experimental condition, significant decre- 
ments in DS performance were obtained as 
compared to nonstress control groups. In two 
other studies in which the anxiety measures 
could be reasonably interpreted as indicants 
of A-state, decrements in DS performance 
were also found to be associated with increase 
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in A-state (Pyke & Agnew, 1963; Walker & 
Spence, 1964). 

The results have been inconsistent and 
contradictory in investigations of the relation- 
ship between DS and presumed measures of 
A-trait, such as the Taylor Manifest Anxiety 
(MA) scale, the IPAT Anxiety scale, and the 
Welsh A scale. Three studies have reported no 
relationship between DS performance and A- 
trait measures (Jackson & Bloomberg, 1958; 
Matarazzo, 1955; Walker & Spence, 1964), 
one study found a positive relationship 
(Jurjevich, 1963), and one reported a nega- 
tive relationship (Calvin, Koons, Bingham, & 
Fink, 1955). The results have also been 
equivocal in studies in which clinical diag- 
nosis was used as the criterion of A-trait 
(Lewinski, 1945; Merril & Heathers, 1952; 
Warner, 1950). 

The present study was designed to evalu- 
ate the relationship between DS performance 
and measures of state and trait anxiety. The 
Ss who differed in A-trait were required to 
perform on a DS task under failure and con- 
trol conditions. In accordance with Trait- 
State Anxiety Theory (Spielberger & Lu- 
shene, in press), high-A-trait Ss were ex- 
pected to respond with higher levels of A- 
state than low-A-trait Ss in the failure condi- 
tion, but not in the nonthreatening control 
condition. It was further expected that per- 
formance decrements in DS would be inversely 
related to level of A-state but not necessarily 
to A-trait, since anxiety must be aroused in 
order for it to interfere with behavior. 


METHOD 


The Ss were 72 male undergraduates enrolled in 
the introductory psychology course at Vanderbilt 
University who volunteered for an experiment on 
“numbers.” The Taylor (1953) MA scale served as 
the measure of A-trait. The Ss with MA scale scores 
of 18 or greater were designated the high-A-trait 
group; those with scores of 11 or lower were desig- 
nated the low-A-trait group. These cutoff points 
approximated the upper and lower quartiles of the 
MA scale distribution for males, after Ss with Lie 
scale scores of 8 or higher had been eliminated from 
the S pool. 


8 An additional condition involving the threat of 
shock was also used, but these data were not relevant 
to the hypotheses of the present study and are re- 
Ported elsewhere (Hodges, 1968). 
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Prior to the experiment, each S was assigned either 
to the Failure (stress) Condition or the Control 
(nonstress) Condition. The Ss were tested indi- 
vidually. At the beginning of the experiment, there 
was an 8-minute rest period during which $ was 
required to complete the Zuckerman (1960) Affect 
Adjective Check List (AACL) by checking adjectives 
that described “how you feel at this moment.” The 
AACL served as the measure of the phenomeno- 
logical component of A-state. 

Since Wechsler (1958) considered performance on 
Digits Backwards (DB) to be particularly sensitive 
to anxiety level, a special DB task was constructed 
for this study. After S completed the AACL, he was 
given DB from the WAIS (Wechsler, 1955, p. 41) 
in accordance with standard instructions to determine 
his limit, that is, the level at which he twice failed 
to repeat a series of a specified number of DB. He 
was then given two practice series, each one digit 
less than his limit, followed immediately by six 
more series of digits of the same length. The number 
of series correctly reproduced by S out of the six 
that were given to him was his Performance score. 

After the Performance score was obtained, each S 
in the Failure Condition was told that, although he 
was not doing too badly, he could do better and 
that other Ss seemed to be doing better than he was. 
In the Control Condition, S was told that he was 
doing fine and that the next set of DBs were to be 
“just as easy as the ones he had just completed.” 
Six more series of digits of the same length were 
then given to obtain a Test score for each S. Fol- 
lowing the test series, S was required to fill out the 
AACL again, according to how he felt “while re- 
peating the last few series of digits.” 


RESULTS 


The effects of experimental conditions and 
level of A-trait on DS performance are pre- 
sented in Figure 1. It may be noted that the 
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Fic. 1. The effects of failure and no-threat instruc- 
tions and level of A-trait on digits backwards during 
the Performance and Test Periods (N=18 per 


group). 
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failure instructions produced decrements in 
DS performance. In an analysis of variance of 
these data, in which Conditions and A-trait 
level were the between-Ss variables and Time 
Periods the within-Ss variable, only the Con- 
ditions main effect (F = 4.97, df = 1/68, p 
< .05) and the Conditions x Time Periods 
interaction effect (F = 6.73, df = 2/68, p< 
01) were significant. The absence of any sta- 
tistically significant effects involving the A- 
trait variable confirmed the impression gained 
from Figure 1 that DS Performance was un- 
related to A-trait. 

Next, the effects of experimental conditions 
and level of A-trait on intensity of A-state 
during the Test Period were evaluated. An 
examination of the distribution of A-state 
scores obtained during the Test Period indi- 
cated that the median AACL score was 12. 
The data for 14 Ss whose AACL scores fell 
at the median were eliminated; Ss with scores 
above and below the median were designated 
the high- and low-A-state groups, respec- 
tively. In order to provide an equal number of 
Ss in the high- and low-A-state groups, four 
Ss with AACL scores below the median were 
randomly eliminated. 

As may be noted in Table 1, all of the 
high-A-trait Ss in the Failure group were 
classified as high A-state, and all but one of 
the low-A-trait Ss in the Control group were 
classified as low A-state. A statistical analy- 
sis of these data yielded a significant chi- 
square (x? = 24.78, df = 3, p < .01), indi- 
cating that the effect of experimental condi- 
tions on A-state was influenced by level of 
A-trait. 


TABLE 1 


FREQUENCY or HIGH-A-TRAIT AND LOW-Å-TRAIT 
SUBJECTS IN THE FAILURE AND CONTROL 
Conpitions WHo WERE CLASSIFIED AS 
HicH or Low IN A-sTATE UNDER 


STREss 
SA Low | High 

Conditions| Groups N Astate | Astate 
Control Low A-trait 14 13 1 
Control High A-trait 13 6 7 
Failure Low A-trait 13 8 5 
Failure High A-trait 14 0 14 

Totals 54 27 27 
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Fic. 2. The effects of low-A-state and high-A- 
state levels on Digits Backwards during the Per- 
formance and Test Periods (N = 27 per group). 


The next step in the analysis of the data 
involved an evaluation of the effects of A- 
state on DS Performance. For this analysis, 
the data for the Failure and Control Condi- 
tions were combined. The mean DB scores 
obtained in the Performance and Test Periods 
by Ss in the high- and low-A-state groups are 
presented in Figure 2. These data were evalu- 
ated in an analysis of variance in which the 
most important finding was the significant A- 
state X Time Periods interaction (F = 6.52, 
df = 1/52, p < .01). The main effects of A- 
state (F=5,50, df= 1/52, p<.05) and 
Time Periods (F = 4.90, df= 1/52, P< 
.05) were also statistically significant. In 
order to evaluate further the effects of A- 
state on DB Performance, the data for each 
Time Period were separately analyzed by a 
Neuman-Keuls test (Winer, 1962), which 
indicated that the high- and low-A-state 
groups did not differ in the Performance Pe 
riod (F = 49, df = 1/52) but that the DS 
Performance of the high-A-state group was 
significantly lower than that of the low-A- 
state group during the Test Period (F= 
11.05, df = 1/52, p < 01). 


Discussion 


In the present study, the failure to find 4 
relationship between DS Performance 4 
scores on the Taylor MA scale provides fur- 
ther evidence that decrements in DS are W 
related to measures of A-trait (Jackson 
Bloomberg, 1958; Matarazzo, 1955; Walker 
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& Spence, 1964). The finding that Ss in the 
Failure Condition showed a significant decre- 
ment in DS performance was consistent with 
the results of previous investigations in which 
some form of stress was used to induce anxi- 
ety (Griffiths, 1958; Moldawsky & Moldaw- 
sky, 1952; Pyke & Agnew, 1963; Walker & 
Spence, 1964), and suggests that decrements 
in DS performance are associated with eleva- 
tions in A-state. 

Evaluating the effects of experimental con- 
ditions on level of A-state, it may be noted in 


Table 1 that high-A-trait Ss were particularly 
disposed to manifest high levels of A-state 
under conditions of failure-stress. It may 
also be noted that over half of the high-A- 


trait Ss responded with high levels of A-state 
in the Control Condition, even though they 
were told that they were doing well. In con- 
trast, less than half of the low-A-trait Ss re- 
sponded with high A-state in the Failure 
Condition and only one low-A-trait § re- 
sponded with high A-state in the Control 
Condition. These results support the hypoth- 
esis that high-A-trait Ss tend to respond with 
high levels of A-state to situations that in- 
volve threats to self-esteem such as failure or 
threat of failure (Spielberger & Lushene, in 
press). 

The finding of greatest theoretical impor- 
tance in this study was the statistically sig- 
nificant decrement in DS performance for Ss 
who reported a high level of A-state. The 
fact that high-A-state Ss had lower DS scores 
than low-A-state Ss in the Test Period but 
not in the Performance Period is consistent 
with the interpretation that decrements in DS 
Performance are produced by elevations in 
A-state, This finding is also consistent with 
Trait-State Anxiety Theory (Spielberger & 
Lushene, in press), which posits that per- 
formance decrements on learning and memory 
tasks are produced by differential levels of A- 
state rather than differences in A-trait. Ac- 
cording to this view, it would be expected that 
DS Performance will be related to A-trait 
only when differential levels of A-state are 

| induced (by environmental circumstances or 
stressful experimental conditions) in Ss who 
differ in A-trait. 
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An alternative explanation for the decre- 
ments in DS performance that were observed 
in the present study is that Ss who did more 
poorly on the DS task experienced an in- 
crease in A-state in response to their de- 
teriorating performance. This interpretation 
argues that the failure instructions in some 
way influenced performance without altering 
anxiety level, and that changes in A-state 
subsequently resulted from $’s reaction to his 
poor performance. Although the experimental 
design in the present study does not permit a 
definitive choice between these alternative ex- 
planations of the findings, the first interpre- 
tation, which emphasizes the impact of anxi- 
ety level on performance rather than the re- 
verse, would appear to be more consistent 
with the clinical literature. 
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CLIENT-THERAPIST RELATIONSHIP FACTOR IN 
SYSTEMATIC DESENSITIZATION 
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Forty snake-phobic female Ss were matched on degree of behavioral avoidance 
and subjective fear, and then randomly assigned to one of four groups: (a) a 
conventional desensitization group, using standard interpersonal procedures; 
(b) a semiautomated desensitization group in which treatment was delivered 
by tape recordings; (c) a pseudodesensitization control group; or (d) a no- 
treatment control group. The Ss in both desensitization groups showed signifi- 
cantly greater improvement than did Ss in either of the two control groups. 
No differences were found between conventional and semiautomated procedures. 
It was concluded that the relationship is neither a primary nor critical factor in 


desensitization. 


In a review of the psychotherapy of 
phobiås, Andrews (1966) concludes that the 
central factor in the treatment of phobias is 
a therapist who establishes himself in a direct 
nurturant role and uses this relationship as 
leverage to encourage the patient to confront 
fear-arousing situations. Andrews maintains 
that the behavior therapists’ informal and 
casual discussions of their interchanges with 
their patients reveal a rather characteristic 
type of social relationship which plays an im- 
portant role in the modification of behavior. 

Meyer and Gelder (1963) express a point 
of view similar to that of Andrews. They, too, 
emphasize the importance of the relationship 
between the patient and the behavior thera- 
pist, and even suggest that the relationship 
is the single factor which seems most relevant 
to successful outcome of behavior therapy. 

The patient-therapist relationship and its 
effects on the outcome of behavior therapy 
is an important empirical issue which deserves 
direct and critical exploration. Beginnings 
have been made by Lang, Lazovik, and 
Reynolds (1965), Melamed and Lang (1967), 
and Paul (1965). Altogether, these studies 
not only disallow an unequivocal answer to 
the problem under consideration, but they 
impart further good reasons for conducting 
experiments expressly designed to shed light 
on the significance of the relationship in 
behavior therapy. In the study by Paul, for 


1 Requests for reprints should be sent to M. Mike 
Nawas, Department of Psychology, University of 
Missouri, Columbia, Missouri 65201. 


example, while the attention-placebo therapy 
group did not change as much as the de- 
sensitization group it did show greater im- 
provement than a no-treatment control group. 
Melamed and Lang (1967), comparing the 
performance of a therapist with that of an 
apparatus programmed for the reduction of 
fear, found the automated procedure about 
as effective as the live therapist. In their 
research, however, the human Æ was actively 
involved in some critical aspects of the auto- 
mated program; he conducted the first train- 
ing sessions in hierarchy building and the 
initial relaxation instruction sessions. 

The present study was specifically designed 
to assess the relationship factor, using con- 
trols employed in previous investigations as 
well as controls for several factors which have 
either been neglected or inadequately met in 
a number of previous experimental studies. 
More specifically, all instructions to all Ss 
were delivered by tape recordings; Ss in the 
various groups were matched on their degree 
of avoidance behavior to the phobic stimulus 
(live snake) in an actual behavioral situa- 
tion as well as on their subjective ratings 
of fear. Finally, in order to make possible 
more rigorous comparisons between groups, 
two modifications in the standard desensitiza- 
tion procedure were introduced. First, no de- 
tailed history was taken from each S, and 
second, a standard rather than an individu- 
alized hierarchy was used for all Ss. These 
modifications—history taking and type of 
hierarchy—have been shown to have no un- 
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toward effects on the efficacy of desensitization 
(Emery & Krumboltz, 1967). 


METHOD 
Measures 


Measures taken in this experiment were those used 
in the initial screening and selection of Ss, and those 
employed in testing the effects of experimental 
procedures. 

A Preliminary Screening Questionnaire (PSQ) was 
administered to all undergraduate students taking 
the introductory psychology course at the University 
of Missouri at Columbia. These students are required 
to participate in one or more studies, of their own 
choosing, to satisfy course requirements. The PSQ 
consisted of two main parts, one pertaining to fears 
of animals and the other to the individual’s ability 
to closely follow imaginary scenes suggested by 
another person. 

The S was asked to imagine himself in a room 15 
feet in length at the end of which was a caged ani- 
mal, and to indicate if he could (a) approach the 
animal, (b) touch it with a gloved hand, or (c) 
touch it with a bare hand. The S was asked to indi- 
cate whether he could perform the above steps for 
each of six animals: rat, nonpoisonous snake, spider, 
mouse, cockroach, and beetle. Because more Ss indi- 
cated that they were afraid of snakes than of any of 
the other six animals, the fear of nonpoisonous snakes 
was selected for study. The Ss who indicated that 
they would be able to touch the snake with a gloved 
or bare hand were not considered sufficiently fearful 
of snakes and were, therefore, excluded from partici- 
pation in the experiment. 

The PSQ also contained the “imagery test.” This 
was adapted from experimental work in hypnosis. 
The S was asked to imagine in detail the last meal 
he ate. Questions were then asked concerning (a) 
the clarity of the image, (b) whether or not it was 
in color, and (c) whether or not S was able to 
project himself into the scene which involved the 
meal in question. Those who did not answer all three 
questions in the affirmative were excluded from the 
study, 

The Behavioral Avoidance Test (BAT), similar to 
that used in previous desensitization studies with 
animal phobic Ss (Lang, Lazovik, & Reynolds, 1965), 
was used to measure the intensity of the individual’s 
avoidance response to the feared object. The test was 
conducted in a room 6 X 18 feet containing only a 
table next to the wall opposite the entrance. A white 
cage (24 X 18X 12 inches), which opened from the 
top, was placed on the table, clearly exposing the 
36-inch nonpoisonous Speckled-back King Snake 
through a plate-glass front. The test itself consisted 
of a check list of 20 separate items corresponding to 
20 discreet approach steps toward the snake, ranked 
in order of increasing aversiveness. The steps ranged 
from (1) standing outside the door of the testing 
Toom, to (20) picking up the snake and bringing it 
close to the chest. 
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Each S was escorted individually into an anteroom 
which opened into the testing room, was seated, and 
asked to listen to recorded instructions prepared in 
advance by E. The recording informed S of the 
snake in the next room and he was told that he was 
to attempt to perform, in order, each step in the 
avoidance check list. The S was alone in the room 
which housed the snake but was observed through 
a one-way mirror in an adjoining room. The Ss who 
were able to reach Step 14 on the check list (placing 
their hand into the cage) were excused from the 
study. A 

In addition to S’s behavioral response, a measure 
of subjective fear—Fear Thermometer (FT)—was 
also obtained. The FT has been rather extensively 
employed in previous studies, for example, Lazovik 
and Lang (1960), with inconclusive results. The 
measure consists of a 10-point scale ranging from 
“calm” to “terrified” on which S is asked to indicate 
the intensity of his fear at the point beyond which 
he could proceed no further in the BAT. 

The Fear Survey Schedule II (FSS-II), deVeloped 
by Geer (1965), was used for having Ss rate the 
degree of their fear for each of 50 objects or situa- 
tions. The total score was taken as S’s measure of 
generalized fear. A separate analysis was performed 
on Item 38 of the FSS-II, the snake item, to deter- 
mine whether experimental procedures specifically 
affected S’s response to that item. 


Subjects 


The Ss in the study were 40 female undergraduates, 
selected from the initial pool of approximately 1,200 
students who were administered the PSQ described 
earlier. Of these, 308 Ss indicated that they were 
fearful of snakes, Eleven males and 15 females were 
rejected for failure to pass the imagery test. An addi- 
tional 84 females and 19 males were rejected for 
failing to display sufficient avoidance behavior in the 
presence of the snake. Finally, one male and five 
females were excused from participation in the ex- 
periment because they had had or were then engaged 
in psychiatric treatment. Due to the relatively sm 
number of males remaining eligible it was decide 
that only female Ss would be used for the study: 
Each of the 130 female Ss who remained eligible 
was assigned to 1 of 13 groups. This study i 
concerned with data relevant to four of these groups: 


Procedure 


The 40 Ss were matched on the basis of is 
BAT and FT scores. Assignment of the matched 95 
to one of the groups was then done on a rani p 
basis. Identification of each of the four group 
follows: 

Live desensitization (LD). Each S in this grou? 
received one 30-minute taped training session 
relaxation followed by five 30-minute sessions H 
systematic desensitization, The desensitization sessio” 
were conducted by Æ using standard interpersom 
procedures as described by Wolpe (1958). 
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Taped desensitization (TD). Each S in this group 
received one taped 30-minute training session in 
relaxation followed by five 30-minute taped sessions 
of systematic desensitization. 

Pseudodesensitization (PD). The Ss in the PD 
group received one taped 30-minute training session 
in relaxation followed by five 30-minute sessions of 
pseudodesensitization. The imaginal stimuli used for 
this group were pleasant and unrelated to the phobic 
stimulus, The purpose of this group was to control 
for such factors as expectancy of improvement, 
suggestion, and attention. 

No treatment (NT). The Ss in this group received 
no treatment and were given only the pretreatment, 
posttreatment, and follow-up test batteries. 

The apparatus used during treatment sessions con- 
sisted of a tape recorder, two sets of headphones, 
and a black vinyl reclining chair. For all Ss the E 
was present in the room. For Ss in the LD condition 
the procedure was to escort S into the room, seat 
her facing Æ, and engage in conversation. For Ss in 
the TD condition Æ was seated out of sight of Ss 
and did not communicate with her during the entire 
session, Desensitization was administered by means 
of tape recordings and was received by S through a 
set of headphones. The headphones were set up to 
allow Æ to monitor S’s responses as both E and S 
listened to the recordings. 

Treatment sessions were conducted in a different 
room from the one in which testing was held. Each 
S who received relaxation training was seated in the 
lounge chair and told that everything would be 
explained to her by means of a tape recorder. The 
taped relaxation instructions were similar to those 
used by Paul (1965) and others. Desensitization ses- 
sions began with a 3-minute relaxation induction 
sequence followed by the desensitization itself. Each 
item in the hierarchy of 20 steps (of the BAT) was 
presented until S was able to entertain an image of 
the item for 15 seconds on two consecutive occa- 
sions without signaling anxiety. 

The posttreatment battery was administered to 


_ each S immediately upon her completion of the fifth 


treatment session, At that time measures of S's 
responses were again taken on the BAT, the FT, and 
the FSS-II, in that order. In the follow-up session, 
6 weeks after the completion of the final treatment 
session, the BAT, FT, and FSS-II were adminis- 
tered again. 


RESULTS 


The results are reported separately for each 
Measure used. Separate analyses were per- 
formed on the data of the BAT, the FT, the 
FSS-II, and the FSS-II snake item. Pretest 
data were analyzed on the basis of raw scores, 
whereas the posttest and follow-up data were 
analyzed in terms of change scores between 
Pretest and posttest, then between pretest and 
follow-up tests, respectively. 
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Pretest Posttest Follow-up 


Fic. 1. Mean pretest, posttest, and follow-up test 
scores on the BAT. 


The Ss were matched on their pretest BAT 
scores, so there were, of course, no significant 
differences between the groups and the pre- 
test analysis (F < 1). The analysis of vari- 
ance on posttest change scores, however, did 
reveal significant differences (F = 7.30, p< 
001). The overall strength of the statistical 
association (w°) was estimated at .35, indi- 
cating that the treatment procedures ac- 
counted for a sizable portion of the sys- 
tematic variation. Individual comparisons of 
posttest BAT scores reveal that both the LD 
and the TD groups showed significantly more 


TABLE 1 


Tuxry’s HSD Prerest-10-PostrEsT COMPARISONS 
ON THE BEHAVIORAL Avorance TEST 


Group NT PD TD LD 
NT = 1.5 N bog Sos 
PD = naa 3.6* 3.8* 
TD = ee. = of 

pani 
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TABLE 2 


Tuxery’s HSD PRETEST-TO-FOLLOW-Up Comparisons 
ON THE BEHAVIORAL AVOIDANCE TEST 
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TABLE 3 


Tuxey’s HSD PRETEST-TO-POSTTEST COMPARISO 
ON THE FSS-IT SNAKE Irem ' 


Group NT PD TD LD Group NT PD LD 
NT = — 4.7* 4,3* NT ae 1.8* 
PD = = 5.0* 4.6* PD Fee 1:5% 
TD = = = A LD = p — 

ee = $005 

+*+ p <.001 


approach behavior in the BAT posttest ses- 
sion than did the NT or the PD controls. An 
analysis of variance on the follow-up scores 
indicated that the group differences were main- 
tained over the 6-week follow-up period (F 
= 7.19, p < .001). The o° was .36, virtually 
the same as in the posttest analysis. 

Individual comparisons of group means 
show that the differences found on the post- 
test analysis were essentially maintained. 
Tukey’s HSD comparisons (Winer, 1962) of 
pretest to posttest and follow-up BAT scores 
are shown in Tables 1 and 2, respectively. 
The degree of avoidance of the snake in the 
pretest, posttest, and follow-up sessions is 
depicted in Figure 1. 

No significant differences between the 
groups existed on the FSS-II snake item prior 
to treatment (F < 1.33). However, there 
were differences in the amount of snake fear 
reduction reported by the groups following 


Pretest 


Posttest Follow-up 


Fic. 2. Mean pretest, posttest, and follow-up test 
Scores on the FSS-II snake item. 


of statistical association (w*) here was 
Both the LD and TD groups reported s 
cantly less fear of snakes than did eith 
NT control or the PD group. Again, th 
and PD controls were not significantly d 
ferent from one another on this measure, 

Essentially the same results were found 
the follow-up analysis (F = 4.23, p< 0 
The strength of statistical association (¢ 
was again .23. Tukey’s HSD compariso 
pretest to posttest and follow-up FSS 
snake-item scores are shown in Tables 3 
4, respectively. Figure 2 illustrates the mea 
of self-rated fear of snakes for the vari 
groups. 

Comparison of the groups on the FT 
well as on the overall FSS-II revealed no $i 
nificant difference before treatment, from p 
treatment to posttreatment, or from pre 
ment to follow-up. 


Discussion 

Concerning the fundamental question 
this study, that is, the possible effect of i 
relationship factor on outcome, virtually 
differences between the LD and TD 
were found on any of the measures used. 


TABLE 4 


Touxery’s HSD Pretest-r0-Fottow-Up ComPaRis®l 
ON THE FSS-IT SNAKE ITEM 


Group PD NT LD 
PD — at 1.4* 
NT = — 1.3* 
LD — — — 

*p <05. 
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2 points separated these groups on their pre- 
test to posttest change scores on the BAT. On 
the follow-up the difference was 4 points. 
Also, minute differences between these two 
groups were found in the other measures used 
in the study. These results support the work 
of Melamed and Lang (1967), in which they 
found no differences between automated de- 
sensitization and desensitization conducted in 
the standard fashion by a therapist. 

It might be argued that, even though the 
entire treatment for the TD group was con- 
ducted by means of tape recordings and Ss 
did not interact with Æ during the treatment 
procedures, some sort of relationship was es- 
tablished because Æ was present in the room. 
While it is true that the procedure was not 
fully automated, these criticisms seem to be 
begging the question somewhat. No attempt 
to establish a relationship between Æ and S$ 
in the TD group was made and, indeed, the 
impersonal nature of their interaction was 
more conducive to distancing than to relating. 
There was virtually no communication and Æ 
deliberately tried to avoid interpersonal in- 
volvement with Ss who received the semi- 
automated treatment. 

The proposition put forth by Andrews 
(1966) and others that the therapist must 
accept the patient, establish a nurturant role, 
and then use the relationship as leverage to 
make the patient confront his fears does not 
seem to be supported by the results of this 
study. 

This is not to say, of course, that the thera- 
peutic relationships have no effect. It has been 
demonstrated that clients can associate the 
therapist with the discriminative properties of 
the reinforcing agent; and the therapist can 
use these qualities to bring about changes in 
the client’s behavior. The point, however, is 
that the relationship factor, as demonstrated 
in this experiment—which is merely an ana- 
logue to the clinical helping relationship—is 
not an integral and necessary part of the de- 
sensitization process, and successes with the 
technique cannot be attributed solely or even 
primarily to this factor. 
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Two side observations are worth noting. 
First, to the extent that the control of model- 
ing, suggestion, attention, and E bias through 
the use of taped instructions was attained in 
this study, the results strengthen still further 
the credibility of previous outcome studies in 
desensitization. Second, despite the modifica- 
tions of the systematic desensitization pro- 
cedure introduced in this study, the treat- 
ment appears to have been at least as effec- 
tive as that obtained in previous experimental 
studies. It may be speculated that the record- 
ing of thorough case histories, the construc- 
tion of separate hierarchies, and conducting 
more than five sessions of desensitization are 
often unnecessary, at least insofar as the 
treatment of well-defined fears is concerned. 
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AND 


The present study investigated the effects of therapist-absent alternate sessions 
and a vicarious therapy pretraining experience on group psychotherapy out- 


come with outpatients. It also related 


patient self-exploration to outcome. The 


alternate sessions regimen and high levels of self-exploration both led to slightly 


better outcome. Vicarious therapy prei 


training was found to be highly facilita- 


tive, having its greatest effect on neurotic symptomatology. 


The use of alternate sessions in group psy- 
chotherapy has led to some controversy re- 
garding its efficacy. By alternate sessions is 
meant the use of group therapy sessions in 
which the therapist is absent, alternating with 
sessions in which both patients and therapist 
are present, Their use has been recommended 
by Wolf (1961), Wolf and Schwartz (1962), 
and Truax (1962a, 1962b). On the other 
hand, Slavson (1963) and Ginott* have 
raised objections to the use of alternate ses- 
sions. It is quite apparent that, if helpful, 
alternate sessions would constitute an eco- 
nomically desirable regimen. Previous research 
by Truax and Carkhuff (1965) on a sample 
of hospitalized patients indicated that the use 
of alternate sessions was not only nonbene- 
ficial, but led to significantly poorer outcome 
than regular group psychotherapy, supporting 
the position taken by Slavson and Ginott. 
However, since outpatients as a group are in 
closer contact with reality and more adequate 
in interpersonal relating, it seems plausible 
that alternate sessions might prove beneficial 
with neurotic outpatients even though they 
might be harmful with hospitalized patients. 

Vicarious therapy pretraining (VTP) has 
been proposed by Truax (1962a, 1962b) to 
provide standard cognitive and experiential 
structuring of “how to be a good patient” as 


1This research was supported by Rehabilitation 
Services Administration Grant RD-906 and Social 
and Rehabilitation Services Grant RT-13. 

? Requests for reprints should be sent to Charles 
B. Truax, College of Education, University of Flor- 
ida, Gainesville, Florida 32601. 

a H. G. Ginott, personal communication, March, 
1964. 


a means of quickly engaging the patient in the 
process of group therapy. VTP simply in- 
volves presenting to prospective patients a 
30-minute tape recording of excerpts of 
“good” patient in-therapy behavior. It pro- 
vides a vicarious experience of how clients 
often explore their problems and feelings, and 
how they prove helpful to one another prior 
to their introduction to group therapy. One 
study (Truax & Carkhuff, 1965) has indicated 
modest therapeutic benefit of VTP. 

Prior research in individual and group psy- 
chotherapy has also suggested that successful 
patients engage in a greater degree of self- 
exploration than less successful patients in 4 
wide variety of patient populations (Blau, 
1953; Braaten, 1958; Peres, 1947; Seeman, 
1949; Tomlinson & Hart, 1962; Truax, 1961, 
1962a, 1962b; Truax & Carkhuff, 1963; 
Wagstaff, Rice, & Butler, 1960). The present 
study attempted, also, to study the relation- 
ship between patient in-therapy self-explora- 
tion as it is related to outcome. 

The present study, involving a heterogene- 
ous sample of both therapists and patients, 
was aimed at evaluating the following hy- 
potheses: (a) patients receiving alternate ses- 
sions in addition to their regular sessions will 
show greater evidence of constructive person- 
ality change than patients receiving regulat 
sessions only, (b) patients receiving VIP prior 
to group therapy will show greater evidence 
of constructive personality change than pa- 
tients not receiving VTP, and (c) patients m 
groups that engage in relatively high Jevels 
of self-exploration will show greater evidence 
of constructive personality change than pê- 
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tients in groups that engage in relatively low 
levels of self-exploration. 


METHOD 


A total of eight psychotherapy groups, with an 
initial 10 patients in each group (N = 80), were seen 
by eight different therapists * for time-limited group 
psychotherapy consisting of 24 sessions on a twice 
weekly basis over a time span of approximately 3 
months. The 80 outpatients were assigned to groups 
from the available populations of the following co- 
operating institutions: Department of Psychiatry, 
University of Wisconsin; Counseling Center, Uni- 
versity of Massachusetts; Veterans Administration 
Clinic, Covington, Kentucky; and University of 
Kentucky. 

All group therapy sessions were completely tape 
recorded to allow for analysis of levels of patient 
self-exploration occurring in each group. Patients 
were given a battery of psychological tests pretherapy 
and postherapy which served as the basic measures 
of outcome. 

The eight groups formed a 2 X 2 factorial design, 
with half of the groups receiving VIP and the other 
half non-VTP, and half of the groups receiving 
alternate sessions in addition to regular sessions with 
the other half holding regular sessions only. For 
those patients receiving VTP, the 30-minute VTP 
tape recording was presented to the group prior to 
the first session. The alternate sessions for those 
groups holding alternate sessions began after the 
eighth, ninth, or tenth sessions, and continued alter- 
nating with the regular sessions until the 24 regular 
sessions were completed. 


Patient Population 


The patient population as a whole was a mildly 
disturbed, neurotic outpatient group. The patients 
ranged in age from 18 to 58, with a distribution 
skewed to the right. Six patients, based on their 
pretesting, might be labeled schizophrenics. 


Therapists 


The therapists were assigned to the eight groups 
within the samples from the participating institu- 
tions. In general, they were highly experienced in 
individual and group psychotherapy. They were 
quite heterogeneous in orientation, including two 
therapists who could be described as client-centered, 
four therapists who could be described as psycho- 
analytically oriented, and two who would describe 
themselves as eclectic in orientation.’ Of the thera- 


* Several groups involved co-therapists in addition 
to the therapist responsible for the group. 

5 The present research would not have been possible 
without the dedicated contribution of the therapists 
who gave so freely of their time, energy, and talent. 
Appreciation is very gratefully extended to the fol- 
lowing therapists and co-therapists: Arnold Marx, L. 
Stein, Donald Kiesler, Donald Price, Robert R. 
Carkhuff, Jerome Sczymanski, Walter A. Dickenson, 
Joseph Willett, Dean Allen, and Joe Havens. 
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pists and co-therapists, four were psychiatrists and 
seven were psychologists. 

The therapists were, in general, positively disposed 
toward the study. They were optimistic about the 
usefulness of VIP, and they were interested in the 
effects of alternate sessions. The latter constituted 
a minor administrative annoyance, but there was no 
evidence that therapists felt threatened by their 
groups’ meeting without them. 


Measurement of Patient Self-Exploration 


The Depth of Intrapersonal (Self) Exploration 
Scale (Truax, 1962c) was used as the basic measure 
of patient self-exploration. From the set of tape 
recordings of regular sessions of the eight groups, one 
3-minute sample was selected from the middle third 
of every other recorded therapy session (89 samples). 
These samples were obtained by running the tape 
recorder at rewind to the middle section of the 
tape and stopping. The 3-minute samples were then 
taken beginning with the first new utterance by a 
patient or therapist and continuing for 3 minutes, 
or until the conclusion of a statement to preserve 
intelligibility. These samples were rerecorded indi- 
vidually onto small single spools which were then 
randomly assigned code numbers and presented to 
the raters in boxes of 12 samples. 

Four undergraduate college students who were 
naive with respect to psychotherapy theory and 
practice were trained in the use of the rating scale. 
A criterion of a éorrelation of .50 for interjudge 
and rate-rerate reliability on training samples was 
used to screen the raters. The raters rated the coded 
samples in different orders of blocks of 12 samples. 
The mean ratings from the four judges per sample 
were used in the analyses. 


Measurement of Patient Personality Change 


Although 80 patients were given pretesting, a 
number of patients dropped out of group therapy 
before the tenth session, some refused to participate 
in testing posttherapy, and for a few the test admin- 
istrations were judged invalid and discarded without 
examination of the results because of patient mis- 
understanding or inability to perform on the measure, 
Thus, 62 patients were available for data analysis on 
some measures, while as few as 51 patients were 
available for analysis on the measures of Q sorts for 
self-concepts. 

The test battery in the present study adminis- 
tered pretherapy and posttherapy included the 
MMPI, with the 10 clinical scales, the Sum of Clini- 
cal scales, the Barron Ego Strength scale, the Truax 
Constructive Personality Change Index, the Welsh 
Internalization of Anxiety Ratio, Edwards Social 
Desirability, Sum of Validity Deviations, Welsh’s 
Anxiety Index; the Q sort for self- and ideal concept 
(Butler & Haigh, 1954) with the five specific mea- 
sures of self- and ideal concepts; and the Finney Palo 
Alto Group Therapy scale (Finney, 1954). 

The 80-item Q-sort deck was used for only two 
sorts (self and ideal) both pretherapy and post- 
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therapy. The five measures obtained were (a) change 
in the correlation between self and expert Q sorts 
from pretherapy to posttherapy, (b) change in the 
correlation between self and ideal Q sort, (c) change 
in the correlation between ideal and expert Q sort, 
(d) change in the self Q sort adjustment scores, and 
(e) change in ideal adjustment scores. The “expert” 
Q sort is a Q sort representing ideal adjustment 
made by a panel of 14 experts (Lewis, 1959). The 
“adjustment” Q sorts, both self and ideal, refer to 
a method of scoring the Q sort on the basis of an 
imposed true-false type of dichotomy. 

The Palo Alto Group Therapy scale consists of 
a number of true-false items referring to social inter- 
action behavior engaged in by the patient during 
group psychotherapy itself. It is filled out by the 
therapist. In the Forsythe and Fairweather (1961) 
study it was the only measure in a large battery 
that was predictive of follow-up adjustment post- 
therapy. 

For the most part the outcome battery consisted 
of self-report measures, however. Ideally, behavioral 
indexes of change might have been included, such 
as ratings by Os, changes in earnings, grades, etc.; 
but the heterogeneity of the patient population 
and geographic dispersion of the groups made this 
infeasible. 


RESULTS 


Change scores for each patient on each of 
the 23 measures of outcome were obtained 
by subtracting the score earned pretherapy 
from the score earned posttherapy. To guard 
against possible differences between groups in 
initial status, analysis of covariance was used 
to partial out the pretherapy level in its effect 
on measured change. 

The basic design of the present study might 
be described as a 2 X 2 factorial design with 
the additional factor of high versus low self- 
exploration within each cell. Perfect design 
balance was disturbed, however, by missing 
data due to invalid testing, refusal to take 
tests, and premature terminations. As such, 
cell frequencies were unequal requiring that 
the unweighted means method for computing 
the analyses of covariance be used. In addi- 
tion, since cell frequencies become more dis- 
Proportionate with finer subdivisions, and 
since presumably the greater the dispropor- 
tionality the more questionable the validity 
of the analysis, the major variables were ana- 
lyzed separately by one-way analyses of co- 
variance. None of the hypotheses in the study 
are concerned with interactions of the vari- 
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ables, thus lack of information regarding these 
is not a great loss. To test the main effects 
in each analysis, group or therapist variability 
within cells was used as the estimate of error, 
This constitutes a relatively conservative 
error estimate. Differences in outcome associ- 
ated with different therapists and therapy 


groups are assessed by testing between group | 


variability against S variability. 


Effects of Vicarious Therapy Pretraining 


The mean values of outcome measures for 


patients receiving VTP versus those not re- 
ceiving VTP (NVTP) is shown in Table 1, 


There is a striking tendency for VIP to 
show therapeutic advantage over NVTP. On 
the 23 measures of outcome, patients receiving 
VTP showed greater improvement on 21 and 


less improvement on only two (x? = 15.696, 


$ < .001). Also presented in Table 1 are the 
results of the analyses of covariance for the 
effects of VTP on patient outcome. Patients 
receiving VTP showed significantly greater 
improvement than patients not receiving VIP 
on the following MMPI Scales: D, Hy, Pl, 
and the Sum of Clinical Deviations. Thus, the 
data tend to provide relatively strong support 
for the initial hypothesis. It appears that the 


greatest influence of VTP was on neurotic 


symptomatology. 


Effects of Alternate Sessions 


Table 2 presents the mean values on the 
23 outcome measures for patients receiving 
alternate sessions in addition to regular 
sessions versus patients receiving only regu 
sessions. Patients receiving alternate sessions 
showed greater improvement on 17 measures 
and less improvement on six (x? = 2.630, ns); 
whereas, patients receiving only regular ses 
sions showed greater improvement on only 
six measures and less improvement on 17. 

The results of the analyses of covariance 
on the 23 measures also appear in Table 2. 
There it can be seen that the addition of 
alternate sessions to regular group therapy 
results in significantly greater outcome 00 the 
following MMPI measures: Hs, Hy, Sum of 
Clinical Deviations, and the Constructive 
Personality Change Index. 


Thus, the data suggest moderately positive 
consequences by the addition of alternate 
sessions to regular outpatient group meetings. 


Patient Degree of Self-Exploration 


The level of self-exploration was calculated 
{for each group, and then the groups were sub- 
‘divided into those showing highest versus 
those showing lowest levels of self-exploration. 
Ratings were made with raters using the Scale 
of Depth of Self-Exploration (Truax, 1962c). 

The mean outcome scores on the 23 mea- 
ures for patients in groups that engaged in 
ligh levels of self-exploration and patients 
jit groups engaging in low levels of self- 
loration are presented in Table 3. There 
tcan be seen that patients in groups engaging 
high levels of self-exploration showed 
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greater improvement on 15 measures and less 
improvement on eight (x? = 2.130, ns); 
whereas, for patients in groups engaging in 
low levels of self-exploration, greater improve- 
ment occurred on only eight measures and 
less improvement on 15. 

The results of the analyses of covariance 
for high and low levels of self-exploration are 
also presented in Table 3. Differences favoring 
better outcome with high rather than low 
levels of self-exploration occurred on the fol- 
lowing measures; Q sort for self-adjustment, 
Q sort congruence between self- and ideal 
concepts, and the Constructive Personality 
Change Index from the MMPI. These findings 
give modest support to the hypothesis sug- 
gesting greater improvement for high self- 
exploration groups. 


TABLE 1 


MEAN VALUES AND ANALYsIS OF COVARIANCE F Ratios: EFFECTS or VICARIOUS 
THERAPY PRETRAINING ON OUTCOME 


Mean values Analysis of covariance F ratios 
Measures of outcome 
VTP NVTP VTP-NVTP Therapists/Cells 
Self, adjustment 9.950 4.065 32,745" 
If-expert r 335 138 1.187 34.2560" 
Ideal, adjustment 952 — 871 53,533*** 
lf-ideal 7 .354 177 27.884*** 
Weal-expert 7 ‘001 003 45.896» 
b Alto Group Therapy scale .490 179 4.019 1.475 
Sum of Clinical Deviations 23.792 —.750 8.034* 
Constructive Personality Change Index| 14.917 4.125 
[äs 1.125 —1.188 2.033 
4,000 —.125 5.758* 
3.375 —1,188 19.631** 
2.042 —.906 2.958 
.792 1.406 1,227 
2.792 469 
4.875 313 5.554* 
6.083 1.250 1.779 
a 1.583 781 4.191%** 
H 5.500 2.531 
atton’s Ego Strength 3.667 1.500 
wards’ Social Desirability 3.750 2.500 1.124 
of Validity Scales’ Deviations 3.875 —.938 2,010 1.072 
flsh’s Anxiety Index 7.125 1,906 2,944 
Ish’s Internalization Ratio 025 016 1.394 


| 2 <.05, one-tailed test. 
| 142 <.05, two-tailed test. 
> <.01, two-tailed test. 
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Discussion 


In general, the results of the present study 
lend support to the hypotheses stated at the 
outset. As such, the study was successful in 
serving to establish the effectiveness of the 
variables investigated in producing con- 
structive personality and behavioral change 
through the treatment known as group 
psychotherapy. 

The effect of inserting alternate sessions 
with the therapist absent between regular 
group therapy sessions with the therapist 
present would appear to be slightly facilitative 
from the present results. They generally sup- 
port the recommendations of Wolf (1961) and 
Wolf and Schwartz (1962), and refute the sug- 
gestions of Slavson (1963) and Ginott (see 
Footnote 3) who have objected to the use 


TABLE 2 


MEAN VALUES AND ANALYSIS OF COVARIANCE F Ratios: EFFECTS OF 
ALTERNATE SESSIONS ON OUTCOME 
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of alternate sessions. This finding is of par. 
ticular importance as the shortage of trained 
therapists increases. It is an economical pro. 
cedure since it extends the effectiveness oj 
scarce therapists. It should be cautioned 
however, that alternate sessions may not be 
effective with other types of patient popula: 
tions. Indeed, it has been found by ttl 
authors (Truax & Wargo, 1968a, 1968) 
that the use of alternate sessions with hos 
pitalized mental patient groups and with juve} 
nile delinquent groups led to considerably les} 
improvement than did the use of regulay 


be more socially responsible in order to dent 
fit from alternate sessions; indeed, in or 
to be not harmed or retarded by altern 
sessions. 


Mean values Analysis of covariance F ratios 
Measures of outcome its) 
sts 
Alternate Regular ini i, 
Q sort 
Self, adjustment 8.000 4,048 32,745" 
Self-expert r .260 151 34,2560" 
Tdeal, adjustment —.226 0.000 37.848" 
Self-ideal r 1324 “125 27.888" 
Tdeal-expert 7 ‘025 — 032 40,761" 
Palo Alto Group Therapy Scale (254 421 1.475 
MMPI 
Sum of Clinical Deviations 15.290 2.920 11.517** 
Constructive Personality Change Index 12.419 4,200 21.304** 
Hs 1,032 —1,720 7.049* 
D 1,903 1,320 
Hy 2.194 —1,000 23.572** 
Pd 387 320 
My 452 2,000 
Pa 1.645 1.240 1.244 
Pt 3.290 1,000 2.923 
Se 4,290 2.120 2.994 
Ma 323 2.120 
St 4.484 2.960 
Barron’s Ego Strength 3.323 1.320 4.210 
Edwards’ Social Desirability 2.903 3.200 
Sum of Validity Scales’ Deviations — 161 2.720 1.712 
Welsh’s Anxiety Index 3.677 4.720 
Welsh’s Internalization Ratio 043 —.008 


* p <.05, one-tailed test. 
** b <,05, two-tailed test. 
*+ p <01, two-tailed test. 
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The effects of vicarious therapy pretraining 
were quite positive. This supports the hy- 

thesis based on earlier suggestions by Truax 
(1962a, 1962b) that this form of systematic 
preparation for psychotherapy might well be 
beneficial to outcome. Again, however, this 
finding should not be generalized to other pa- 
tient populations since this same hypothesis 
js clearly not supported by the findings in 
the previously mentioned study of institu- 
tionalized juvenile delinquents. It does appear, 


_ however, that the structuring of patient roles 


through the use of tape-recorded modeling is 
facilitative with psychoneurotic outpatients. 
This procedure is of therapeutic value when 
incorporated into the treatment of neurotic 
outpatient groups. 
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As for the relationship between level of 
self-exploration and psychotherapeutic out- 
come, the results of the present study pro- 
vide modest support for the hypothesis that 
high levels of self-exploration will be related 
to high degrees of outcome. It is of particular 
interest to find that it holds for group ther- 
apy, since level of self-exploration in group 
therapy is determined by the overall level of 
the group rather than by the levels of self- 
exploration demonstrated by individual mem- 
bers of the group. It is most likely that there 
is considerable within-group heterogeneity of 
self-exploration in most groups. As such, 
measurement of the variable is gross. At the 
same time, it is likely that some benefit 
accrues to the patient who sits quietly but 


TABLE 3 


MEAN VALUES AND ANALYSIS OF Covarrtance F Ratios: EFFECTS OF DEPTH OF 
JINTRAPERSONAL EXPLORATION ON OUTCOME 


Mean values Analysis of covariance F ratios 
Measures of outcome i 
High DX LowDX | High-Low DX Ha 
paire Ba iinvnsrdh ess 

Q sort 

Self, adjustment 9.233 2.286 4,553* 19.506"** 

Self-expert r .293 104 3.510 21,961" 

Ideal, adjustment —.290 095 38,288" 

Self-ideal r 330 116 6.336" 16,556"** 

Tdeal-expert r —.024 041 2.255 30.675*** 
Palo Alto Group Therapy Scale .283 .386 1.485 EWEN an 
MMPI 

Sum of Clinical Deviations 12.533 6.577 1.207 

Constructive Personality Change Index 15.833 577 4.950* 

Hs —.267 =S 1.188 

D 1.967 1.269 

Hy 1.133 346 1,432 

Pa .233 500 

af 633 1.731 3.377 

Pa 2.067 769 

Pt 3.333 1.038 1,017 

Se 4.367 2.115 1.049 

Ma 167 2.231 1.677 2.577** 

Si 5.833 1462 3.570 

Barron’s Ego Strength 3.600 1.077 j 

Edwards’ Social Desirability 4.533 1.308 2.729 \ 

Sum of Validity Scales’ Deviations 833 1.462 1.324 

Welsh’s Anxiety Index 5.500 2.577 

Welsh’s Internalization Ratio 050 —.015 2.451 


Note-—DX = depth of intrapersonal exploration. 
wn? S05, one-tailed test. 
ary 2 5-05, two-tailed test. 
b <.01, two-tailed test. 
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is able to vicariously benefit from the explora- 
tions of fellow patients within the group. In 
this sense, then, the level of exploration mani- 
fested by the group as a whole may somewhat 
accurately reflect the benefit gained by each 
member individually. It may be that levels 
of overt exploration on the part of one person 
leads to greater exploration on the part of the 
second, even though the second never voices 
his explorations publicly. Self-exploration, as 
the name suggests, may be a kind of working 
through of one’s problems and it is in this 
manner that it may lead to constructive 
personality change. 

A very interesting, but unpredicted finding 
has to do with the self-concept measures. 
Tables 1, 2, and 3 show that the therapists- 
within-cells terms are significant for all five 
Q-sort measures. (Actually, this term might 
better be called “therapy-groups-within-cells” 
for purposes of interpretation, rather than 
“therapists-within-cells,” which is a handier 
designation.) It appears that there were dif- 
ferences from group to group with regard to 
the amount of improvement made in self- 
concept which is not attributable to the vari- 
ables included in the study. This very specific 
effect, apparently independent of other per- 
sonality and behavioral changes, obviously 
resulted from something occurring in the 
group process. What this might be is not 
clear. One might suggest any number of 
hypotheses, however. A likely explanation has 
to do with differences between therapists in 
their reinforcement of self-references. That is, 
therapists could conceivably condition posi- 
tive changes in self-concepts through differ- 
ential reinforcement of self-reference state- 
ments. This is likely, since self-reference 
statements are one indication of degree of 
self-exploration. This explanation receives 
some support from the results, presented in 
Table 3, which show that of the five Q-sort 
measures used, the three which are measures 
of self-concept against a standard all changed 
more in high-exploration groups, and two of 
these were significant. Thus, it would appear 
that change in self-concept is at least par- 
tially related to level of self-exploration in the 
therapy group. 
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The present study provides moderate sup- 
port for the use of two procedures which 
differ from standard group therapy practice, 
Further, it suggests some relationship be. 
tween self-exploration and outcome in group 
psychotherapy. 
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MEASUREMENT OF SOCIAL-EVALUATIVE ANXIETY* 


DAVID WATSON ? 
University of Hawaii 


The specification of a construct of social anxiety, the subsequent development 
of two scales, and validational studies are reported. The two scales are the 
Fear of Negative Evaluation scale (FNE) and the Social Avoidance and Dis- 
tress scale (SAD). The scales had very high indexes of homogeneity and pos- 
sessed sufficient reliability. Three experiments and other correlational data are 
presented. People high in SAD tended to avoid social interactions, preferred to 
work alone, reported that they talked less, were more worried and less con- 
fident about social relationships, but were more likely to appear for appoint- 
ments. Those high in FNE tended to become nervous in evaluative situations, 
and worked hard either to avoid disapproval or gain approval. Certain con- 
vergent and discriminant relationships had been part of the construct of social 
anxiety, and the correlational data supported these differentiations. 


Several lines of research and theory point 
to the potential heuristic value of the devel- 
opment of a scale or scales to measure anxiety 
experienced in social situations, Those studies 
on the effects of social censure on the per- 
formance of psychotics (Rodnick & Garmezy, 
1957), on the effect of disturbed family 
and social relationships in psychopathology 
(Gerard & Siegel, 1950), and on the cor- 
relation between social incompetence and 
hospitalization (Zigler & Phillips, 1962) have 
all suggested that distressful social relation- 
ships may be one powerful determinant of 
psychopathology. A second line of research 
has followed the idea that some individuals 
are more anxious than others in social situa- 
tions, and might therefore be more persuasible 
(Sears, 1967), more concerned with others’ 
evaluations of themselves (Diggory, 1966), or 
simply be upset at having to interact with 
other people (Byrne, McDonald, & Mikawa, 
1963). A third line of evidence which points 
to the need for social anxiety scales is that 
which suggests that anxiety scales which con- 
fine their questions to specific situations or 
conditions, (Endler & Hunt, 1966) have 
greater predictive validity for those specific 
situations than those scales which sparsely 


1 This research was conducted while the first author 
was at the University of Toronto and was supported 
by Grant No. 67 from the Ontario Mental Health 
Foundation. 

2 Requests for reprints should be sent to David 
Watson, Psychology Department, University of 
Hawaii, Honolulu, Hawaii 96822. 
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sample diverse situations. There do ¢ 
sets of items which have been non 
measuring social anxiety (Dixon, del 
& Sandler, 1957; Sears, 1967) but 
either are too few or have no under! 
struct, nor have they been control 
sponse style problems such as acqui 
social desirability, and they have nev 
validated. 
Social-evaluative anxiety was init 
fined as the experience of distress, | 
fort, fear, anxiety, etc., in social 
as the deliberate avoidance of so 
tions; and finally as a fear of recei 
tive evaluations from others. The f 
aspects were combined to composé 
Avoidance and Distress (SAD) scale, 
last factor was employed to compose 
of Negative Evaluation (FNE) scale, 


Specification of the Constructs 


The two scales were constructed at t 
time. Three general goals were 
suppress response style errors, to fi 
homogeneity, and to foster discri 
convergent relationships with certa 
scales. These goals can be appro 
paying careful attention to the naturi 
constructs from which the items follo 

Anxiety often is inferred from Vé 
subjective distress, the execution of av 
responses, impaired performance, OF 
physiological signs. Almost all ans! 
are a haphazard mixture of items 4 
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report on these four aspects of his behavior 
or experience. It was decided in the construc- 
tion of the present scales to exclude items 
asking about physiological signs or impaired 
performance. This would make clear what be- 
havior was used as a sign for membership 
in the class “anxious”—subjective distress 
and avoidance—and what behavior was con- 
sidered a correlate of class membership—im- 
paired performance and physiological signs. 

The advantage of this specification is that one 
. may later more easily distinguish between 
construct and predictive validity. 

A second, rarely followed, requisite for 
careful delineation of a construct is the neces- 
sity for adequate definition of the range of 
the trait, particularly its opposite instance 
(Jackson, 1966a). To control for acquiescence 
response set in a true-false format, as was 
done here, one must have approximately half 
of the items worded so that answering “false” 
indicates presence of the trait, which requires 
an adequate definition of the opposite in- 
stance. For example, the opposite instance of 
| social avoidance is not necessarily social 
approach, that is, affiliation, but simply a 
lack of avoidance. If the opposite instance of 
avoidance were allowed to be affiliation it 
would have obscured any possible discrimi- 
nant relationship between the SAD scale and 
measures of affiliation. Particular attention 
was paid to wording items so that the op- 
posite instance of a trait simply indicated 
absence of that trait, not the presence of some 
other trait. 

Fear of negative evaluation was defined as 
apprehension about others’ evaluations, dis- 
tress over their negative evaluations, avoid- 
ance of evaluative situations, and the ex- 
pectation that others would evaluate oneself 
negatively. The presence of high FNE does 
not necessarily imply that an individual evalu- 
ates himself negatively, or that he is con- 
cerned about revealing his inferiority (Dixon 
et al, 1957). Fear of loss of social approval 
Would be identical to FNE, but the opposite 
instance of FNE is simply lack of anxiety 
about others’ evaluations, not inevitably a 
desire for positive evaluation. High ENE dif- 
fers from test anxiety in that it is not specific 
to testing conditions, but may operate in any 
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social, evaluative situation, such as being on 
a date, talking to one’s superiors, or being 
interviewed for a job. 

The SAD scale was divided into two sub- 
scales, social avoidance and social distress. 
The purpose was to create a general scale, so 
the respondent was not asked why he experi- 
enced distress or avoided social encounters. 
Social avoidance was defined as avoiding 
being with, talking to, or escaping from others 
for any reason. Both actual avoidance and the 
desire for avoidance were included. The op- 
posite instance was simple lack of an avoid- 
ance motive, not desire to affiliate. Social dis- 
tress was defined as the reported experience 
of a negative emotion, such as being upset, 
distressed, tense, or anxious, in social inter- 
actions, or the reported lack of negative emo- 
tion, such as being relaxed, calm, at ease, or 
comfortable. The opposite instance of distress 
was lack of unhappiness, not the presence of 
some positive emotion. 

The item selection procedure was explicitly 
chosen to eliminate as much social desirability 
variance as possible, for questions about anx- 
iety in social situations would tend to elicit 
great concern with issues of social desirabil- 
ity, and the additional variance would not 
have been helpful. In two other instances 
discriminant relationships were explicitly en- 
couraged. Questions about S’s reaction to 
testing situations were excluded, as there are 
already several test-anxiety scales in existence, 
and because the social anxiety scales could 
be more explicitly oriented to social situa- 
tions. Finally, the authors attempted to foster 
a discriminant relationship between the SAD 
and ENE scales themselves, in the hope that 
separating them would increase their heuristic 


value. 
CONSTRUCTION OF THE SCALES 


Following the stringent criteria for evalua- 
tion of items suggested by Jackson (1966a), 
145 items were selected by rational analysis 
from a much larger pool.? These were then 
subjected to an empirical test, and the final 
58 items which survived as the two scales 


8The authors wish to thank Barney Gilmore, 
Stewart Page, and Sybil Paterson, who either con- 
tributed suggestions for items or aided in their 
evaluation. 
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are presented in Tables 1 and 2. The scoring 
key is given after each item. The SAD scale 
is evenly divided between true and false 
items; the FNE scale has 17 true and 13 
false items, 

The 145 items initially selected were ad- 
ministered to 297 undergraduates at the Uni- 


TABLE 1 
SOCIAL AvomaANcE AND Distress (SAD) SCALE 


1. I feel relaxed even in unfamiliar social situ- 
ations. (F) 

2. I try to avoid situations which force me to be very 
sociable. (T) 

3. It is easy for me to relax when I am with 
strangers. (F) 

4, I have no particular desire to avoid people. (F) 

5. I often find social occasions upsetting. (T) 

6. I usually feel calm and comfortable at social 
occasions. (F) 

7. I am usually at ease when talking to someone of the 
opposite sex. (F) 

8. I try to avoid talking to people unless I know them 
well. (T) 

9. If the chance comes to meet new people, I often 
take it. (F) 

10, I often feel nervous or tense in casual get-togethers 
in which both sexes are present. (T) 

11. I am usually nervous with people unless I know 
them well. (T) 

12. I usually feel relaxed when I am with a group of 
people, (F) 

13, I often want to get away from people. (T) 

14. I usually feel uncomfortable when I am in a group 
of people I don’t know. (T) 

15. I usually feel relaxed when I meet someone for the 
first time. (F) 

16, Being introduced to people makes me tense and 
nervous. (T) 

17. Even though a room is full of strangers, I may enter 
it anyway. (F) 

18. I would avoid walking up and joining a large group 
of people. (T) 

19, When my superiors want to talk with me, I talk 
willingly. (T) 

20. I often feel on edge when I am with a group of 
people. (T) 

21. I tend to withdraw from people. (T) 

22. I don’t mind talking to people at parties or social 
gatherings. (F) 

23. I am seldom at ease in a large group of people. (T) 

24. I often think up excuses in eden ts pad anit 
engagements. (T) 

25. I sometimes take the responsibility for introducing 
people to each other. (F) 

26. I try to avoid formal social occasions. (T) 

27. I usually go to whatever social engagements I 
have. (F) 

28. I find it easy to relax with other people. (F) 
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TABLE 2 
Fear or NEGATIVE Evatuation (FNE) 


. I am usually worried about what kind of imp! 


. If I know someone is judging me, it has little el 


. I worry very little about what others may 


. Sometimes I think I am too concer! 


. I am often indifferent to the opinions 


. Lam usually confident that others will 


I rarely worry about seeming foolish to others, (f) 
I worry about what people will think of me even | 
when I know it doesn’t make any difference, m | 


. I become tense and jittery if I know someone is 


sizing me up. (T) 


. Tam unconcerned even if I know people are forming 


an unfavorable impression of me. 


(F) 


. I feel very upset when I commit some social 


0) 


error. 


. The opinions that important people have of me 


cause melittleconcern. (F) 


. Tam often afraid that I may look ridiculous or make 


a fool of myself. (T) 


. I react very little when other people disapprove of 


me. 


(F) 


. I am frequently afraid of other people noticing 


my shortcomings. (T) 


|. The disapproval of others would have little effect 


(F) 


on me. 


. If someone is evaluating me I tend to expect the 


worst. (T) 


. I rarely worry about what kind of impression I am 


making on someone. (F) 


. Tam afraid that others will not approve of me. (1) 
. Tam afraid that people will find fault with me. (I) 


Other people’s opinions of me do not bother 
me. (F) 


. I am not necessarily upset if I do not please some: 


one. 


(F) 


. When I am talking to someone, I worry about whit 


they may be thinking about me. (T) 


. I feel that you can’t help making social errors some 


times, so why worry about it. (F) 


ression 
I make. 


(T) 


. I worry a lot about what my superiors think of 


me. 


a fect 


on me. 


(F) 


. I worry that others will think I am not worth: 


while. (T) think 


of me. 


F) 
2 ned with what 
other people think of me. (T) 

I often worry that I will say or 

things. (T) 


do the wrong 


others bav? 


of me. (F) 


have a favor 


able impression of me. (F) 

I often worry that people who are impor! 
won’t think very much of me. (T) 

I brood about the opinions my friends have 
me: 4 (1) being 
I become tense and jittery if I know I am 
judged by my superiors. (T) ae 


tant tom 


about 
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versity of Toronto. The Crowne-Marlowe 
(1964) Social Desirability scale and the first 
10 items of Jackson’s (1966b) Infrequency 
scale were also administered. The latter was 
used to control for pseudorandom responding, 
and Ss who answered any of these items in- 
correctly were dropped from all analyses. 
Ninety-two Ss were dropped for this reason 
or because they did not answer all items. 
The Crowne-Marlowe scale provided an 
initial empirical criterion against which to 


evaluate the SAD and FNE items. Jackson’s 


(1966a, 1967) item selection procedure was 
employed, to minimize covariation with social 
desirability as a response style. For every 
item for 205 Ss a computer calculated the 
biserial correlation of the item with its own 
scale and the biserial correlation with all other 
scales in the sample. Items endorsed by less 
than 5% of the sample were dropped. 
Jackson’s Differential Reliability Index 
(DRI) was calculated for each remaining 
item. In the first instance, the DRI was calcu- 
lated for the item’s own scale and the Crowne- 
Marlowe, giving an estimate of the amount 
of correlation between an item and its own 
scale with social desirability variance re- 
moved. The first criterion for selecting items 
was that this DRI be as high as possible and 
in all cases above .50. It would not have been 
possible to apply further criteria had not the 
original rational analysis provided a large 
number of items which passed this first hurdle. 
i The second criterion was that the probabil- 
ity of endorsement of each item must have 
been above 10% and as close to 507% as 
Possible. Third, a discriminant relationship 
between the SAD and FNE was encouraged 
by using a DRI in which item-own scale 
correlation was estimated with variance at- 
tributable to the other scale removed, select- 
ing those items with minimal common vari- 
ance. Fourth, to control for acquiescence, 
items were selected by judging the adequacy 
of those representing the opposite instance of 
the trait, Finally, items were selected ac- 
cording to content dissimilarity and the repre- 
Sentativeness of the situations about which the 
items inquired. 


451 


Characteristics of the Scales 


One of the major goals was to foster scale 
homogeneity, This can be expressed as the 
mean biserial correlation of each item with its 
own scale, The mean biserial correlation of 
the selected FNE items, corrected for pres- 
ence of item in the total score, was .72; 
and of the selected SAD items, it was .77 
(N = 205, p<.01). The product-moment 
correlation of the two subscales of the SAD, 
avoidance and distress, was .75. A second 
index of homogeneity is the KR-20 reliability 
statistic, The KR-20s of both the selected 
FNE and SAD items were .94. The KR-20 
of the Crowne-Marlowe was .79. In a second 
sample of 154 Ss the KR-20 of the FNE was 
96 and that of the SAD was again .94. The 
two scales are very homogeneous. 

A second goal was to minimize the relation- 
ship of the scales to social desirability. The 
product-moment correlations of both the FNE 
and SAD with the Crowne-Marlowe scale 
were —.25 (N = 205, p< 01). These cor- 
relations might have been less had the 
Crowne-Marlowe itself been more homogene- 
ous. In any case, the relationship with social 
desirability has been minimized. Another goal 
had been to foster a discriminant relationship 
between FNE and SAD. In the first sample, 
the product-moment correlation between the 
two was .51 (N = 205, $ < .01). A later sam- 
ple of 42, with which a Spearman correlation 
was used, showed a correlation of 32 (N= 
42, p< .05). The attempt to foster a dis- 
criminant relationship between SAD and 
FNE did not eliminate all the common 
variance, probably because some people score 
highly on SAD just because they are fearful in 
social-evaluative situations. However, the com- 
mon variance has been minimized. 

Neither of the scales was normally dis- 
tributed. The distribution of the FNE was 
nearly rectangular. The mean was 15.47, the 
standard deviation was 8.62, and the median 
was 16, This rectangularity indicated that 
the entire range of the scale was well used. 
The distribution of the SAD was skewed. 
Although the modal score was zero, the mean 
was 9.11, the median was 7, and the standard 
deviation was 8.01. Separate analysis of the 
social avoidance and social distress subscales 
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indicated that the distribution of social avoid- 
ance was most skewed. High levels of SAD 
were not as common as high levels of FNE. 
High levels of SAD may be more pathological. 
Variables determining extreme social with- 
drawal or distress, which might be termed 
schizoid, are probably not normally distribu- 
ted within the general population, which 
would explain the skew in the SAD data. This 
lack of normality in the SAD may make 
easier the task of early identification of those 
who later show schizoid reactions. 

There were differences between the sexes in 
scores on the two scales. The mean scores on 
the SAD were: males, (WV = 60) 11.20; fe- 
males (V= 145) 8.24. This difference is 
significant (t= 2.64, p< .01). Males re- 
ported more social avoidance and distress 
than females. The mean scores on the FNE 
were: males, 13.97; females, 16.10. This dif- 
ference approached significance (N the same 
as before; t = 1.76, p < .10). In this case the 
direction was reversed, women reporting more 
fear of negative evaluation than men. 

Data for test-retest reliability were gath- 
ered on a sample of 154 Ss in the summer 
school of the University of Toronto. The 
SAD and FNE scales were administered 
alone, during class time, and 1 month passed 
between administrations. The product-mo- 
ment, test-retest correlation of the FNE was 
.78, and that of the SAD was .68. A second 
sample of 29 gave the figures .94 for FNE 
and .79 for SAD, but the larger size of the 
first sample makes it a better estimate. These 
figures indicated sufficient reliability. 


EXPERIMENTAL STUDIES ‘ 


The SAD and FNE scales were adminis- 
tered to 358 students in the summer school at 
the University of Toronto. From this pool Ss 
were drawn for three experiments, whose pur- 
pose was to validate the two scales. 


SAD and the Prospect of Working 
Alone or Together 


Persons who are high on SAD would be 
expected to be uncomfortable in social situa- 


*The authors wish to thank Betty Jean Findlay, 
Donna Hutchinson, and Mary Ann Linseman for 
serving as Es. The authors appreciated the useful 
advice of John Arrowood on the design of one 
of the studies. 
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tions, and to prefer to be alone, Even the 
prospect of having to be in a future social 
interaction might make those high on SAD 
anxious. In this study Ss were told that they 
would later be required to participate in g 
group discussion or write an essay alone, and 
the effect of this expectation on their per. 
formance, anxiety, and other attitudes was 
observed. 

From the 82 Ss scoring zero or one on the 
SAD, 46 were randomly selected as the low 
anxious (LA) group, and from the 85 scoring 
12 or above, 52 were selected as the high 
anxious (HA) group. Within these two groups 
Ss were randomly assigned either to an Essay 
or Group Discussion condition. The Ss par- 
ticipated in the study in heterogeneous groups 
varying in size from 6 to 15. All Ss had the) 
same female Æ. The Ss were told that they | 
would actually be in two experiments, one to 
occur immediately and the second afterward. 
In the Essay condition, Ss were informed that 
in the second experiment they would be placed 
alone in a cubicle where they would write an 
essay on an interesting, controversial topic. In 
the Group Discussion condition, Ss were told 
that they would be placed in a small group it 
which they would be expected to participate 
actively in a discussion of an interesting, cont | 
troversial topic. Envelopes labeled “Essay 
Writing” or “Group Discussion” which oš 
tensibly contained material for the later work 
were provided, The Ss then participated it 
what was for them the first of two experiments) 
in which they had administered to them the) 


out a questionnaire. After this Æ said i 
the second experiment could not take plate 
at that time, and elicited an indication of how) 
interested Ss might be in returning for W 
second study. That concluded the experimen 
All of the significant results came from 
questionnaire, as the simple expectation p) 
later social interaction apparently did 1 
arouse the degree of anxiety necessary E 
interfere with performance at the Digit Sym 
bol task. 

The Ss indicated how interested the 
in returning at a later date by checking 
point scale labeled “Not at all” at onè 


y vet 
Beh 
end) 
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writing condition produced more concern than 
the Group Discussion condition (F = 7.39, 


TABLE 3 
QUESTIONNAIRE DATA FROM EXPERIMENT I 
Condition 
Item Group 
Essay | Discus- 
sion 
Degree of interest in returning 
HA 
M 2.67 | 2.39 
N 24 28 
LA 
M 3.19 | 3.44 
N 21 25 
Number of Ss choosing each condition 
HA* 27 24 
LA 5 41 
Degree of concern 
HA 2.33 | 1.89 
LA 1.28 | 1.16 
Degree of confidence 
HA 3.20.| 2.64 
LA 3.28 | 3.68 


a One S did not respond. 


and “Very much” at the other. Analysis of 
variance for unequal NM was performed on 
these and all other data. Table 3 presents the 
data. The difference between the HA and 
LA groups (F = 9.49, df = 1/94, p < 01), 
indicated that as expected the HA Ss were 
less interested in returning, The overall dif- 
ference between the Essay and Group Dis- 
cussion conditions was also significant, but 
more interesting was the interaction (F = 
45.73, df = 1/94, p < .001), which indicated 
that while HA Ss were more interested in 
returning if they were to be in the Essay con- 
dition, the LA Ss were more interested if they 
were to be in the Group Discussion. The Ss 
were also asked to indicate their choice of 
being in the Essay or Group Discussion con- 
ditions when they returned. The data, also in 
Table 3, were analyzed by chi-square CaS 
19.11, df= 1, p < .001). The LA Ss showed 
a much greater preference for the Group Dis- 
cussion condition. These two analyses pointed 
up much greater interest in þeing sociable 
among LA Ss. 

The Ss were asked how worried or con- 
cerned they were about the second experi- 
ment, and they responded on a 5-point scale 
as above. The HA Ss were more worried (F 
= 26.38, df = 1/94, p < 001). The Essay 


df = 1/94, p < .01), perhaps because an es- 
say is tangible and more easily evaluated 
while one’s participation in a group discus- 
sion is not. If this is so, then it would explain 
the interaction effect (F = 4.25, df = 1/94, 
p <.05), in the direction opposite that ex- 
pected, as the HA Ss were most concerned in 
the Essay condition, The HA $s in that con- 
dition may have been worried about some 
possible later evaluation of their essay. 

Another question asked Ss to predict how 
uneasy or nervous they expected to feel in the 
second experiment, and they responded on a 
5-point scale, There was a difference between 
the anxiety groups (F = 26.77, df = 1/94, p 
< .001), but no other effects. The means 
were: HA, 3.02 and LA, 1.80. In another 
question using a 5-point scale, LA Ss reported 
that they expected to be calmer than HA Ss 
did in the coming experiment (F = 14.46, df 
= 1/94, p< 001). The means were: HA, 
2.98 and LA, 3.80. The HA Ss expected to be 
more nervous, less calm, in the second experi- 
ment. A final question asked about the degree 
of apprehension felt about participating in the 
coming study. A 5-point scale was labeled 
“apprehensive” and “confident” at its ex- 
tremes. The means are in Table 3. The LA Ss 
reported more confidence (F = 6.70, df= 
1/94, p < .05), and there was an interaction 
(F = 4.92, dj = 1/94, p < .05) which showed 
the expected pattern: HA Ss were more con- 
fident or less apprehensive if they were to be 
in the Essay condition, while LA Ss were more 
confident if assigned to the Group Discussion. 

To sum up, people who scored highly on 
the SAD scale were less likely to be in- 
terested in returning, and more likely to 
choose to be alone, both indexes of social 
avoidance. The HA Ss were more worried and 
uneasy as well as less calm and confident—all 
indexes of distress—and these discomforts 
were magnified in the together condition. Two 
other studies provided modest, additional vali- 
dation for the two scales. 


SAD and Being Alone or Together 


From the same distribution of SAD scores, 
97 HA Ss, with scores of 11 and above, and 
58 LA Ss, with scores below three, all of whom 
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performed at the same tasks, were randomly 
assigned to work either alone, together in a 
cubicle with another person but without talk- 
ing, or together and talking to the other per- 
son in the cubicle. The tasks were anagrams, 
tisk-level, and the learning of paired-associa- 
tive nonsense syllables. The experimental con- 
ditions produced results in the predicted di- 
rections, but were too weak to elicit enough 
anxiety to show significant effects on per- 
formance. In the Talking condition, the Ss 
were encouraged to talk, but how much they 
actually talked was left up to them, When 
asked to indicate how much they had actually 
talked on a scale, labeled “Briefly, occasion- 
ally, frequently, and almost all the time” at its 
four points, HA Ss reported significantly less 
talking (f= 2.20, df=42, p< 05). The 
means were: HA, 2.7; LA, 3.3. Not talking to 
others is a kind of social avoidance, and the 
SAD scale is supported here in that it differ- 
entiates Ss according to how much they re- 
port talking to others, 


FNE, Approval Seeking or Disapproval 
Avoiding 

If fear of negative evaluation is an avoid- 
ance motive, then a person high in FNE 
might try to gain social approval simply as a 
way of avoiding disapproval. Individuals high 
on FNE might be expected to be most af- 
fected by the possibility of disapproval, while 
those low on FNE might be more affected by 
the possibility of gaining approval. This study 
attempted to manipulate these incentive con- 
ditions. 

The S, seated alone in a cubicle, believed 
that the work he was doing was part of a 
group effort, and that his performance would 
be evaluated by an unseen, unknown group 
leader. From the top 25% of the same FNE 
distribution—scores above 18—and from the 
bottom 25%—scores less than 9—48 high 
FNE and 48 low FNE Ss were randomly se- 
lected. Within each of these two groups Ss 
were randomly assigned either to a condition 
in which they were told that their group 
leader might approve of their work, but 
would never disapprove of it (Approval), or 
to a condition in which the leader might dis- 
approve, but would never approve (Disap- 
Proval). Actually there were no groups and 
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Ss never heard from their leader, The 3s per- 
formed a letter-number substitution task for 
4 minutes, the dependent variable being the 
number of items completed. Analysis of vari- 
ance indicated that the only effect which ap- 
proached significance was the interaction 
pattern (F = 3.76, df = 1/92, p < 10), The 
mean scores were: high FNE Approval, 148.2; 
high FNE Disapproval, 145.2; low FNE Ap- 
proval, 132.9; low FNE Disapproval, 149.5, 
If this unexpected, borderline effect may be 
believed, it indicates that those high on FNE 
not only avoided disapproval but sought ap- 
proval, while the low FNE Ss were not ap- 
proval seekers, and were motivated to work 
hard on a relatively dull task only if dis- 
approval were threatened. 

On a questionnaire, Ss were asked how 
nervous or uneasy they felt during the ex- 
periment, and they responded on a 5-point 
scale. As expected, those high on FNE re- 
ported more uneasiness (F = 17.01, df =1/ 
92, p < .001). The means were: high FNE, 
2.5; low FNE, 1.6. The Ss had also been 
asked how they evaluated their own per- 
formance and how they thought their group 
leader would evaluate their performance. 
Neither of these questions revealed differ- 
ences between groups, but some Ss—18 out 
of 96—reported that they expected the 
group leader to think less well of their work 
than they did. Of the 18, 10 were high FNE 
and 8 were low, which is a chance occurrence, 
but all 10 of the high FNE Ss were in the 
Approval condition, while the low FNE Ss 
were evenly divided between Approval and 
Disapproval. By Fisher’s exact test this pat- 
tern is significantly different from chance (f 
< .025). If this post hoc analysis indicates 4 
real difference, then it suggests a defensive 
sort of expecting the worst among some high 
FNE Ss. 


CORRELATIONAL STUDIES 


To check on certain discriminant or cot 
vergent relationships which had been speci- 
fied, and to search for new relationships, ° 
SAD and FNE scales were correlated Wi 
other measures. Table 4 presents the correla- 
tions of FNE and SAD with Taylor’s (1953) 
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SoctaL-EVvALUATIVE ANXIETY 


TABLE 4 
CorreaTion or FNE Anp SAD WITH OTHER SCALES 


Original sample* 


MA LC AH—* E-H** 
FNE 60 AS 28 AT 
N 171 172 173 35 
SAD 54 ns 18 45 
N 171 173 35 
Second sample** 
asr | SRG! | Athi- | Desir- | Auton-| Defend-| Achieve- 
eil iation | ability] omy | ence | ment 
FNE | .39 | .77 ns —.58 | —.32 42 ns 
SAD | .76 ns —.76 | —.42 ns ns =.33 
Third sample** 
Aggres- | Domi- Abase- | Exhibi- | Impul- 
sion nance ment tionism sivity 
FNE ns —.50 .29 —.39 ns 
SAD ns ns ns ns ns 


D ete —For the second sample, N = 42; for the third, 
* There were no significant correlations with AH +. 
*ifr = 15, p <.05; ifr =.19, p <.01. 
ifr = 30, p <.05; ifr = .39, p < 01. 


(1966) Locus of Control scale (LC), Alpert 
and Haber’s (1960) Achievement Anxiety 
scale, divided into debilitating and facilitating 
anxiety subscales (AH— and AH+), the so- 
cial and evaluative parts of the Endler-Hunt 
(1966) S-R Inventory of Anxiousness (E-H), 
Paivio’s (1965) Audience Sensitivity Index 
(ASI), and 11 of the subscales of Jackson’s 
(1966a, 1966b) Personality Research Form 
(Social Approval, Affiliation, Desirability, 
Autonomy, Defendence, Achievement, Aggres- 
sion, Dominance, Abasement, Exhibitionism, 
and Impulsivity). The table presents only 
significant or borderline correlations. 

There should be some common variance 
between a general and a more specific mea- 
sure of anxiety, If the MA scale is a measure 
of general anxiety, then the FNE and SAD 
should have a moderate relationship to it, 
which was the case. An attempt was made to 
foster a discriminant relationship with test 
anxiety, and the low correlation of both the 
social anxiety scales with AH— indicates that 
this was accomplished. If the situation elicit- 
ing the state of anxiety is as important or 
even more important than the trait itself 
(Endler & Hunt, 1966), then fostering dis- 
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criminant relationships between situationally 
specific scales is valuable. Indeed, Cherlin 
(1967) showed that, compared with other 
anxiety measures, only a measure of state 
anxiety in social situations correlated with 
behavior in those situations. 

A scale measuring social avoidance would 
have to be negatively correlated with affilia- 
tion, and the SAD met this criterion. Whether 
lack of affiliation necessarily implies presence 
of social anxiety is a moot point. The prob- 
lem is that a person low on both traits would 
answer the opposite instance items of the two 
scales similarly, thus producing a spurious 
correlation, that is, a correlation between 
scales which did not imply a relationship be- 
tween constructs, The present data indicate 
that it is impossible to be high on both traits, 
which is as it should be, given the nature of 
the constructs, but whether it is possible to 
be low on both traits cannot definitely be 
answered by these data. This possible spurious 
correlation would be a general problem in 
specifying the relationship between scales 
which control for acquiescence by employing 
opposite instance items which inquire simply 
about the absence of the trait in question. 
Yet the paradox is that the best kind of item 
does just that to avoid confounding with an- 
other trait. 

The Endler-Hunt items which were used 
here ask S to report how he would respond in 
social-evaluative situations such as going to 
meet a new date, going on an interview for a 
very important job, going into a psychological 
experiment, etc. The moderate correlation of 
the SAD and FNE scales with reported reac- 
tions in these situations supports the validity 
of the two scales. 

Paivio’s ASI scale inquires about both the 
reaction of S to audience situations and pos- 
sible causes for that reaction. The evaluative 
nature of audience situations is indicated by 
the correlation of FNE and ASI. Note also 
the negative relationship of FNE and Exhi- 
bitionism. People who fear others’ evaluations 
don’t want to be the center of attention. The 
very high relationship between SAD and ASI 
delineates the experience of stage fright as 
intense discomfort and a desire to flee. 

The high correlation of FNE and Social 
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Approval was consistent with the tentative 
results of the third experiment, suggesting 
that people high in FNE are very concerned 
with gaining others’ approval. It should be 
noted that one item is identical on the two 
scales, and that the wording of the opposite 
instance items again raises the possibility of 
a spuriously high correlation. High FNE may 
imply a desire for social approval, but it is 
not clear that the latter necessarily implies 
FNE, nor is it known if it is possible to be 
low on both traits. This correlation does not 
imply that FNE is heavily loaded with social 
desirability as a response set, which was mini- 
mized by the use of the Marlowe-Crowne 
scale in the item selection procedure. Rather, 
people who fear other’s negative evaluations 
want their approval as a signal that the 
feared outcome has been avoided. 

Jackson’s Desirability scale, on its face, 
seems to measure the degree to which a person 
actually engages in socially undesirable be- 
havior. Presumably both high levels of SAD 
and FNE are somewhat undesirable, hence the 
moderate relationship with Desirability. Note 
that this is not desirability as a response 
style, which was assessed by the Crowne- 
Marlowe scale. The remainder of the signifi- 
cant correlations gave validational support to 
the FNE and SAD scales in a fairly straight- 
forward manner. People high on FNE tend to 
be defensive, not autonomous, not dominant, 
and perhaps self-effacing. Some of the De- 
fendence items, in fact, inquire about de- 
fensiveness in evaluative situations. To be 
autonomous or dominant opens one to criti- 
cism which is exactly what the person high 
on FNE fears. The negative correlation of 
SAD and Achievement emphasizes the social 
nature of the competitive achievement motive. 

One other validational study will be 
briefly reported. Summer school students at 
the University of Toronto were notorious for 
not keeping appointments to serve as Ss in 
experiments. A test was made to see if this 
behavior was related to social anxiety. Fifty- 
two of the 81 high SAD Ss, with scores above 
10, versus 36 of 80 low SAD Ss, scores below 
3, appeared for an experimental appointment 
(x? = 5.99, df= 1, p < 02). There were no 
effects of FNE. Apparently these high SAD 
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Ss were sufficiently afraid of the Æ or their 
professor (neither of whom had any power to 
enforce appearance for appointments) or the 
generalized other, to do what was asked of 
them. People high in SAD may be more in- 
gratiating or conforming than others, 


Discussion 


As expected, people high on SAD did avoid 
social situations, and were anxious in social 
interactions. Individuals high on FNE be- 
came nervous in evaluative conditions, and 
seemed to seek social approval. Both scales 
showed correlations with other relevant mea- 
sures. The results lend validity to the two 
scales. 

It is worth mentioning constructs which 
might have been embodied in the two scales, 
but were not: fear of revealing one’s in- 
feriority, concern about the appearance of 
one’s body, fear of lack of control of bodily 
processes, fear of one’s unmanageable motives, 
inadequate expectations concerning what oth- 
ers will do; not knowing how to behave in 
social situations, These and other notions may 
express some of the origins of social anxiety. 
They were not included in the present con- 
struct to avoid begging the question of ori- 
gins, but now that relatively unambiguous 
measures of the trait have been developed it 
will be possible to explore the relationships 
between variables. For example, one may ask 
how fear of other’s evaluations is related to 
self-evaluation. 

Social anxiety is a variable which may act 
as a moderator in a number of social situa- 
tions. For example, the threat of negative 
evaluation might increase the chances of elic- 
iting compliant behavior if the individual is 
in a state of anxiety and appraises that he 
can reduce anxiety through compliance. Thus 
FNE or SAD might act as moderator varia- 
bles in studies on ingratiation, conformity, 
persuasibility, the demand characteristics of 
experiments, social approval seeking, social 
comparison processes, etc. Studies of the s0- 
cial processes within groups, such as the study 
of communication, might use the SAD as @ 
moderator variable, In general, the FN 
might be relevant in any potential evaluative 
situation, and the SAD could be useful in 
studies of social interaction. 
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SoctaL-EVALUATIVE ANXIETY 


The situation eliciting social anxiety is im- 
portant, and one would expect to find that dif- 
ferent situations have differential effects on 
individuals. The potential item universe from 
which items were selected for the FNE and 
SAD was intended to be all social situations, 
though only a few situations were unsystem- 
atically sampled by the items actually chosen. 
The sex of the person with whom S is inter- 
acting, his status relative to S, whether he is 
a friend or stranger, in public or private con- 
ditions, under evaluative or nonevaluative 
circumstances, with covert or overt evalua- 
tion—all of these varied conditions will dif- 
ferentially affect people high or low in social 
anxiety. In an evaluative situation, how do 
the characteristics of the evaluator affect a 
person’s anxiety level, and how does the be- 
havior of the anxious person affect the judg- 
ment of the evaluator? Social avoidance and 
distress, measured by the SAD, may be a 
general reaction for some people, while for 
others it may be specific to certain conditions, 
such as dealing with authorities or members 
of the opposite sex. 

Very little is known about the development 
of high levels of social anxiety, or about what 
affect they have on individuals. The amount 
of FNE may be a simple function of the 
amount of prior disapproval received, or a 
more complicated function combining prior 
disapproval with desire for approval. It seems 
best to conceive it as acquired on the basis of 
some sort of frustration or punishment. The 
relationship between approval seeking and 
disapproval avoiding is unclear. Individuals 
with high levels of FNE may seek nonevalua- 
tive social situations. They may fear not only 
obvious evaluative conditions, but also expect 
and fear evaluation where none is intended. 
The amount of SAD may also be a function 
of prior frustration or punishment in social 
interaction, The motives or frustrations under- 
lying the acquisition of high SAD are unclear. 
A person with high SAD is clearly isolated 
and often fearful. The relationship of social 
anxiety to psychopathology is hinted here: 
the SAD scale may make it possible to study 
not simply the schizoid versus the normal, 
but intermediate types as well. Hopefully the 
development of the FNE and SAD scales will 
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make it possible to study several of the inter- 
actions of personality and social variables 
discussed here. 
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EFFECT OF PHYSICAL AND COGNITIVE SOMATIC AROUSAL 
ON RORSCHACH RESPONSES: 


AN EXPERIMENTAL TEST OF THE ASSUMPTION THAT BODY 
IMAGE INFLUENCES THE PERCEPTUAL ORGANIZATION OF 


UNSTRUCTURED STIMULI 


STEVEN N. VAN DE MARK? anp CHARLES NEURINGER °? 


University of Kansas 


An attempt was made to experimentally verify the assumption that body image 
influences the perceptual organization of unstructured stimuli. It was first 
established that the stimulus manipulation procedures used in this study 
produced adequate somatic focus effects. Then male and female introductory 
psychology students were randomly assigned into six somatic focus arousal 
conditions, After being administered the somatic focus arousal stimuli, Ss were 
administered the Rorschach test which was scored for Barrier and Penetration 
Index scores. It was found that the externally aroused somatic focus Ss pro- 
duced significantly higher Barrier scores than did the other Ss and that signifi- 
cantly more Penetration responses were made by the internal somatic focus 
group when compared with the other groups. It was concluded that the results 
of this study verified the underlying assumption that body image states influ- 


ence perception. 


Fisher and Cleveland (1958, 1965) feel 
that one’s body image is crucial in the separa- 
tion of self and world, is fundamental for the 
establishment of personal identity, and is basic 
in the development of adjustment strategies. 
They have proposed two Rorschach scoring 
procedures to assess body image boundary 
status. The basic assumption underlying their 
Rorschach scoring method is that one’s body 
image influences perceptual organization of 
unstructured stimuli. 

The body image status assessment proce- 
dure developed by Fisher and Cleveland is 
in terms of (a) a Barrier Index in which 
Rorschach responses emphasizing an external 
somatic locus (e.g., “rocky walls,” “person 
covered up with a blanket,” “mummy 
wrapped up,” etc.) reflect the existence of 
a definite body-world boundary, and (b) a 
Penetration Index in which responses em- 
phasizing an internal somatic locus (e.g., 
“bullet piercing flesh,” “X-ray of the body,” 
“dog eating a bone,” etc.) reflect indefinite 
body image. 


1Now at the Austin Mental Health Center, Aus- 
tin, Texas, 

* Requests for reprints should be sent to Charles 
Neuringer, Department of Psychology, University of 
Kansas, Lawrence, Kansas 66044. 
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An impressive amount of research (Fisher, 
1963; Fisher & Cleveland, 1965), mostly 
correlational, has been done using these two 
indexes. However, questions have arisen con- 
cerning Fisher and Cleveland’s assumption 
that the body image influences perception of 
unstructured stimuli on the Rorschach test. 
Mednick (1959) and Wylie (1961) have at- 
gued that the Barrier and Penetration scores 
may reflect general modes of dealing with 
Rorschach inkblots rather than being the 
result of mediating body image experiences. 
Fisher and Cleveland (1965) have acknowl- 
edged these criticisms and stated that “avail- 
able information does not at this time permit 
a meaningful answer [p. 65].” Howevel, 
Fisher (1963) does feel that the underlying 
assumption has some validity since much 0! 
his, and of his associate’s research, has estab- 
lished solid correlations between body phe- 
nomena and the two Rorschach indexes. Sev- 
eral studies (Fisher, 1965; Fisher & Fishel, 
1964) have related index scores to awareness 
of body sensations. Unfortunately, these 
studies were correlational in nature and thus 
do not provide a critical test of the assump- 
tion that body sensations initiate particular 
effects to appear on the Rorschach. These 16- 
searches only indicated that body sensation 
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PHYSICAL AND COGNITIVE Somatic AROUSAL 


awareness and certain kinds of Rorschach 
responses co-vary together. Two later attempts. 
(Fisher & Renik, 1966; Renik & Fisher, 
1968) to study the effects of concentration on 
the interiors and exteriors of the body found 
that concentrating on body interiors lowered 
Barrier scores in males; females who con- 
centrated on body exteriors raised their 
Barrier scores. 

A critical test of the assumption that the 
body image influences perceptual organization 
of unstructured stimuli would have (a) to 
be based on actual presentation and manipu- 
lation of arousal conditions that would lead 
Ss to experience their bodies in certain known 
ways and (b) to then relate these experi- 
mentally induced states to Rorschach re- 
sponses. If it can be demonstrated that the 
experimental manipulation of body awareness 
focus produces concomitant Rorschach re- 
sponses, then Fisher and Cleveland’s under- 
lying assumption is substantially corroborated. 
Such a demonstration would also support the 
wider contention (e.g., Freud, 1927; Wapner 
& Werner, 1965; Witkin, 1965) that the 
matrix of body experiences in one’s life, 
referred to as the body image, can contribute 
to the factors mediating and influencing one’s 
perception of self and world. 

In this study an attempt was made to 
manipulate physical and cognitive arousal 
procedures in such a way as to induce external 
or internal somatic focus and to ascertain 
whether these affected Barrier and Penetration 
scores, 


METHOD 


_ In order to evaluate the effect of experimentally 
induced body focus states on Rorschach responses, it 
must first be established that the techniques used to 
arouse somatic focus are adequate. Without the 
establishment of their adequacy, it would be difficult 
to relate Rorschach responses to body image focus. 
Failure to demonstrate a relationship between body 
focus and Rorschach indexes could either be due to 
the actual nonexistence of the relationship, insensi- 
tivity on the part of the Rorschach, or to ineffective 
experimental manipulations of the independent vari- 
able. In order to avoid this cul-de-sac, & validating 
study of the experimental manipulations was carried 
out before evaluating their effect on the Rorschach. 


Subjects 


All Ss in the validating of the arousal procedure 
study and the investigation of their effect on Ror- 
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schach responses were chosen at random from intro- 
ductory psychology classes at the University of 
Kansas during the 1968 spring semester. They were 
randomly assigned to one of six arousal conditions 
in such a manner as to insure equal sex representa- 
tion in each group. All of the Ss went through the 
routines on an individual basis. The scoring of the 
independent variable measures was carried out with- 
out knowledge of S’s group assignment. Scoring 
reliability evaluations were made for all the measures. 


Arousal Procedures 


An attempt was made to develop procedures that 
would physically and cognitively induce internal and 
external somatic focus. A set of procedures were 
also developed for the control group. These latter 
manipulations were designed to induce focus on 
environmental aspects which had no direct reference 
to body procedures. Half of the procedures attempted 
to induce arousal through physical manipulations, 
For each of these sets of physical procedures there 
was a comparable condition in which Ss were asked 
to imagine carrying out or experiencing the activities 
for equivalent amounts of time. There were six 
arousal conditions. 

Internal focus arousal. In the physical internal focus 
arousal condition, S swallowed a small cup of crushed 
ice, sniffed from a bottle of liquid Bar-B-Q smoke 
for 1 minute, listened to his own heart beat for 
1 minute, and drank a small cup of warm cola. In 
the comparable cognitive arousal condition, S was 
asked to imagine doing the things described above 
for the same amounts of time. 

External focus arousal. In the physical external 
focus arousal situation, Ss were asked to hold arms 
horizontally straight outward while sitting for 1 
minute, hold both hands under cold water for 1 min- 
ute, hold both legs horizontally straight outward 
while sitting for 1 minute. In addition, Ss also 
vibrated their neck, shoulders, and arms for 1 minute 
with an electric hand vibrator. In the comparable 
cognitive arousal condition, Ss were asked to imagine 
doing these things for identical periods of time. 


8 The selection of procedures that would lead to 
only internal or external somatic focus is extremely 
difficult since these two body areas are not isolated 
from each other and seem to have interactive prop- 
erties. Primary stimulation of one of these sectors is 
accompanied by secondary arousal of the other, One 
of the external focus arousal procedures used in this 
study (holding arms out straight for 1 minute), while 
stimulating the periphery, also gives rise to internal 
muscle sensations. In the same manner, a procedure 
which focuses on internal arousal has secondary 
stimulation effects at the periphery of the body. The 
selection of arousal procedures can only be based on 
the presumed primary locus of arousal. The presumed 
primary locus of arousal can be partially validated 
by phenomenological experience. For example, hold- 
ing hands or legs out straight is reported by Ss to 
effect their “outsides” rather than their “insides.” 
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Neutral focus arousal. For the physical neutral 
focus arousal condition, Ss were asked to view a 
mountain scene for 1 minute, inspect a United States 
road map for 1 minute, listen to taped instrumental 
music for 1 minute, and look at a modern painting 
for 1 minute. In the comparable cognitive condi- 
tion, Ss were asked to imagine carrying out the 
above-cited activities for comparable lengths of time. 


Arousal Validating Methods 


In order to evaluate whether the arousal condi- 
tions, described in the preceding section, affected 
somatic focus, Ss were administered both the Homo- 
nym test (Secord, 1953) and the Body Sensation 
Report technique (Fisher, 1965) immediately after 
undergoing the arousal manipulations. 

The Homonym test is a list of 100 homonyms, 75 
of which may have either somatic or nonsomatic 
meanings (e.g, colon, digit, gag, nail, vessel, etc.). 
It was hypothesized that if the arousal procedures 
were effective, the internal and external focus Ss 
should give more somatic homonym definitions than 
the neutral focus group. The Body Sensation Report 
technique involves having an S close his eyes, focus 
his attention on his body, and report each time a 
sensation is experienced. The sensations reported by 
Ss are scored either internal or external depending 
on the reported body locus. It was hypothesized that 
if the arousal conditions were adequate for effecting 
a differential somatic focus, the external focus arousal 
Ss should report significantly more external body 
sensations than the internally aroused group. It was 
also hypothesized that the internal focus arousal Ss 
should report more internal body sensations than the 
external arousal group. 

Since the Homonym test does not seem to be as 
obviously concerned with somatic variables as the 
Body Sensation Report technique, it was always 
administered first in order not to bias Ss toward 
bodily responses. Forty-eight Ss (24 male and 24 
female) were used for this part of the study. 


Rorschach Indexes 


After establishing the adequacy of the arousal 
conditions, a new set of 120 Ss (60 males and 60 
females) were used to evaluate the arousal effects 
on the Rorschach Barrier and Penetration Indexes. 
The Ss were immediately administered the Rorschach 
test after they had gone through their particular 
arousal condition. The protocols were scored for 
Barrier and Penetration according to the procedures 
described by Fisher and Cleveland (1958, pp. 58-71). 
It was hypothesized that if the internal somatic 
states affect the perceptual organization of unstruc- 
tured stimuli, then those Ss in the external somatic 
focus group should have higher Rorschach Barrier 
Scores than do those Ss in the internal and neutral 
somatic groups. It was also hypothesized that the 
internal somatic focus Ss would give more Rorschach 
Penetration responses than Ss in the external and 
neutral focus conditions, 
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RESULTS 


Tt was found that the arousal procedures 
differentially affected the number of homo. 
nyms and body sensations reported by the 
Ss. In addition, Ss in the external somatic 
focus group gave significantly more Rorschach 
Barrier responses than the other somatic 
focus Ss. Significantly higher Rorschach Pene- 
tration scores were earned by the internal 
somatic focus group than by the other Ss, 
Scoring Reliability ` 

A random sample of each of the dependent 
measures was given to a second scorer* who 
knew nothing about the purpose of the 
study. Interscorer reliability coefficients 
(Pearson product-moment correlation) were 
found to be .95 for the Homonym Test; .87 
for both the Internal and External Body 
Sensation Report Techniques; and .98 and 
85 for the Barrier and Penetration scores, 
respectively. 


Arousal Validating Measures 


The mean number (and accompanying 


standard deviations) of somatic homonyms, 


internal and external body sensation reports, i 


given by the male and female Ss in the physi- 
cally and cognitively aroused internal, exter- 
nal, and neutral focus groups can be found 
in the first three columns of Table 1. For the 
Homonym Test (Column 1 of Table 1), 5s 
in both the internal and external somatic focus 
groups produced a higher mean number of 
somatic homonyms than did Ss in the neutral 
somatic focus condition (19.25 and 23.69 ya 
15.50). The Ss in the various cognitive 
arousal conditions gave more somatic homo- 
nym definitions (M = 20.67) than did those 
in the physical arousal conditions (M= 
18.29). The male and female Ss produced 
similar mean amounts of somatic homonyms 
(19.21 and 19.75). 

In order to evaluate whether any of the 
mean somatic homonym score differences 
found among S$ populations were statistically 
significant, a 3 X 2 X 2 factorial analysis of 

4The authors wish to express their gratitude and 


i a 
appreciation to Wynand D. Pienaar for serving as 
second protocal scorer. 
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TABLE 1 
Mean NUMBER OF SOMATIC HOMONYMS, INTERNAL AND EXTERNAL Bopy SENSATIONS, AND 
RORSCHACH BARRIER AND PENETRATION RESPONSES MADE BY THE MALE AND FEMALE 
SUBJECTS IN THE PHYSICALLY AND COGNITIVELY ÅROUSED INTERNAL, EXTERNAL, 
AND NEUTRAL Somatic Focus GROUPS 
Arousal validating measures Body boundary indexes 
(N=48) (N=120) 
Groups Somatic atea bony, aes Barrier Penetration 
homonyms reports reports responses responses 
M SD M SD M SD M SD M SD 
Treatment. groups 
Male internal physical 16.25 | 3.6 | 9.25 | 1.0 | 10.75) 14 3.40 | 3.0 | 6.40 | 28 
Female internal physical 19.25 | 7.3 | 9.00 | 2.6 $50 | 3.4 | 3.10 | 1.0 | 3.60 | 20 
Male internal cognitive 21.25 | 3.4 | 6.26 2.6 | 13.75 | 2.8 3.20 14 4.10 2.2 
Female internal cognitive | 20.25 | 3.7 950 | 2.2 | 10.50] 2.2 | 3.30 | 2.0 | 6.10 2.4 
Male external physical 26.25 | 12.8 | 3.75 | 1.4 | 16.00 | 2.2 7.00 | 1.7 | 240 | 17 
Female external physical 1700 | 4.2 | 4.50 | 13 | 15.50] 13 6,00 PHM Ln eee 
Male external cognitive 24.50 | 6.1 | 4.25 1.0 | 15.75 | 1.0 6.50 2.4 2.90 2.4 
Female external cognitive | 27.00 6.7 | 5.00 1.7 | 15.00 | 2.0 5.60 2.2 1,90 hy 
Male neutral physical 14,50 91 | 4.50 3.0 15.50 | 2.4 2.10 1.4 2.50 1.7 
Female neutral physical 16.50 | 43 | 7.75 | 2.6 | 12.25 2.6 | 2.20 | 1.0 | 280 | 1.7 
Male neutral cognitive 12.50 | 38 | 6.25 | 1.0 | 13.75 1.0 | 360 | 1.7 | 3.40 | 14 
Female neutral cognitive 1850 | 5.4 | 4.06 | 1.4 | 16.00 1.7 | 3.10 £75.) 2.00 yal sant 
Arousal procedures 
Physical 18.29 | 5.2 | 6.46 | 1.8 | 13.08 2.8 | 3.97 1.8 | 3.33 | 1.7 
Cognitive 20.67 | 48 | 5.88 | 19 1413 | 22 | 4.22 | 20 | 3.40 | 18 
Somatic focus 
Internal 19.25 | 3.4 | 8.50 | 2.1 | 10.88 2.2 | 3.25 1.3 | 5.05 | 2.3 
External 23.69 | 6.7 | 438 | 13 1556 | 18 | 6.28 | 2.8 | 238 | 1.9 
Neutral 4550 | 57 | 5.63 | 15 | 14.38 19 | 2.75 1.4 | 2.68 | 1.6 
Sex differences 
Male 19.21 | 5.6 | 5.71 1.7 | 14.25 | 1.7 | 4.30 1.7 | 3.62 1.7 
Female 19.75 | 4.3 | 663 | 2.2 12.96 | 1.7 3.88 1.6 | 3.12 | 1.7 


variance was carried out. The summary of 
the analysis is presented in Table 2. Only the 
somatic focus main effect was found to be 
statistically significant (F = 6.53, df = 2/36, 
< 05), Further analyses using £ tests indi- 
cated that the mean number of somatic homo- 
nym definitions given by both the internal 
and external somatic focus Ss were signifi- 
cantly greater than those of the neutral 
somatic focus Ss. It would appear that the 
internal and external somatic focus arousal 
procedures sensitized Ss in such a way as to 
produce somatic homonym definitions. 


5Since the same statistical design was used 
throughout the study, there will be no further de- 
scriptions of the statistical analysis procedures used 
for the ensuing measures. 


An inspection of the mean number of 
internal body sensation reports (Column 2, 
Table 1) made by Ss in this study reveals 
that the internal somatic focus groups gave 
more internal body sensations (M = 8.50) 
than the external and neutral somatic focus 
groups (M = 438 and 5.63, respectively). 
The physical arousal condition Ss tended 
to report more internal body sensations 
(M = 6.46) than did those in’ the cognitive 
arousal conditions (M = 5.88). A greater 
mean number of internal body sensations 
tended to be reported by the female (6.63) 
than by the male Ss (5.71). 

The analysis of variance conducted to re- 
veal whether any of the differences were 
statistically significant is presented in Table 3. 
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The somatic focus main effect was significant 
(F = 15.22, df= 2/36, p< .05). Further 
analyses revealed that the internal somatic 
focus Ss generally reported significantly more 
internal sensations than did the external and 
neutral somatic focus groups. The Somatic 
Focus X Arousal Prodecure X Sex Difference 
interaction was also found to be significant 
(F = 4.37, df= 2/36). It seemed that 
although the internal somatic focus Ss re- 
ported more internal somatic sensations than 
did the other Ss, the male cognitively aroused 
internal somatic Ss gave significantly fewer 
internal body sensations than did the con- 
comitant female group. In addition, the female 
physically aroused neutral somatic focus Ss 
reported more internal body sensations than 
did the other physically aroused groups (ex- 
cepting the internal somatic focus Ss). All in 
all, it would seem that the internal somatic 
focus manipulations led to significantly more 
internal body sensation reports than did any 
of the other arousal procedures. 

Column 3 of Table 1 presents the mean 
external sensations reported by the Ss. The 
external somatic focus Ss reported more exter- 
nal sensations (M = 15.56) than did the 
internal and neutral somatic focus Ss (M = 
10.88 and 14.38, respectively). The cogni- 
tively aroused groups tended to produce more 
external sensations (M = 14.13) than did the 
physically aroused Ss (M = 13.08). The 
males also tended to give more external body 


TABLE 2 


ANALYSIS OF VARIANCE OF THE SOMATIC HOMONYM 
DEFINITIONS MADE BY THE MALES AND FEMALES 
IN THE PHYSICALLY AND COGNITIVELY 
AROUSED INTERNAL, EXTERNAL, AND 
Nevutrat Somatic Focus Groups 


Source df MS F 
Somatic focus (SF) 2 268.77 6.53* 
Arousal procedure (AP) 1 67.68 1.64 
Sex differences (SD) 1 3.52 08 
SF X AP 2 18.18 44 
SF X SD 2 55.02 1.34 
AP X SD 1 46.02 1.12 
SF X AP X SD 2 62.02 1.51 
Residual 36 41.18 

Total 47 
*p <05. 
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TABLE 3 


ANALYSIS OF VARIANCE OF THE INTERNAL Bor 
SATIONS REPORTED BY MALES AND FEMALES ] 

THE PHYSICALLY AND COGNITIVELY ÅROU; 
INTERNAL, EXTERNAL, AND NEUTRAL 
Somatic Focus Groups 


Source df MS 
Somatic focus (SF) 2 71.58 
Arousal procedure (AP) 1 4.08 
Sex differences (SD) 1 10.08 
SF X AP 2 3.58 
SF X SD 2 1,08 
AP X SD 1 1.33 
SF X AP X SD 2 20.58 
Residual 36 4.70 
Total 47 
*p <01. 


sensations than did the females (M 
and 12.96, respectively). 

The analysis of variance of the 
body sensation scores is presented in T: 
The somatic focus main effect was found t 
significant (F = 22.25, df = 2/36, p< 
Further analysis disclosed that the e 
and neutral somatic focus groups rept 
significantly more external body sen 
than did the internal somatic focus SS. 
though the external somatic focus group 
a higher mean external body sensation $ 
than did the neutral somatic focus 4 
(15.56 vs. 14.38), there was no stati 
significant difference between the g 


TABLE 4 


ANALYsIs OF VARIANCE OF THE EXTERNAL Bi 
SATIONS REPORTED BY THE MALES AND 
IN THE PHYSICALLY AND COGNITIVELY 
ÅROUSED INTERNAL, EXTERNAL, AND — 
Nevrrat Somatic Focus GROUPS 


Source df MS 
Somatic focus (SF) 2 95.02 
Arousal procedure (AP)| 1 | 13.02 
Sex difference (SD) 1 20.02 
SF X AP 2 8.27 
SF X SD 2 6.39 
AP X SD ii 6.02 
SF X AP X SD 2 12.64 
Residual 36 4.27 

Total 47 
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TABLE 5 


ANALYSIS OF VARIANCE OF THE RORSCHACH BARRIER 
Scores MADE BY THE MALES AND FEMALES IN 
THE PHYSICALLY AND COGNITIVELY AROUSED 
INTERNAL, EXTERNAL, AND NEUTRAL 
Somatic Focus Groups 
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TABLE 6 


ANALYSIS OF VARIANCE OF THE RORSCHACH PENETRA- 
TION SCORES MADE BY THE MALES AND FEMALES 
IN THE PHYSICALLY AND COGNITIVELY 
ÅROUSED INTERNAL, EXTERNAL, AND 
Nevtrat Somatic Focus GROUPS 


Source df MS F Source df MS F 

Somatic focus (SF) 2 145.50 41.45* Somatic focus (SF) 2 85.90 20.40* 
Arousal procedure (AP) al 1.89 Ex] Arousal procedure (AP) 1 13 03 
Sex differences (SD) 1 5.20 1.48 Sex differences (SD) 1 7.50 1,78 
SF X AP 2 7.27 2.07 SF X AP 2 01 — 
SF X SD 2 2.15 61 SF X SD 2 .07 01 
AP X SD 1 10 02 AP X SD 1 4.03 96 
SF X AP X SD 2 65 9 SF X AP X SD 2 31.40 7.46" 
Residual 108 3.51 Residual 108 4.21 

Total 119 Total 119 

*p <01. *p <01. 


means. The sex difference main effect was 
also found to be significant (F = 4.69, 
df = 2/36, p < .01). The male Ss tended to 
report more external body sensations than 
did the females. This result is the reverse of 
the sex difference trend found for the internal 
body sensation report data. It would seem 
that the external somatic focus manipulations 
induced more external body sensation reports 
than did the other somatic focus procedures. 


Rorschach Indexes 


1 The mean Rorschach Barrier and Penetra- 
tion scores (and accompanying standard de- 
viations) made by the physically and cogni- 
tively aroused male and female internal, 
external, and neutral somatic focus Ss may be 
found in Columns 4 and 5 of Table 1. 

An inspection of Column 4 of Table 1 
reveals that the external somatic focus Ss 
Produced higher mean barrier responses than 
did the other groups of Ss (6.28 vs. 3.25 and 
2.75), The cognitively aroused groups tended 
to give more Barrier responses (M = 4.22) 
than did the physically aroused groups 
(M = 3.97). Little difference in terms of 
Mean production of Barrier responses was 
found between the sexes (4.30 and 3.88 for 
the males and females, respectively). 

The analysis of variance carried out to 
establish whether any of the mean differences 
were statistically significant is presented in 
Table 5. Only the somatic focus main ef- 
fect was significant (F = 41.45, df = 2/108, 


p < 01). Further analyses indicated that the 
external somatic focus group gave signifi- 
cantly more Barrier responses on the Ror- 
schach than did the internal and neutral 
somatic focus Ss. It would seem that external 
somatic focus manipulations induce Barrier 
responses to appear on the Rorschach test. 

In Column 5 of Table 1 can be found the 
mean Rorschach Penetration scores made by 
the Ss in this study. The internal somatic 
focus groups gave more Penetration responses 
(M = 5.05) than did the internal and neutral 
somatic focus groups (M = 2.38 and 2.68, 
respectively). The mean Penetration score 
differences between the two arousal procedure 
conditions and between the sexes were rather 
small (3.33 vs. 3.40, and 3.62 vs. sii 

The analysis of variance of these mean 
differences can be found in Table 6. The 
somatic focus main effect was found to be 
significant (F = 20.40, df = 2/108, p < 01). 
Further analysis indicated that the internal 
somatic focus Ss gave significantly more Pene- 
tration responses than did the external and 
neutral somatic focus Ss. The Somatic Focus 
X Arousal Procedure X Sex Difference inter- 
action effect was significant (F= 746, df= 
2/108, P< 01). Although all the internal 
somatic focus Ss gave more Penetration scores 
than did all the other Ss, the male physically 
aroused internal somatic focus Ss gave more 
Penetration responses than the cognitively 
aroused internal somatic focus males, The re- 
verse was true for the female internal somatic 
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focus Ss.° It appears that the internal somatic 
conditions induce Penetration responses to 
appear on the Rorschach test. 


Discussion 


The results of this study support the propo- 
sition that body image boundary status af- 
fects the perceptual organization of unstruc- 
tured stimuli. This proposition has been the 
prime underlying theoretical assumption be- 
hind Fisher and Cleveland’s research program 
relating Rorschach Barrier and Penetration 
scores to various aspects of personality. 
They have, however, presented no direct evi- 
dence to support this assumption. Without 
such support, the criticism that the Barrier 
and Penetration scores are subsidiary to and 
therefore reflect just another facet of general 
cognitive-personality style is one that must 
be taken quite seriously. 

However, the present study corroborates 
Fisher and Cleveland’s underlying assumption 
and mitigates against the Mednick and Wylie 
skepticism about the relationship between 
body image processes and perception. If the 
exterior of the body is made the center of 
focus, it is reflected in Rorschach response 
productivity having to do with external 
somatic locus. When the interiors of the body 
are experimentally stimulated, there is a con- 
comitant rise in interior physical locus re- 
sponses on the Rorschach. It is not even 
necessary to physically stimulate the two 
body image sectors. These effects are so 
powerful that even instructions to think about 
the interior and exterior parts of the body 
can mediate differential perception on the 
Rorschach test. There were no cognitive- 
physical arousal procedure differences found 
in this study. 

The confidence in the confirmation of 
Fisher and Cleveland’s underlying assumption 
was strengthened by the method used in this 
study. Attempts to validate the effectiveness 


6 These differential sex difference findings, as well 
as those reported elsewhere (Fisher & Renik, 1966; 
Renik & Fisher, 1968), may be the result of either 
learned cultural sex-role patterns of body focus ori- 
entation, or endocrinological sex variations, or possi- 
‘bly be due to the effects of the sex of the examiner. 
Further systematic research is needed to shed light 
on the sources of sex difference results. 
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of the independent variable are rare in the 
research literature. In this study the arousal 
procedures seemed to successfully induce Ss 
to focus on the exteriors and interiors of their 
body. The results can therefore be attributed 
to the Rorschach’s sensitivity to perceptual 
organization induced by differential somatic 
focus. The use of an experimental manipula- 
tion procedure (rather than a correlational 
design) helped to demonstrate the existence 
of a direct relationship between somatic focus 
and Rorschach responses rather than one in 
which the two variables co-vary as a func- 
tion of some other process (such as general 
cognitive style). 

It is difficult to deny a major role to the 
body image as initiator in the organization 
of perception and cognition. That bodily 
states influence perception has already been 
demonstrated in the work of Clark (1952) 
and Levine, Chein, and Murphy (1942) on 
the effects of sex and hunger drives on per- 
ception. Somatic influence has also been dem- 
onstrated in perception in the sensory-tonic 
research conducted by Werner and Wapnet 


(1952) and Witkin (1949). It makes sense 


that the host of physical and cognitive stimu- 
lations leading to self-world differentiation 
affects self-perception and perception of the 
the environment. Weak and penetrated body 
images projected onto the world can lead to 
the perception of things as fuzzy, fluid, and 
open to invasion and insult. Strong body 
image boundaries should logically lead to per- 
ceptions of the world in which contours are 
articulated and resistant to stress. It is sti 
not known what mediational processes convert 
body image status into Rorschach Inkblot 
perception. This teasing out of the processes 
involved should be the major task of those 
psychologists interested in the psychosomatic 
basis of perception. 
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SUICIDE AND NATURAL DEATH IN A STATE 
HOSPITAL POPULATION: 


A COMPARISON OF ADMISSION COMPLAINTS, MMPI PROFILES, 
AND SOCIAL COMPETENCE FACTORS 


MILTON R. RAVENSBORG? anp ADELINE FOSS 
Fergus Falls State Hospital, Minnesota 


Completed suicides in a state hospital were contrasted with natural deaths and 
inpatients on admission complaints, MMPI profiles, and social competence 
factors. No significant differences were found among the groups on patterns of 
admission complaints. For all three groups, depression was found to be the 
most frequent admission complaint. Neither MMPI profiles nor a recently 
developed suicide threat scale served to discriminate suicides from the com- 
parison groups. Highly significant differences were found on social competence 
variables, with suicides tending to be younger, more intelligent, and better 


educated than comparison groups. 


A review of the literature suggests that 
there are three major approaches to the study 
of suicide: the clinical, the psychometric, and 
the sociological (Farberow, Shneidman, & 
Neuringer, 1966). Each of these approaches 
has yielded valuable insights to the concomi- 
tants of suicide. However, as Kahne (1966) 
pointed out, no approach has provided a 
royal road to understanding; each is a “style 
of explanation.” 

The clinical approach, based on the symp- 
toms and personality traits of identified sui- 
cidal cases, has resulted in such conclusions 
as “depression is the major suicidal illness 
[Shabshin, 1965].” However, Eisenthal, Far- 
berow, and Shneidman (1966), after a de- 
tailed follow-up of suicide status patients of 
a Veterans Administration Hospital, reported, 
“Our data did not show that the severity 
of depression was related to suicide potential- 
ity. Patients with psychotic depressions were 
less suicidal than most other psychotics and 
neurotics [p. 988].” 

The psychometric approach, based on test 
items and other indicators produced in the 
course of psychological testing, has resulted 
in scales which are alleged to discriminate 
between suicidal persons and others (e.g., 
Farberow & Devries, 1967; Holzberg, Cohne, 
& Wilk, 1951). However, such scales and 


1 Requests for reprints should be sent to Milton 
R. Ravensborg, Fergus Falls State Hospital, Fergus 
Falls, Minnesota 56537. 
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indexes fail sometimes to operate when ap- 
plied to new populations, or in settings which 
differ from those in which such indexes were 
originally validated. 

The sociological approach, based on social 
and economic factors presented by suicidal | 
persons and others (e.g. Farberow et al, 
1966) has so far had little impact on the 
mainstream of psychiatric thought about sui- 
cide. Kahne (1966) urged suicide researchers 
not to concentrate solely on predisposing 
factors within the patient, and chided physi- 
cians for their disregard of sociological and 
anthropological contributions. j 

Because clinical, psychometric, and socio- 
logical experts tend to prefer their own styles 
of explanation, one rarely encounters studies | 
which view suicidal phenomena from all three i 
standpoints. As a result, it is impossible 10 
ascertain, within any given population; 
whether the clinical symptom picture, 
psychometric profile, or the background 0 
socioeconomic factors is the most useful i 
discriminating suicidal patients from contro 
groups. na 

One of the few investigations of a 
which used the three approaches (clini 
psychometric, and sociological) at oo 
that of Simon (1950), who analyzed & sani 
of 50 veterans who attempted suicide. 
ever, Simon failed to employ a Te ‘he 
control group (e.g., other hospitalized ‘nf 
erans). Thus his results, while interes!” 


SUICIDE AND NATURAL DEATH 


did not lead to indexes which would help 
in identifying potential suicides. 

The present study was undertaken to learn 
whether the patterns of admission complaints, 
psychometric (MMPI) profiles, or social 
competence factors were most efficient in 
discriminating suicides from other psychiatric 
patients. 


METHOD 


From Fergus Falls State Hospital records, the 
complete roster of on-books psychiatric patients who 
had died in the period January 1, 1954, through 
June 30, 1967, was obtained (N =275). On the 
basis of death certificate information, this roster 
was divided into two groups, patients who died by 
suicide (W =27) and patients who died of natural 
causes, The list of patients who died natural deaths 
was then scrutinized, and the two largest sub- 
groups, patients who died from circulatory and 
respiratory diseases (N = 173), were culled and com- 
bined as the natural death population. The third 
group in the investigation, regular psychiatric pa- 
tients, was the current in-hospital population of a 
geographical psychiatric unit (WV = 109). 

Psychometric files were obtained for all Ss in the 
three populations, for whom such files were avail- 
able. MMPIs and Shipley-Hartford scales were 
available for 23 (859%) of the suicides (15 males, 8 
females) ; 23 (13%) of the natural deaths (15 males, 
8 females); and 60 (55%) of the inpatients (28 
males, 32 females). It should be noted that these 
percentages of “testable” Ss are significantly different 
(x? = 116.13, p < .001). This finding will be treated 
in the Discussion section. 

Because of the small Ns in the two death groups, 
males and females were combined and, to enable use 
of the profile analysis of variance (Block, Levine, 
& McNemar, 1951), a random sample of 23 inpatients 
was drawn. 

MMPI profiles were tabulated separately for the 
three groups, combining males and females. (Subse- 
quent comparisons are restricted to “testable” Ss: 
those for whom all types of data considered in this 
report were available.) Group profiles were then 
analyzed by analysis of variance. In addition, a 
recent Suicide Threat scale (Farberow & Devries, 
1967) was scored on the three groups. 

Admission diagnosis, based on the American Psychi- 
atric Association’s two-digit code (American Psy- 
chiatric Association, 1952) was obtained for each S, 
and the frequency of each diagnostic type was tabu- 
lated for the three groups. Frequency arrays were 
then compared by the chi-square test of independence. 

Median time-in-hospital was calculated for each of 
the three groups and the groups were compared 
with each other by the extension of the median test 
(Siegel, 1956). 

A schedule of admission complaints was derived 
by perusing admission records of the 109 inpatients 
and listing every admission complaint found in any 
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patient’s file. Individual patient schedules were then 
summarized for this group to obtain the frequency 
of each separate complaint. All complaints which 
occurred five or more times in the population of 109 
were included in the final schedule, which contains 
34 admission complaints. 

Using the admission complaint schedule, admission 
notes in the charts of the 23 “testable” suicidal 
patients and the sample of 23 “testable” natural 
deaths were analyzed. The frequency of each of the 
34 complaints in the two deceased populations was 
enumerated. Likewise, the frequency of each com- 
plaint was determined for the random sample of 
23 inpatients. Rank orders of the admission com- 
plaints were determined (Table 1), and Kendall’s 
Coefficient of Concordance (Siegel, 1956) was com- 
puted to determine the relatedness of the rank orders, 

Social competence factors were coded for the three 
populations on the basis of the Zigler and Phillips 
(1961) scale (Table 2). The social competence scale 
has six factors (age, intelligence, education, occupa- 
tion, employment history, and marital status). Each 
factor is scored 0, 1, or 2, with 2 representing higher 
social competence. In the present study, educa- 
tion was regrouped, with less than high school 
diploma =0; high school diploma = 1; education 
beyond high school = 2. The portions of each popu- 
lation which fell into the three levels of each social 
competence factor were expressed as percentages. 


RESULTS 
Diagnosis 

The various diagnostic types (based on the 
American Psychiatric Association’s two-digit 
code) were represented in about equal propor- 
tions in the three groups (x? = 2.83, p > .50). 
About half of the patients in each group had 
been diagnosed as suffering from some form 
of schizophrenic reaction (Table 3). 

The median time-in-hospital for inpatients 
was 22 months (range: 4-366); for suicides, 
17.5 months (range: 6-231); for natural 
deaths, 23.0 months (range: 1-356). The 
overall median for the three groups, combined, 
was 23.0 months. The extension of the median 
test (Siegel, 1956) resulted in a chi-square of 
34 (ns), implying that there were no sig- 
nificant differences among median lengths-of- 
stay for the three groups. 


Admission Complaints 


There was no significant difference in rank 
order of admission complaints among the 
three populations of suicides, natural deaths, 
and psychiatric inpatients (Table De 
Kendall’s Coefficient of Concordance (Siegel, 
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1956) yielded a W of .63, equivalent to chi- 
square 62.37 (p < .001). Of particular inter- 
est is the fact that “depressed” was the 
top-ranking complaint for all three groups. 


MMPI Profiles 

The analysis of variance of MMPI profiles 
yielded an F = 2.09. Following Greenhouse 
and Geisser (1959), degrees of freedom was 
adjusted from 28 and 924 (Interaction X 
Residual) to 2 and 66 (Groups X Individu- 
als) because of the lack of independence of 
the scale variables. The obtained F was not 
significant (p > .05). Thus, on the basis of 
present evidence, it does not appear feasible 
to distinguish suicides from other patients on 
the basis of MMPI profiles. 
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Application of the Suicide Threat scale to | 
MMPI answer sheets of the three groups of 
Ss yielded means of 21.65 for suicides and 
22.93 for other patients. Analysis of variance 
included a breakdown by sexes. F ratios for 
main effects did not achieve significance, im- 
plying that suicides, as a whole, did not 
differ from other patients on this scale; 
neither did males, as a group, differ from | 
females. The Sex X Patient Type interaction, 
however, was significant (p < .025), due to 
the fact that female inpatients achieved the 
highest mean of all. These results suggest that | 
the Suicide Threat scale was of little value in 
differentiating the state hospital population. 


TABLE 1 
RANKINGS OF ADMISSION COMPLAINTS 
Complaint Inpatient Suicide Natural death 
pote SE 

Depressed 1 1 | 
Suspicious 2 9.5 | 
Hallucinations 3 9.5 
Delusional 4.5 3.5 

Nervous, overactive | 4.5 18 

Aggressive and delinquent behavior 6 6.5 

Abusive and angry 7 9.5 
Preoccupations 8 26.5 

Seclusive 9 18 
Threatened injury to others 10 13.5 
Insomnia 11 2 

Loss of interest 12 22.5 
Threatened or attempted suicide 18 6.5 
Withdrawn 14 13.5 
Disoriented 15.5 31.5 

Somatic complaints 15.5 5 

Confused 17 9.5 

Overly dependent, unable to care for self 19 22.5 

Silly 19 31.5 

Violent emotional outbursts 19 26.5 
Forgetful 19 22.5 

Feelings of persecution 22 26.5 

Fearful 23.5 13.5 
Excessive drinking 23.5 18 

Bizarre behavior 25.5 31.5 

Feelings of rejection 25.5 31.5 
Inferiority feelings 27.5 18 | 
Guilt feelings 27.5 13.5 | 
Abnormal sex behavior 29 31.5 | 
Inability to work 30.5 18 

Irritable 30.5 22.5 

Moody 32 26.5 

Phobias 33 31.5 

Anxious 34 3.5 


Note.—Kendall’s Coefficient of Concordance of rank orders significant beyond the .001 level (x* = 62.37). 


SUICDE AND NATURAL DEATH 


Social Competence 

Considerable variance among the three 
populations was present in the tabulation of 
social competence factors (Table 2). All chi- 
square comparisons were significant beyond 
the .001 level. 

In regard to age, it was evident that in- 
patients tended to be somewhat older than 
suicides—18% of the suicides were in the age 
group of under 25, whereas only 5% of the 
inpatients and none of the natural deaths 
fell into this age bracket. 

As to intelligence, 27% of the suicides were 
above IQ 115, whereas only 8% of the in- 
patients and none of the natural deaths had 
tested at this intellectual level. 

Fourteen percent of the suicides had re- 
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ceived education beyond the high school 
diploma, while only 3% of the inpatients 
and 9% of the natural deaths had studied be- 
yond high school, On the other hand, 94% of 
the inpatients and 82% of the natural deaths 
had failed to complete high school. 

In regard to occupation, the vast majority 
of all three populations were coded at the 
unskilled or semiskilled levels. However, 9% 
of the suicides had held jobs at the profes- 
sional and managerial level, whereas only 1% 
of the inpatients, and none of the natural 
deaths, had progressed to such top-level jobs. 

With respect to employment history, more 
of the natural deaths had been usually un- 
employed and fewer had been employed on an 
irregular basis. There was little evidence, how- 
ever, that suicides could be differentiated from 


TABLE 2 


SocraL®COMPETENCE-CODING SYSTEM AND 
SUICIDES, AND NATURAL DEATHS 


PERCENTAGE OF INPATIENTS, 
IN EacHn CATEGORY 


Populations" 
Variables Code 
IP S NS 
Age 
24 and below o 2 As 2 
25-44 : z D 5 
45 years and above 2 a 38 i 
Intelligence 
84 or less 9 ee s a 
85-115 ; A 3 a 
116 and above 2 $ a i 
ain 
one or some grades includin; ungraded or special 
classes, and some high aes o mS x > 
High school diploma, no more 1 ° z K 
Education beyond high school: some college, or 
vocational training 2 $ i i 
Occupation 
Unskilled or semiskilled p ul Es i 
Skill ed, service, clerical, sales £ a m a 
Professional and managerial 2 ; 3 i 
Employment history 
peal unemployed 2 ie th wk 
easonal, fluctuating, frequent shifts, or art-time 
employment ool ned j 20 f 
Regularly employed B s $ 
Marital status 
Single 4 r i 22 
Separated, divorced, remarried, or widowed 1 16 P 35 
Single continuous marriage 2 2 £ 


aote A Nrs = 23. IP = Tapatlents, S = Suicides, ND = 
-squares of the percentages 
factors ranged from 17.47 to 66.10, all significant beyond -001. 


Natural deaths. 
of the three populations falling into each of 


the three levels of the six Social Competenc 
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TABLE 3 


FREQUENCY or DIAGNOSTIC TYPES IN INPATIENTS, 
SUICDES, AND NATURAL DEATHS 


Group 
Diagnostic type 
IP S ND 

Chronic brain syndrome 1 2 bi 
Psychotic disorders 

Affective reaction Se ap 1° 

Schizophrenic reaction 11 12 10 
Psychoneurotic disorders fe 3 4b 
Personality disorders 3 4 3 
Mental deficiency 2 0 0 

Total 23 23 23 

Note.—IP = Inpatients, S = Suicides, ND = Natural 


deaths, x? = 2.83, p > .50. 
a Includes three cases of depressive reaction, 
b Includes two cases of depressive reaction. 
© Includes one case of depressive reaction. 


inpatients or natural deaths on the basis of 
employment history. 

As to marital status, 75% of the inpatients 
were single, whereas only 23% of the natural 
deaths had failed to marry. Forty-six percent 
of the suicides has had a single continuous 
marriage, as did 45% of the natural deaths. 
By contrast, only 9% of the inpatients had 
been steadily married to one partner. 


Discussion 


Because of the very low percentage of 
“testable” natural deaths, and the relatively 
low percentage of inpatients whose files con- 
tained MMPIs and Shipley-Hartford scales, 
the conclusions of this study should probably 
not be generalized to all members of the 
inpatient and natural death populations. But, 
inasmuch as 85% of the suicides had been 
tested, the obtained mean MMPI profile, 
pattern of admission complaints, and social 
competence factors might be considered repre- 
sentative of the suicide group. 

The finding that a significantly larger per- 
centage of suicidal patients had been found 
intact enough to complete MMPIs in a group- 
testing situation adds support to the infer- 
ence, based on social competence comparisons, 
that suicides tend to be more intelligent and 
better educated (as well as younger) than 
inpatients and natural deaths. 
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On the basis of the present investigation 
it is evident that no pattern of admission 
complaints, let alone the single complaint, 
“depression,” is particularly useful in identi- 
fying potential suicides among psychiatric 
patients, 

Likewise, the MMPI profile appears to add 
nothing to discrimination of self-destructive 
patients as a class. A comparable conclusion, 
based on multiple administrations of the 
MMPI to five suicidal patients, was recently 
published by Devries and Shneidman (1967), 1 
The authors speculated that “the establish- 
ment of suicidal MMPI profile syndromes 
can best be accomplished if persons with 
homogeneous rather than . . . heterogeneous 
syndromes are grouped together for compari- | 
son purposes [p. 401].” | 

However, the worth of Kahne’s (1966) 
admonitions to look at sociological factors in 
suicide research was borne out by the finding 
of significant differences in social competence 
between suicides and other state hospital pa- 
tients. It is not, however, immediately ap- 
parent why the suicides in this population 
should present a different pattern of back- 
ground factors from those of inpatients and | 
natural deaths. 
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FEMALES’ AFFECTIVE RESPONSES TO READING 
EROTIC LITERATURE 


DONALD L. MOSHER ! 
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Ohio State University 


The affective states of sexual arousal, guilt, anxiety, and 11 Nowlis mood fac- 
tors were studied in 72 college females before and after reading either an 
erotic or an academically oriented literary passage, with a female E either 
present or absent, and as a function of the personality disposition of sex guilt. 
Females who read the erotic passage showed a significant increase in sexual 
arousal. High sex guilt females significantly increased their affective state of 
guilt when they read the erotic passage in comparison to the females in all 
of the other cells. Anxiety increased as a function of reading the erotic passage 
in the presence of E. Conceptual distinctions between sexual behavior and 
sexual arousal, between guilt as personality disposition and as state, and between 
affective states of fear and guilt were discussed. 


Kinsey and his associates (Kinsey, Pome- 

roy, Martin, & Gebhard, 1953) in their study 
of sexual behavior in the human female 
pointed out major sex differences in response 
to erotic stimuli. In their surveys which in- 
cluded approximately 4,000 males and 5,700 
females, they found that the males were more 
responsive than the females to 29 of 33 cate- 
gories of stimuli that were potentially erotic. 
One of the three classes of sexual stimuli to 
which females were as responsive as males 
was the reading of erotic literary material. 
In their sample of 5,699 women, 16% re- 
ported definite and/or frequent erotic re- 
‘sponse to reading literary material, 44% 
reported some erotic response, and 40% re- 
ported never experiencing an erotic response 
to reading literary material (Kinsey et al., 
p. 670). The present experiment investigated 
the affective responses of college women to 
reading erotic literary material as a function 
of individual differences in the personality 
disposition of sex guilt and response to the 
presence of a female E. Reports of increased 
states of sexual arousal, guilt, and anxiety 
were the major dependent variables. 

The major hypothesis under investigation 
was that high sex guilt females will report 
a greater increase in the affective state of 
guilt after reading erotic literature than will 
low sex guilt females. Violation of prohibi- 


1 Requests for reprints should be sent to Donald 
L. Mosher, Department of Psychology, University 
of Connecticut, Storrs, Connecticut 06268. 


tions concerning sexual behavior are expected 
to lead to the experience of the state or feel- 
ing of guilt in guilt-disposed individuals. Okel 
and Mosher (1968) demonstrated that males 
scoring above the median on a measure of 
guilt over hostility reported experiencing 4 
significantly greater affective state of guilt 
after they had verbally attacked a stranger 
who became quite upset than did low-hostility- 
guilt males. The high-guilt males also de- 
creased significantly in their report of positive 
affects after guilt over aggression had been 
induced in the experimental situation. Also, it 
seemed plausible that high sex guilt females 
may report less sexual arousal than low sex 
guilt females. Perhaps, those women in 
Kinsey’s sample who reported no sexu 
arousal to reading erotic literature expe 
enced guilt-produced conflict which inhibited 
the development of erotic feelings and 
fantasies. i 
This experiment also explored the affective 
reaction of the women as a function of the 
presence or absence of a female E. The pres: 
ence of an observer might tend to inhibit the 
arousal of sexual feelings and fantasies or U 
increase anxiety or fear. The purpose 0° | 
manipulation was to add further empiri 
evidence to support the feasibility of ma 
a conceptual distinction between fear an 
guilt. Fear is a situational expectancy for pun 
ishment that may be increased by the re 
ence of another person who might potentia y 
censure the individual for sex-related behav 
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Responses TO EROTIC LITERATURE 


jor. Guilt is a generalized expectancy for 
self-mediated punishment for violating inter- 
nalized standards concerning sexual behavior. 
A person feels guilty when he violates his 
moral standards regardless of the presence 
or attitudes of others. Mosher (1965) found 
that males who were low on sex guilt were 
more influenced by situational cues concerning 
censure from an Æ than were high sex guilt 
males in responding to a perceptual defense 
task, Galbraith and Mosher (1968) found 
that both situationally manipulated expectan- 
cies for Æ censure and sex guilt predicted the 
inhibition of sexual associative responses to 
double entendre words in a word association 
task. In this experiment it was anticipated 
that females will report more of an increase 
in fear or anxiety—but not in guilt—when 
they read erotic literature in the presence of 
E than when alone. 


METHOD 
Procedure 


The Ss were 72 females enrolled in the introductory 
psychology course at the Ohio State University. The 
Ss were seen twice. Three groups totaling 85 Ss were 
administered the female form of the Mosher Forced- 
Choice Guilt Inventory. The Ss were subsequently 
divided at the median of 18 to form high- and low- 
guilt groups and randomly assigned to four experi- 
mental treatments in which they participated during 
a second individual experimental session. The Ss were 
partitioned into eight cells with nine Ss in each cell 
on the basis of high or low guilt, reading an erotic 
or an academic literary passage, and E presence or 
absence. The E was also female. All Ss were given 
several opportunities to withdraw from the experi- 
ment, and one S declined to read the erotic passage. 
Twelve Ss of the original 85 were not included in 
the final sample because they either missed their 
individual appointments or improperly filled out 
their inventory or checklists. 

After arriving for her individual experimental 
session, all Ss completed the Nowlis Mood Adjective 
Check List (MACL) which also included seven 
guilt adjectives and seven sex-arousal adjectives as 
a pretest of their affective state at that moment. 
Then they were instructed, “Now please read the 
following paragraphs. When you have finished again 
fill out the MACL, being particularly attentive to 
your current feelings.” The Ss were then given either 
an erotic or an academic passage to read. The E then 
indicated whether she would be present or absent 
by saying either, “I'll wait here until you are fin- 
ished with both” or “I'll wait outside. When you 
are finished with both just call me in.” After the 
reading of the passage and the posttest MACL were 
completed, the Ss were debriefed and thanked. 
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The erotic passage was Chapter 72 of the novel 
Eternal Fire by Calder Willingham (1964). The 
passage describes the seduction of a virginal girl by 
a young man. Sexual petting and foreplay lead to 
mounting sexual arousal in the girl, and her conflicts 
about sexual intercourse give way until she actively 
encourages the young man to have sexual intercourse 
with her. The literary style is vivid and realistically 
descriptive. The passage was selected because of its 
highly erotic nature and because it would seem to 
provide the college female with a heroine with which 
to identify. 

The academically oriented passage was taken from 
pages 714-721 of the text Psychology by Wickens and 
Meyer (1963). The passage discusses the associative 
cortex and its functions. The passage was taken from 
the text in use in Ss’ introductory psychology course, 
but it was unlikely that the Ss would have progressed 
that far in their reading of their text book. Both 
passages contained the same number of words, but, 
of course, the style of writing as well as the content 
was very different. 

Sex-Guilt. The Sex-Guilt subscale (Mosher, 1968) 
of the female form of the Mosher Forced-Choice 
Guilt Inventory was constructed from an item pool 
of 276 completions to the Mosher Incomplete Sen- 
tences Test (Mosher, 1961). An internal consistency 
item analysis and the elimination of items pulling 
greater than 75% of the responses in a single direc- 
tion reduced the item pool to 110 items. From this 
reduced item pool 39 forced-choice sex guilt items 
were selected. The items are weighted along a guilt 
dimension from very guilty (+2), guilty (+1), non- 
guilty (—1), to very nonguilty (—2). The items 
have a possible range from +64 to —61 and in this 
sample the obtained range was from +50 to —49. 
Sample items are: 


Sex relations before marriage . . - 
A. should be permitted. (SG — 2) 
B. are wrong and immoral. (SG +2) 


If I had sex relations, I would feel . . . 
A. very dirty. (SG +2) 
B. happy and satisfied. (SG — 2) 


The Sex-Guilt subscale has a corrected split-half 
reliability of .95. A multitrait-multimethod matrix 
analysis and a factor analysis based on the responses 
of 62 female Ss to three measures of three aspects of 
guilt, including the forced-choice measure of sex 
guilt, provided evidence for convergent and discrimi- 
nant validity, The forced-choice sex guilt subscale 
(FCSG) is not correlated significantly with either 
the Marlowe-Crowne Social Desirability scale or the 
Edwards Social Desirability scale, The FCSG sub- 
scale is viewed as measuring the personality disposi- 
tion of guilt rather than the feeling state of guilt. 
Nowlis MACL. The Nowlis MACL, which is com- 
posed of a number of adjectives applicable to 
momentary moods or affective states, acts as a series 
of prompts for statement of mood on the part of 
the individual. Everyone learns to label his con- 
scious mood states with thousands of adjectives, 
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and Nowlis believes these descriptive labels prompt 
a literal “self-report” which is a trustworthy index 
of present affective states. The aim of Nowlis’ re- 
search has been to “induce definite changes in mood 
with clear-cut operations and to determine the rela- 
tionship between these operations and verbal reports 
of mood [Nowlis & Green, 1964, p. 2].” The MACL 
has proven useful in a number of studies which have 
evaluated changes in mood as a function of such 
variables as stress, arousal, and drug administration 
(Cameron & Wendt, 1964; Lazarus & Alfert, 1964; 
Lazarus, Speisman, Mordkoff, & Davison, 1962; 
Nowlis, 1965; Nowlis & Green, 1964; Nowlis & 
Nowlis, 1956; Robbins, 1962). 

The form of the MACL used in the present study 
was composed of 49 adjectives representing 12 fac- 
tors isolated in a study of mood change of films 
(Nowlis & Green, 1964), Nowlis and Green report 
retest reliability coefficients for individual words 
ranging from .52 to .80. They conclude that “social 
desirability status of a word has very little, if any, 
effect on how it is checked when the subject is asked 
to report how he feels at the moment he reads each 
word [p, 2].” In addition to the MACL adjec- 
tives, seven guilt and seven sexual arousal adjectives 
were employed. The adjectives used to measure the 
state of sexual arousal were titillated, sensuous, 
aroused, tantalized, passionate, excited, and hot. The 
seven guilt adjectives, which had been used originally 
by Haefner (1956), were contrite, repentant, ashamed, 
blameworthy, guilty, conscience stricken, and re- 
morseful, Okel and Mosher (1968) provided con- 
struct validational evidence for the usefulness of 
these adjectives as a measure of the momentary state 
of guilt in their experiment on changes in affective 
state as function of induced guilt over aggression. 
Each adjective could be responded to as definitely 
applies (4), slightly applies (3), undecided (2), and 
definitely does not apply (1). 


RESULTS 


Sexual arousal, guilt, and the 12 affective 
factors of Nowlis were analyzed by a four-way 
analysis of variance with a repeated measure 
on one of the factors. Sex guilt (A), type of 
passage (B), and Æ’s presence or absence (C) 
were the between-group factors and the pre- 
test and posttest adjective checklist (D) was 
the repeated factor. 

Table 1 presents the results of the four-way 
analysis of variance of sexual arousal.? There 
was a significant interaction (B X D) between 
reading the sexual passage and an increase in 
sexual arousal. The Ss who read the erotic 
passage reported an increase in sexual arousal 
which was significantly different, using the 


2 The authors wish to acknowledge the assistance 
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TABLE 1 


ANALYSES OF VARIANCE OF THE AFFECTIVE STATES oy 
SEXUAL Arousal, GUILT, AND 
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Newman-Keuls test, from their pretest level 
and both the pretest and posttest levels of the 
Ss who read the academic passage. The sig- | 
nificant main effect of the repeated measure 
(D) merely reflects this overall increase in 
sexual arousal as a function of the significant 
B X D interaction. 

Table 1 also reports the results of a fout- 
way analysis of variance of the guilt adjec- 
tives. The major hypothesis was suppo! 
since the predicted interaction (A X B X 
between sex guilt, type of passage, & 
change in the state of guilt was significant, 
The mean of the second report of feelings 0 
guilt for the high sex guilt Ss who read the 
erotic passage was significantly higher 


the means of the other seven cells including 


their own pretest scores. From Table 1, i 
be seen that there were three significant i 
factor interactions in addition to the sign! 


cant triple interaction. The A X D and 


3 All significance tests between cell means ake 
significant F ratios were obtained used the Newni 
Keuls method. All significance levels report? 
two-tailed. 
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D interactions reflect the effects of the A X 

BX D interaction. In the A x D interaction 

high sex guilt Ss reported significantly more 

feelings of guilt on the second MACL than 

did Ss in the other three cells including their 
own pretest scores. Also, high sex guilt Ss 

checked more guilt adjectives on the first ad- 

ministration of the MACL than did the low 
"sex guilt Ss at either time. In the BXD in- 
teraction Ss who read the erotic passage re- 
ported a significantly higher mean on the guilt 
factor on the second administration of the 
MACL than did Ss in the other three cells. 
The significant C X D interaction is a func- 
tion of a significant increase in the second 
MACL over the pretest MACL for Ss who 
read either passage in the presence of E. Note 
that this interaction was not specific to either 
the erotic passage or to sex guilt. The signifi- 
cant main effect on the repeated measure was 
a function of an overall decrease in reported 
guilty state on the second administration of 
the MACL. While the high sex guilt Ss who 
read the erotic passage increased in guilt feel- 
ings, the other groups all declined in reported 
guilty state so that there was a significant 
overall decrease in the mean score on guilty 
state from the first to the second administra- 
tion of the MACL. 

Table 1 also contains the analysis of the 
state of anxiety. The significant B X C X D 
interaction for anxiety and the absence of a 
BX Cx D interaction for guilt support the 
feasibility of the conceptual distinction be- 
tween fear and guilt. The Ss who read the 
erotic passage with Æ present showed an in- 
Crease in the mean state of anxiety which was 
significantly greater than that of Ss in the 
other seven cells which includes their own 
first MACL anxiety scores. However, there 
was a significant increase in the state of anxi- 
ety in Ss who read the erotic passage when E 
Was not present as well. Note that it was the 
case, however, that Ss reading the erotic pas- 
Sage with E present reported more of an in- 
crease in anxiety than Ss reading the erotic 
Passage with E absent, The significant B main 
effect indicated that Ss reading the erotic pas- 
sage were more anxious than Ss reading the 
academic passage. The D main effect indi- 
cated greater anxiety in the second MACL. 
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Both of these main effects are secondary in 
interpretive significance to the B x C X D 
interaction. 


Other Affective States 


The results on the other 11 Nowlis mood 
factors will be briefly summarized. Surgency 
(p < 05), Elation (p < 01), and Social Af- 
fection (p< .05) all decreased significantly 
on the second administration as simple main 
effects, For the Egotism factor, there was a 
significant B X C X D interaction (p < .05) 
which is uninterpretable since high sex guilt 
Ss who were to read the neutral passage, but 
who did not know it, had a higher mean Ego- 
tism score than did three other cells including 
high sex guilt Ss who were unaware that they 
were about to read the erotic passage. Ego- 
tism (p < 01) decreased significantly on the 
second administration of the MACL. Then, 
these four positive affective factors declined as 
a function of participating in the experiment. 
On the Aggression factor, there was a signifi- 
cant B x D interaction (p < .05) such that 
Ss who read the erotic passage with Æ present 
were most angry while Ss who read the aca- 
demic passage with Æ present were least 
angry. On the Sadness factor, there was a sig- 
nificant BX D interaction (p< 01) such 
that Ss who read the erotic passage reported 
more sadness on the second MACL than Ss in 
the other three cells. There was also a signifi- 
cant C X D interaction (p < .05) such that 
Ss reported more sadness on the second MACL 
when E was present. A simple D main effect 
(p < 01) reflected the increase in sadness on 
the second MACL. On the Concentration fac- 
tor, there was a significant B X D interac- 
tion (p < .05) such that Ss who read the aca- 
demic passage increased in concentration while 
Ss reading the erotic passage decreased in con- 
centration on the second MACL. On the Non- 
chalance factor, there was a significant C X 
D interaction (p < .05) such that Ss who 
read either passage with Æ present reported 
feeling less nonchalant on the second MACL; 
the significant D main effect (p< 01) re- 
fects this overall decrease in nonchalance. On 
the Fatigue factor, there was a significant 
AXBXC interaction (p < 05) such that 
high sex guilt Ss are more fatigued with E 
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present while reading the erotic passage and 
more fatigued with E absent while reading 
the academic passage while the exact reverse 
holds for the low.sex guilt Ss. Fatigue de- 
creased on the second MACL as a main effect 
(p < .01). On the factor of Vigor, there is an 
uninterpretable A X B x D interaction (p < 
.05) in which high sex guilt Ss report a greater 
feeling of vigor on the first MACL preparatory 
to reading the academic passage than is the 
case in three other cells including high sex 
guilt Ss who are yet to be told they will be 
reading an erotic passage. There were no sig- 
nificant differences on the Skepticism factor. 


Discussion 


The data on sexual arousal indicated that 
females who read the erotic passage reported 
greater sexual arousal than did the females 
reading the academic passage. The anticipated 
inhibitory effect on sexual arousal of the per- 
sonality disposition of sex guilt was not sub- 
stantiated; neither did the presence of the 
female Æ inhibit sexual arousal for those fe- 
males who read the erotic passage. It may be 
that these variables require a more complex 
conceptual structure before relationships, if 
they exist, can be demonstrated. It may be 
that sex guilt inhibits sexual arousal when 
sexual cues are minimal, but there is a 
threshold of erotic cues such that sexual 
arousal will occur regardless of the female’s 
disposition toward sex guilt. If this is the 
case, however, a simple dimension of erotic 
realism, according to the Kinsey (1953) data, 
will not suffice for females. Love and sex are 
more inextricably intertwined in females than 
in males. In the erotic passage that was used, 
when it is taken out of the novel’s context, 
the sexual descriptions were accompanied by 
indications of apparent tenderness and con- 
cern. Further research is required to discover 
the precise roles of erotic realism and the 
male’s affection, tenderness, and commitment 
on the sexual arousal of females reading 
erotic literature. Another possible explana- 
tion of the failure to find either guilt over 
sex or E’s presence inhibiting sexual arousal 
might be that these variables inhibit the ex- 
pression of sexual behavior but not sexual 
fantasies or sexual arousal. If 40% of the 
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females in the Kinsey sample report never 
experiencing erotic arousal while reading 
literary passages, is it because erotic litera. 
ture fails to arouse them or is it because they 
avoid reading erotic literature? 

High sex guilt females who read the erotic 
passage reported an increase in their feeling 
of guilt which was significantly greater than 


their pretest state of guilt and that of the f 


remaining females at any time. While high sex 
guilt females were more likely to report feel- 
ing more guilty when they entered the experi- 
mental situation, they did significantly in- 
crease their feelings of guilt when they read 
the erotic passage. The affective state de- 
scribed as “feeling guilty” presumably follows 
transgressions of a moral code. Is the moral 
violation in this instance that of reading 
erotic literature as part of an experimental 
task? Despite repeated reminders that these 
Ss could decline to participate in this study 
at any time, only one chose not to continue 
after reading a few lines of the erotic pas- 
sage. Or, is the state of guilt engendered in 
females who are predisposed to guilt over sex 
when they experience sexual arousal by ac- 
tively participating in a sexual fantasy while 
reading the erotic passage? Further research 
is required before a definitive answer can be 


given. It does seem helpful to make conceptual 


distinctions between sexual behavior, thoughts 
and fantasies, and affects, and between guilt 
as an inhibitor of behavior and as an affective 
state which follows a sexual transgression 
which is either behavioral or cognitive-affec- 
tive. 

Females reported feeling significantly more 
“fearful,” “clutched-up,” and “jittery” after 
reading the erotic passage when E was pres: 
ent than did the other females in different 
treatments or than they had before they ™ 
the passage. This significant triple interaction 
and the failure to find any significant ee 
four-way interactions between high sex gu! 
reading the erotic passage, with E present, ae 
increases in negative affective states supp" 
the feasibility of making distinctions like thos? 
reviewed in the introduction between affe i 
states such as fear and guilt. Howevel 
should be noted that the females ee 
feeling an increase in guilt when Æ was pr 
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ent regardless of the type of passage read, and 
that the females were significantly more an- 
zious when they read the sexual passage with 
E out of the room than they were before or 
than were the females who read the academic 
passage. These are not crucial weaknesses to 
the argument, however, since the increase in 
guilt is not specific to reading the sexual pas- 
sage and the females who read the erotic pas- 
sage with E present were significantly more 
anxious than those who read the erotic pas- 
sage with Æ absent. 

The data from the other Nowlis mood fac- 
tors revealed that females have less positive 
affects after participating in an experiment of 
this sort. Reading erotic literature rather than 
academic literature, in addition to increasing 
sexual arousal and guilt, increases feeling of 
sadness while decreasing concentration. While 
they are reading either passage if E is pres- 
ent, in addition to feeling more anxious and 
guilty, the females feel more sadness and less 
nonchalance than if Æ waits outside. 
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OBJECT-DISCRIMINATION LEARNING SET ACQUISITION IN 
YOUNG CEREBRAL PALSIED CHILDREN IN RELATION TO 
TESTED AND RATED INTELLIGENCE * 


DOUGLAS PEARSON ° 
Pacific State Hospital 


Object-discrimination learning set formation was explored as a potential means 
of assessing the intelligence of children with neuromotor handicaps. Forty 
learning set problems were administered to the following groups, each con- 
taining six Ss, of 2- to 34-year-olds: cerebral palsied (CP) children who tested 
as retarded but.who were rated as not being retarded; cerebral palsied children 
who tested and were rated as being retarded; nonhandicapped retardates; non- 


handicapped children with average int 


elligence. The CPs who were rated as 


not retarded performed like the normals; both were significantly better learners 
than the other two groups. The discussion focused on the import of these 
findings for the future assessment of such children and the role that other 


variables play in such learning. 


The high degree of objectivity, standardiza- 
tion, reliability, and validity of the Stanford 
Binet and the WISC are qualities largely re- 
sponsible for their widespread use as intelli- 
gence tests for children. It is doubtful, how- 
ever, whether these qualities are retained 
when these tests are given to children who, in 
some way, are grossly different from those on 
which the test was standardized. One such 
group of children would be those, like the 
cerebral palsied, who have motor and/or 
sensory handicaps. In many instances such 
children lack the requisite motor and/or sen- 
sory skills for doing a wide variety of test 
items. For example, they often do not have 
sufficient motor coordination to manipulate 
objects in a manner consistent with the pur- 
poses of testing. Likewise, many of these 
children are not able to talk with the degree of 
facility needed to express themselves on 
verbal test items. Finally, it might be argued 
that because of their handicaps, these chil- 
dren suffer some amount of environmental 
deprivation and therefore do not bring the 
same relevant experiential background to the 


1This paper is based on a dissertation presented 
in partial fulfillment of the requirements for a PhD 
in Psychology at the University of California, Los 
Angeles. The author is grateful for the guidance and 
support of Wendell E. Jeffrey, major professor. 

2 Requests for reprints should be sent to Douglas 
Pearson, who is now at the University of Calgary, 
eee of Psychology, Calgary 44, Alberta, Can- 
ada. 


testing situation as do many of their non- 
handicapped peers. As a result of this, it 
would appear that the examiner is often 
placed in the position of having to guess if 
such a child’s failure to perform adequately 
on a test is indicative of an intellectual defi- 
cit, physical inability, or some combination of 
the two. 

Attempts at circumventing this problem 
have been made by devising new tests (egn 
the Columbia Mental Maturity scale); by 
relying on pictorial recognition tests, such as 
the Peabody Picture Vocabulary Test, that 
place a minimal demand on the motor s 
of the child (Dunn & Hartley, 1959); ot by 
combining a potpourri of “usable” test items, 
modified where necessary, from a variety 0 
existing intelligence tests for children (Haeus- 
sermann, 1952; Kogan, 1957; Taylor, 1959): 
These procedures have their drawbacks. M 
many instances they lack, to one degree oi 
another, the objectivity, standardization, 1 
liability, and validity of their more common- 
place counterparts. In other instances, 48 ne 
the use of pictorial recognition testi 
child’s performance may be “sabotaged” BY 
some of his more subtle problems st! i 
deficit in visual perception (Lacey, 1962) a 
less experience with his environment 1 com 
parison to most children his age. 4 fot 

In spite of these difficulties, the nee a 
intellectual assessment early in the life i 
children with neuromuscular handicaps 
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often imperative. For one thing, there is al- 
ways the concern on the part of the physician 
and the parent as to whether damage to the 
brain has resulted in impaired intellectual 
functioning. A more pressing reason for as- 
sessment in this case is that measured intel- 
ligence is an important factor in formulating 
future educational and social plans for such 
children. Here the gamut of choices is wide; 
they lie between placement in an institution 
for the retarded and the opportunity of stay- 
ing at home and attending a regular public 
school. Obviously such decisions have consid- 
erable moment for both the parents and the 
child and deserve to be made on the most 
accurate information that can be provided. 
In this investigation, the ability of cerebral 
palsied children to form object-discrimination 
learning sets was explored as an alternative 
to intelligence tests for assessing, on a gross 
level, the intellectual ability of such children. 
The primary reasons for adopting this alter- 
native were that (a) the ability to form ob- 
ject-discrimination learning sets has already 
been shown to discriminate between groups of 
nonhandicapped children of varying levels of 
intelligence (Ellis, Girardeau, & Pryer, 1962; 
Girardeau, 1959; Kaufman & Peterson, 
1958; Stevenson & Swartz, 1958); (b) the 
ability to perform on the learning-set task 
does not require speech or any but the barest 
amount of motor skill and, because of this, 
seemed like a technique that could readily be 
used with children with motor handicaps; 
and (c) it was felt that a measure of this 
kind of learning might ultimately prove to be 
a useful criterion for predicting how well such 
children might perform in a school setting. 
There is a vast amount of experimental lit- 
erature on the use of object-discrimination 
learning sets in this general context. Compara- 
tive studies, using this paradigm, have shown 
that the ability for such learning is a function 
of the rank of a species on the phylogenetic 
ladder. That is, a species ability to form 
learning sets will be superior to a species of 
lower phylogenetic rank, but inferior to a 
Species of higher phylogenetic rank, Thus far, 
the rat, the cat, the marmoset, the raccoon, 
the spider monkey, and the rhesus monkey 
have been tested on object-discrimination 
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TABLE 1 
DESCRIPTIVE STATISTICS OF THE Four GROUPS 


CA IQ 

Group N | 

M | SD| M | SD 

Normals 6 |32.7 | 4.6 |103.5| 5.4 
CP “rated-not- 

retarded” 6 |32.3 | 4.0 | 67.0| 4.1 

CP “rated-retarded” 6 | 32.7 | 6.0 | 60.0| 6.6 
Nonhandicapped 

retardates 6 | 31.0 | 4.4 | 61.0] 9.6 


Note.—CA is presented intmonths. 


learning sets and, in general, their ability for 
such learning has ranked them in accordance 
with their position on the phylogenetic scale 
(Cotterman, Meyer, & Wickens, 1956; Har- 
low, 1958; Koronakos & Arnold, 1957; Miles, 
1957; Miles & Meyer, 1956; Myers, Mc- 
Quiston, & Miles, 1962; Schrier, 1966; Shell 
& Riopelle, 1957; Warren & Barron, 1956; 
Wright, Kay, & Sime, 1963). 

Recent investigations with children below 
the age of seven have, as has already been 
mentioned, generally shown that the ability 
to form such sets is strongly related to classi- 
fications of measured intelligence (i.e., aver- 
age, moderately retarded, etc.). Results have 
typically shown that normals reach a higher 
level of performance and take fewer trials to 
do so than retarded children (Ellis, 1958; 
Ellis, Girardeau, & Pryer, 1962; Girardeau, 
1959; Harter, 1965; Stevenson & Swartz, 
1958). 


METHOD 
Subjects 


Four groups of six children, between 2 and 34 
years of age, were compared on their ability to form 
object-discrimination learning sets. Three of these 
groups were matched on IQ on the basis of their 
scores on the Cattell Infant Intelligence Scale; they 
contained children whose measured intelligence fell 
in the range of moderate to mild mental retarda- 
tion (ie., IQs between 50 and 75). Table 1 presents 
the descriptive statistics of the four groups. One of 
these three groups had retarded children who did 
not have any outstanding physical handicaps. The 
other two groups were comprised of cerebral palsied 
(CP) children whose physical handicaps quite con- 
ceivably interfered with their performance on a test 
Tike the Cattell or the Binet. One of these CP groups 
had children who were rated (see below) by nursery 
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school teachers and physical and occupational thera- 
pists (paramedical personnel very conversant with 
the children and how they functioned in a nursery 
school setting) as being retarded (the CP “rated- 
retarded” group), while the other group had children 
who were rated as not being retarded (the CP 
“rated-not-retarded” group). The fourth group was 
made up of nonhandicapped children with IQs in 
the average range of intelligence (ie. between 95 
and 110). 


Rating Scales 


Inasmuch as the major purpose of this experiment 
was to explore the use of learning sets as a gross 
measure of intellectual capacity that would be mini- 
mally influenced by the motor disabilities of CP 
children, it was necessary to have a second estimate 
of this ability that would be independent of both 
their handicap and their intelligence test scores. Such 
an estimate could then serve as a validity criterion 
against which to evaluate the performance of the CP 
Ss on the learning-set task. To this end a set of six 
rating scales was devised to be used by nursery 
school teachers and physical and occupational thera- 
pists. These scales were developed on the basis of 
the nursery school education literature (e.g. Read, 
1960) pertaining to the kinds of age-related behav- 
iors nursery school teachers might use in evaluating 
the intellectual and social development of their pu- 
pils. Five different age-related behavioral dimensions 
of this sort were developed into preliminary scales. 
They were: “Use of Play Material,” “Social Inter- 
action,” “Expressive Communication,” “Receptive 
Language,” and “Social Awareness.” A sixth scale 
asked the raters to rate each child as to his gross 
level of intelligence (ie., average, borderline, se- 
verely retarded, etc.). Each scale had five points 
representing the range of age-related competency in 
the skill. The midpoint of the scale represented those 
sets of behaviors expected of most children around 
the age of 24 to 3 years. The scale points were 
given descriptive labels. For example, the points on 
the “Expressive Communication” scale were (1) un- 
differentiated, (2) attention getting, (3) basic need 
communication, (4) social communication, and (5) 
interpersonal and fantasy communication. In addi- 
tion, each of these points was anchored by one or 
more behavioral referents which, where possible, did 
not rely on the motor skills of the child. As an ex- 
ample of this, the referents for the points on the 
“Expressive Communication” scale labeled “social 
communication” were “in some manner communi- 
cates social needs and desires” and “expresses more 
subtle emotions like jealousy, shame.” 

These scales, in a preliminary form, were sub- 
mitted to three highly experienced nursery school 
teachers for their suggested modifications. After the 
recommended changes were made, the scales were 
pretested by having three raters of the type to be 
used in the study independently rate each of six 
cerebral palsied children (not used in the study). 
The resulting reliability coefficients were between .76 


and .86 and were found to be significantly different 
from zero (p< .01). In addition, the cumulative 
ratings over the five dimensional scales for each child 
were checked to see how well they agreed with the 
rating of the child’s intelligence. The rank correla- 
tion coefficient for this match was .98 and found to 
be significantly greater than zero (< 01). 

In this study each cerebral palsied S was inde. 
pendently rated by two or more raters on the sit 
scales. Raters were asked to make their judgments 
on each scale separate from ratings on any of the 
other scales. Since each scale ran from 1 to 5, $ 
could receive a cumulative score ranging from 6 to 
30, Because a score of 17 was the midpoint in the 
range of possible scores, it was made the cutoff 
point for putting the CP S into the “rated-retarded” 
or the “rated-not-retarded” group. In this case, $ 
having a cumulative score between 6 and 17 were 
placed in the “rated-retarded” group while those 
having scores falling between 18 and 30 were placed 
in the “rated-not-retarded” group. 


Apparatus 


A Wisconsin General Test Apparatus (WGTA) 
was used to present the stimulus pairs to Ss, Since it 
had to be highly portable, the WGTA was made out 
of light material (1-inch-thick plywood) and was 
small enough to be easily transported and to fit on 
the top of any small table such as might ordinarily 
be found in a nursery school setting. 

The experimental stimuli were 100 commonplace 
objects, like small toys or eating utensils, and sol 
geometric blocks of various shapes, colors, and sizes. 
Tt was felt that the use of such objects would make 
the stimuli in each pair more easily discriminate 
from one another since they were likely to differ on 
a number of dimensions such as size, shapes color; 
texture, and “meaning.” In all, there were 100 such 
objects available for use in the study. Each objec 
was tagged with a number between 1 and 100. pi 
40 test pairs used in the experiment were obtain 
by drawing, in succession, 40 pairs of numbers 
between 1 and 100, from a table of random number 
with the restriction that no number could be draw 
more than once. In each of these 40 instances i 
objects corresponding to the numbers drawn forme 
a test pair. 


Experimental Procedure 


The initial phase of the procedure was given W 
to adapting S to the experimental situation $ 
shaping him into making the required response 
general, shaping was simply directed at atin 
to recognize that he could obtain a bit of T : 
ment (sugar-coated cereal) by displacing, in ww 
manner, one of two objects covering the £00 i 
on the discriminanda tray. Typically Ss 3 che de 
a few trials before they were able to make 
sired response. 

At this juncture the learning-set procedu 
begun. Forty object-discrimination learning ha 
lems were presented to each S using ê 
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regime (six trials per problem) with the left-right 
position of the reward object being varied from trial 
to trial according to a prearranged random sequence. 
On each trial of a given problem S was required to 
choose either by object displacement, head nod, or 
verbal identification the object which he thought 
had the reward under it. A noncorrectional pro- 
cedure was used throughout the presentation of the 
test problems. 

Primarily because of school schedules, 22 out of 
the 24 Ss were run on the experimental procedure on 
alternate days or three times a week. Some variation 
occurred here because of illness. Typically Ss were 
able to do about five learning-set problems a session 
so that it took an average of 3 weeks to complete 
the learning task. Two of the Ss were run on a daily 
schedule because they were available as Ss for a 
limited amount of time. In any event, it was felt 
that variations in scheduling experimental sessions 
was not critical because previous research on hu- 
mans (Wischner, Braun, & Patton, 1962) had shown 
that various levels of skill in forming learning sets 
are retained over a period as long as 6 months. 

The order of problem presentation, the trial se- 
quence of reward-object position, and the designa- 
tion of the reward object were randomized inde- 
pendently for each S. This was done to control for 
factors like position and object preferences or the 
inevitable variation in the difficulty level of the test 
pairs which could affect the experimental findings. 


RESULTS 


The basic response measure in this study 
was the number of correct responses on Trials 
2 through 6 over consecutive blocks of eight 
learning-set problems. In all, there were five 
such blocks of problems. Table 2 presents the 
mean number of correct responses on each 
problem block for the four comparison groups 
(Figure 1 depicts the same data in graphic 
form). A Group (G) X Problem Block (PB) 
tepeated-measures analysis of variance was 
done on these mean scores to test for differ- 
ence between the groups on the degree to and 
the rate at which they formed object-discrimi- 
hation learning sets. 


Amount Learned 


The finding of a significant F (F = 26.27, 
df= 3/20, p < .001) for the group effect on 
the analysis of variance indicates that the 
groups did differ in the degree to which they 
formed object-discrimination learning sets. To 
identify which groups differed in this regard, 
the group means on each problem block were 
analyzed using a Duncan’s multiple-range 
test. The results (p = .05: Ri = 7.46, Ra = 
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TABLE 2 


MEAN NUMBER OF CORRECT RESPONSES ON 
EACH PROBLEM BLOCK FOR THE FOUR 
COMPARISON GROUPS 


Problem block 
Group 
1 2 3 4 5 
Normals 28.50 | 34.33 | 33.33 | 35.00 | 35.35 
CP “rated- 
not-retarded” | 28.00 | 27.57 | 29.83 | 32.33 | 33.33 
CP “rated- 
retarded” 21.00 | 24.33 | 21.00 | 23.67 | 23.00 
Nonhandicapped 
retardates 20.00 | 21.00 | 19.67 | 23.50 | 20.17 


7.84, Rs = 8.07) showed the mean perform- 
ance of the CP “rated-not-retarded” group to 
be on a par with the normals by the end of 
the first block, Furthermore, both of these 
groups were found to perform at a level sig- 
nificantly above that of the CP “rated-re- 
tarded” and the nonhandicapped retarded 
groups on Blocks 3, 4, and 5. None of the 
other comparisons were found to be reliable. 

Examination of the learning curves for the 
four groups (Figure 1) suggested that the 
normals performed at a much higher level 
sooner in the learning-set procedure than any 
of the other three groups. Hence, there was 
the possibility that their ability, in this re- 
gard, was primarily responsible for the dif- 
ference found between the high and the low 
groups. Because it was critical to know if the 
CP “rated-not-retarded” group learned more 
than either of the low groups, a GX PB 
analysis of variance and the Duncan multiple- 
range test were performed on the same data 
except that the scores of the normals were 
excluded. Results of both the analysis of 
variance (F = 31.19, df= 2/20, p< .001) 
and the Duncan, on Blocks 3 through 5 ($ 
= 05: Ri= 7.46, Ro= 7.84), revealed that 
the high CPs had indeed learned more than 
either of the two low groups. 

Finally, examination of the learning curves 
for each of the four groups (Figure 1) aroused 
some curiosity as to whether the two low 
groups, the CP “rated-retarded” and the non- 
handicapped retardates, had formed learning _ 
sets to any significant degree. The learning 
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LEGEND 

Normals 
a-——a CP “rated not retarded” 
e-—-e CP “rated retarded” 
bi Retarded 


Fic. 1. Mean percentage of correct responses per problem block for each group. 


trend for each group was subjected to a trend 
analysis using an analysis of variance tech- 
nique. In this case, only a significant F for 
the linear component would mean that some 
learning had taken place across problem 
blocks. The results of these tests showed sig- 
nificant Fs for the linear component on the 
group variable for the normals (F = 7.65, df 
= 1/80, p<.01), the CP “rated-not-re- 
tarded” (F = 9.00, df = 1/80, p< .01), and 
the CP “rated-retarded” (F = 4.07, df = 180, 
p< 05). Thus, only the nonhandicapped 
retardates, it turns out, failed to show any 
significant degree of learning on the experi- 
mental task. 


Rate of Learning 


The significant Groups X Problems inter- 
action (F = 1.96, df = 12/80, p < .05) indi- 
cates that there was indeed a difference be- 
tween the groups in terms of their rates of 
learning. Several trend analyses using an 
analysis of variance technique were performed 
in order to further clarify the nature of this 
difference. A comparison of the learning 
trends for the two high groups, the normals, 
and the CP “rated-not-retarded,” showed a 
significant linear component for the Groups 
X Problems interaction (F = 4.36, df = 1/40, 
p< .05). On the basis of this finding, plus 
inspection of the learning curves for these 
two, groups (Figure 1), it was felt that the 


normals formed sets at a faster rate than the 
CP “rated-not-retarded,” in the early prob: 
lems, but that their rates became similar in 
the latter stages of learning. Another trend 
analysis showed that the same interaction 
effect (F = 6.80, df = 1/80, p< 01) was 
significant when tested on the two CP groups: 
Again, this finding, combined with inspection 
of the group learning curves (Figure 1) im 
volved in the comparison, indicated that the 
CP “rated-not-retarded” learned at 4 Tas 
rate than did the CP “rated-retarded! ! 
comparison of the trends for the two lo 
groups did not produce any significant find 
ings. 


Discussion 


The results of this study supported f 
hypothesis that the CP “rated-not-retattl 
group’s ability to form object-<iscriminsit 
learning sets would be on a pat with $ 
handicapped children with average 1s 
significantly better than the cP a 
tarded” and a group of nonhandicapP® 4 
tardates. The differences occurring betw 
groups, especially the ones having non 
capped children, demonstrate once s 
the capacity for such learning !S a 1 
related, in some positive fashion, to vd 
The fact that the nonhandicapped retaro, 
failed to show any degree of learning 
differ somewhat from what has been rep? 


OBJECT-DISCRIMINATION LEARNING IN CEREBRAL PALSIED 


on children at this IQ level. A probable rea- 
son for this is that the children in this group 
were run on 40 problems; this is far fewer 
than the 60-80 problems run on similar sam- 
ples of children who have demonstrated some 
capacity for forming learning sets (e.g., Ellis, 
1958; Girardeau, 1959; Kaufman & Peter- 
son, 1958; Stevenson & Swartz, 1958). 

Two other factors, task satiation and defi- 
cits in attention, were seen to play a disrup- 
tive role in the learning-set performance of 
a few Ss in each of the four groups. The first 
of these was most likely due to the fact that 
it took an average of 3-4 weeks to run an S 
through the learning-set procedure. During 
this time Ss seemed to hit a period or two 
when they became tired of the experimental 
task. When this occurred, these Ss no longer 
evidenced any concern for securing the rein- 
forcement and they would then lapse into 
rigid or haphazard response patterns. With 
regard to the second fact, it was noted that 
some of the Ss would frequently not look at 
where they were reaching when they made 
their choice response. This was often the case 
with the nonhandicapped retardates, but it 
was an especially prevalent feature in the 
performance of the CP children. It was not 
possible to ascertain if this deficit was the 
result of the organic problems of some of the 
Ss or if some of the children, for whatever 
reason, had not yet learned to attend to en- 
vironmental cues for any length of time in a 
problem-solving situation. The net result of 
this was twofold. One was an increase in in- 
dividual variability which, because it was un- 
systematic in character, inflated the error 
variance. This would seem to imply that the 
trends noted in this study would have been 
even more statistically pronounced than they 
were if there was not this additional source 
of error. The other was a decrease in the re- 
liability of the learning-set scores. Hence, it 
would not be possible to define a reliable and 
valid relationship between the learning-set 
scores and measured intelligence on the basis 
of these findings. Yet, the problem of using 
the learning-set procedure in this context does 
not appear to be insurmountable. Since the 
data showed that definite differences occurred 
between groups by the third problem block, 
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it is possible, for example, that fewer prob- 
lems could have been run, thereby cutting 
down on some of the influence of the factors 
mentioned in the above. But in the broader 
perspective, the fact that variables of this 
nature influence such learning is by no means 
a major threat to clarifying the role of 
measurable aspects of intelligence in learning- 
set performance. Harter (1967) has already 
shown that it was possible to define the rela- 
tionship between various kinds of incentive 
conditions, measured intelligence (both MA 
and 1Q), and object-discrimination learning- 
set formation. It should be possible in future 
investigations to expand the knowledge as to 
the effect of variables like these in order to 
further clarify the contribution of MA and/or 
IQ to such learning. 

One overriding result of this study was 
that the ability to form object-discrimination 
learning sets could be measured adequately on 
children whose handicaps possibly, for rea- 
sons outlined already, interfere with getting 
an adequate measure of intelligence. Perhaps 
the greatest importance should be attached to 
the fact that the CP “rated-not-retarded” 
group performed in a manner similar to an 
age-matched group of normals, It would ap- 
pear, then, that the learning-set procedure 
allows such children to express abilities that 
are not tapped, and most certainly not pre- 
dicted, by the IQ scores. Given the consis- 
tency with which children of average intellec- 
tual ability outperform retardates on this 
task, this finding lends credence to the possi- 
bility that current methods of 1Q testing 
underestimate the ability of some children 
with motor and/or sensory handicaps. If fu- 
ture research shows that there is a relation- 
ship between learning-set formation and mea- 
sured intelligence that is valid and reliable, 
the capacity for such learning might serve as 
one alternative way to assess the intellectual 
or educational potential of such children which 
would be more objective than the means now 
at our disposal. 
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INTERNAL-EXTERNAL CONTROL DIMENSION AS 
RELATED TO ACCIDENT AND SUICIDE PRONENESS 
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A 239-item inventory consisting of Rotter’s Internal-External scale, Keehn’s 


‘Accident Index, Farberow and Devries’ 


? MMPI Suicide scale, 


and Devries’ 


Potential Suicide Personality Inventory was administered to 235 introductory 
psychology students. As predicted, externally oriented Ss generally scored higher 
on the suicide potentiality scales than internally oriented Ss (with the two 


suicide scales correlating highly with 


each other). Contrary to predictions, 


scores on the accident proneness scale correlated positively with scores on both 
the externality and suicide potentiality scales, Possible explanations for these 


results were discussed. 


A review of the literature on accident and 
suicide proneness, on the one hand, and one’s 
perception of reinforcement as internally or 
externally controlled on the other hand, sug- 
gests that these areas may be related. That is, 
there is reason to believe that accident-prone 
individuals may be internally oriented and 
suicide-prone individuals may be externally 
oriented. 

The internal-external control dimension is 
based on Rotter’s (1954) social learning the- 
ory and describes two quite different world 
views. The internally oriented individual be- 
lieves that reinforcements are contingent on 
certain aspects of his own behavior, such as 
competence and skill; while, conversely, the 
externally oriented individual believes that 
such reinforcements are determined by forces 
independent of his own behavior, such as fate, 
chance, luck, or other individuals. 

In recent years there have been a number of 
studies showing differences in behavior as a 
function of the locus-of-control dimension 
(Lefcourt, 1966b; Rotter, 1966). Several in- 
vestigators have demonstrated that the acqui- 
sition and extinction of expectancies depends 
on whether Ss perceive a task as skill- or 
chance-determined (James & Rotter, 1958; 
Nickels & Tolen, 1968; Phares, 19575 Rotter, 
Liverant, & Crowne, 1961). In fact, a scale 
has been developed to measure individual 
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differences in generalized locus-of-control ex- 
pectancy (Rotter, 1966). Through the use of 
this scale and other procedures, data have been 
obtained which indicate that internal scorers 
differ from external scorers in a variety of 
ways. For example, they differ in preferences 
for skill versus chance activities (Schneider, 
1968), perceptual threshold variation (Phares, 
1962), delay in decision-making (Rotter & 
Mulray, 1965), latent learning performance 
(Getter, 1966), unusual shifts in expectancy 
(Battle & Rotter, 1963; James, 1957), mem- 
ory for various kinds of information (Seeman, 
1963), tendencies to forget failure experiences 
(Efran, 1963), degrees of conformity (Crowne 
& Liverant, 1963), resistance to subtle influ- 
ence (Gore, 1962), attempts to control the 
environment (Gore & Rotter, 1963; Strick- 
Jand, 1965), risk-taking behavior (Liverant 
& Scodel, 1960), attitudes toward informa- 
tion and social influence (Seeman & Evans, 
1962), seeking of relevant information (Davis 
& Phares, 1967), and achievement (Butter- 
field, 1964; Crandall, Katkovsky, & Crandall, 
1965). 

More recently Hersch and Scheibe (1967) 
treated locus-of-control as a personality di- 
mension which they related to items and 
scales of the California Psychological Inven- 
tory (Gough, 1964) and the Adjective Check 
List (Gough & Heilbrun, 1965). Results indi- 
cated that internal scorers as compared to 
external scorers were higher on the Domi- 
nance, Tolerance, Capacity for Status, Good 
Impression, Social Presence, Sociability, Psy- 
chological Mindedness, Intellectual Efficiency, 
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Self-Control, Self-Acceptance, Well-Being, 
Responsibility, Achievement via Conformance, 
and Achievement via Independence scales of 
the California Psychological Inventory. Also, 
internal scorers were found to be higher on 
the Adjective Check List scales of Defensive- 
ness, Self-Confidence, Achievement, Domi- 
nance, Endurance, and Order, but lower on 
scales of Succorance and Abasement. More 
specific analysis of individual items revealed 
that internal scorers viewed themselves as 
clever, efficient, egotistical, enthusiastic, inde- 
pendent, self-confident, ambitious, assertive, 
boastful, conceited, conscientious, deliberate, 
persevering, clear-thinking, dependable, de- 
termined, hardheaded, industrious, ingenious, 
insightful, organized, reasonable, and stub- 
born (as determined by these adjectives being 
checked significantly more often by internal 
Ss). According to the researchers, internally 
oriented persons were more likely to describe 
themselves as active, striving, achieving, pow- 
erful, independent, and effective. On the other 
hand, external scorers checked only one 
adjective significantly often—that of “self- 
pitying” (Hersch & Scheibe, 1967). However, 
some evidence suggested that external scorers 
were more anxious and maladjusted than in- 
ternal scorers. Supporting this view, Feather 
(1967) attributed greater anxiety and neuroti- 
cism to external scorers. 

Investigations relevant to (but not previ- 
ously related to) the locus-of-control variable 
include studies on accident proneness and sui- 
cide proneness. The accident-prone individual 
was described by Jenkins (1956) as a talka- 
tive, rule-breaking, independent, self-confi- 
dent, unrestrained, nonconforming, attention- 
seeking, brazen, and reckless individual who 
was cooperative only when it benefited him. As 
compared to “accident nonrepeaters,” they 
displayed greater superiority, boldness, initia- 
tive, and leadership, but were also more 
distractible, aggressively outspoken, impolitic 
in frankness, and grandiose in thinking. In 
addition, they felt less sensitive, less guilty, 
and less ashamed of themselves than non- 
repeaters, and had their feelings hurt less 
often by others. (In fact, they seemed to ob- 
tain satisfaction or thrill in minor injuries, in 
being in pain, or in being hurt by others.) 
When frustrated, however, accident-prone 
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persons had less control of their f 
so became quite upset and angry, 
revealed a greater interest in disciplin 
power vocations and less signs of e 
maturity and mental adequacy. Other 
acteristics attributed to accident-prone 
viduals portray them as being extray 
(Fine, 1963; Keehn, 1961), imprude 
dyce, 1964), daring and adventurous 
& Brewer, 1966; Kunce & Worley, 19 
quick, decisive, impulsive, and indi 
(Alexander, 1949; Dunbar, 1943). 
ingly, one group of researchers (Kun 
Brewer, 1966) suggested that accident 
ness may be incompatible with chroni 
severe maladjustment, whereas another 
(Selzer, Payne, Gifford, & Kelly, 1963) 
a high incidence of psychiatric illness 
accident-prone persons. 
On the basis of previous studies, a í 
rable picture of the suicide-prone (pote 
suicidal) individual can be drawn, 
been characterized as rigid (Fairbai 
Vinoda, 1966; Wall, 1944), introvel 
asocial (Hendin, 1950; Hopkins, 1937 
liams, 1936), timid and dependent ( 
& Napier, 1954; Faris, 1934), easil 
rassed, overly sensitive, restrained in f 
friendships, and lacking in self-confid 
initiative (Farberow & Devries, 1967 
fearful, worthless, and unwanted 
1966), irritable and resentful (Lest 
Stoneman & Perth, 1935), and po 
grated, weak in character, or abnor 
personality (Andics, 1947; Raphael, 
& Berridge, 1937; Sainsbury, 1955; 
1954—reviewed by Robin, 1956; $ 
Davidoff, 1942). In a study contrastim 
cidal and nonsuicidal patients with 
tory or cardiac illnesses (Farberow, į 
ligott, Cohen, & Darbonne, 1966), t 
group was found to be more depen 
pulsive, alert, anxious, agitated, appre 
depressed, and distressed over their 1 
They tended to be “bad patients,” 
were also hostile and intolerant of 
discomfort as well as complaining 
manding of special attention from i 
personnel. On the basis of an investi 
another patient population (Dean, Mis 
DeCook, Wilson, & Maley, 1967), ¥ 
searchers found suicidal patients to © 
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less overt anger, inappropriate behavior, and 
benefit from social activities (but more apa- 
thy) than nonsuicidal control patients. In- 
terestingly, the suicidal patients in this study 
were labeled “cooperative” and rated higher 
than controls in “adjustment,” while other 
researchers seemingly stereotyped them as 
“uncooperative” (Farberow et al., 1966) and 
lacking in “adjustment” (Andics, 1947; 
Batchelor & Napier, 1954; Devries, 1966). A 
parallel inconsistency may be pointed out in 
that Dean and his colleagues, as well as Win- 
field and Sparer (1953), found suicides to be 
less extrapunitive than controls, whereas 
Farberow (1950) found suicides to be more 
extrapunitive than controls, and both Fisher 
and Hinds (1951) and Vinoda (1966) found 
no differences between the attempted suicide 
population and psychiatric patients (except 
that suicidal persons were more rigid and 
revealed more general hostility). 

Although previous investigators have not 
compared the locus-of-control variable with 
either accident or suicide proneness, several 
researchers have specifically related the latter 
two areas, For example, Litman and Tabach- 
nick (1967) described the accident-prone per- 
son as active, counterphobic or counterde- 
pressive, impulsive, quick, decisive, indepen- 
dent, adventurous, rebellious, and defiant of 
authority, On the other hand, these same re- 
searchers described the suicide-prone person 
as masochistic, depressive, dependent, passive, 
immobilized, constricted, and disinterested in 
physical activity. In other words, the former 
individual was viewed as a “risk-taking win- 
ner,” the latter as a “danger-avoiding loser.” 
Also, Preston (1964) found that accident- 
Prone persons were more extrapunitive, less 
impunitive, and rather barrier-oriented (ob- 
stacle-dominance). Conversely, he found that 
suicide-prone persons were less extrapunitive, 
more impunitive, and rather ego-oriented 
(ego-defense). The results on extrapunitive- 
ness tend to confirm the findings of several in- 
vestigators (Dean et al., 1967; Winfield & 
Sparer, 1953) discussed above. In another 
study (Tabachnick, Litman, Osman, Jones, 
Cohn, Kasper, & Moffat, 1966), accident 
victims were found to be characteristically 
active, self-satisfied, and self-aggrandizing, 
but concerned with their appearance, whereas 
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suicide victims were dependent, self-critical, 
and self-punishing, but unconcerned with their 
appearance, Also, the former tended to be 
idealized by their survivors, whereas the latter 
tended to be depreciated by them. Although 
within a year prior to their deaths, accident 
victims had contemplated moving (or had 
actually moved) into situations of personal 
growth and responsibility, suicide victims just 
before their deaths had encountered the loss 
of a significant person or a feeling of failure 
(or being unloved). In addition, accident- 
prone persons were distantly integrated with 
and relatively autonomous of individuals in 
their environments, whereas suicide-prone per- 
sons tended to be closely integrated with and 
highly dependent on individuals in their en- 
vironments. Finally, persons dying by acci- 
dents were rather exhibitionistic and fre- 
quently upset when slighted and criticized by 
others, whereas persons dying by suicide were 
rather depressive and frequently upset by the 
loss of a significant other or the deprivation of 
important resources. As an interesting aside, 
two other investigators cast doubts on these 
results by claiming that suicide-prone persons 
display exhibitionism (Teicher, 1947) and a 
relative insensitivity to “hurt feelings” (Jen- 
kins, 1956). 

From the above summary of research on 
the internality-externality and accident-sui- 
cide variables, it appears that many of the 
characteristics attributed to the internal ori- 
entation are those typically associated with 
accident proneness, and many of the charac- 
teristics attributed to the external orientation 
are those typically associated with suicide 
proneness. That is, both internally oriented 
and accident-prone persons tend to be inde- 
pendent, self-confident, brazen, boastful, en- 
thusiastic, active, and achieving; whereas both 
externally oriented and suicide-prone persons 
tend to be dependent, guilty, anxious, timid, 
depressive, and unachieving. The former re- 
veal their initiative, dominance, and power 
orientations, and a recent gain in personal 
growth and responsibility; the latter reveal 
their lack of initiative, their lack of self-con- 
trol, and a recent loss of significant persons 
and resources in their lives. In addition, the 
former appear more self-acceptant, self-satis- 
fied, and self-aggrandizing; the latter more 
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self-critical, self-punishing, and self-pitying. 
Finally, the former seem to be more sociable, 
more concerned with their appearance and 
making a good impression, and more idealized 
by significant others; the latter more asocial, 
more unconcerned with their appearance and 
making a good impression, and more depreci- 
ated by significant others. 

A close approximation to the above descrip- 
tion of the internality-accident combination 
is Litman and Tabachnick’s (1967) state- 
ments: 


the accident-prone individual is active. . . . inde- 
pendent, adventurous, and likes excitement. . . . He 
cannot tolerate the feeling of being boxed in or 
losing autonomy [p. 253]. 

A major responsibility for fatal accidents must 
also be assigned to counterphobic and counterde- 
pressive attitudes....Some people try to avert 
depression by committing themselves to endeavors 
that exceed their capacities. This may lead to bun- 
gled actions which are potentially fatal. Counter- 
phobic persons gain self-esteem by taking drugs 
without becoming addicted, or by driving while 
tired, fighting fatigue, and struggling to stay awake. 
. .. The accident-prone individual may place him- 
self in risky situations to strengthen his feelings of 
invulnerability and omnipotence . . . [pp. 256-257]. 


On the other hand, Weiss (1957) has 
clearly described the externality-suicide com- 
bination in his statement: 


Many suicidal attempts have at least in part the 
character of a gamble with death, a sort of Russian 
roulette, the outcome of which depends to some 
extent on chance. The attempts are consciously or 
unconsciously arranged in such a manner that the 
lethal probability may vary from almost certain 
survival to almost certain death; and “fate”—or at 
least some force external to the conscious choice of 
the person—is compelled in some perhaps magical 
way to make the final decision [p. 21]. 

The purpose of the present study was to 
determine whether accident-prone persons 
view their reinforcements as contingent on 
their own skill, capability, proficiency, or 
expertise (internality), whereas suicide-prone 
persons view their reinforcements as contin- 
gent on fate, chance, luck, or other persons 
(externality). Accordingly, two hypotheses 
were advanced: First, measures of the exter- 
nal locus-of-control philosophy will correlate 
positively with measures of suicide proneness 
but negatively with measures of accident 


proneness. Second, measures of suicide prone- 
ness will correlate positively with one another 
but negatively with measures of accident 
proneness. 


METHOD 
Subjects 


The Ss were 235 introductory psychology students 
"(114 males and 121 females) at the University of 
Manitoba, who voluntarily participated in this 
experiment as part of their course requirement. 


Instruments 


A single questionnaire was administered to. groups 
of 40 to 60 Ss. The questionnaire consisted of all 
items comprising four separate scales, namely, the 
Internal-External scale (I-E; Rotter, 1966); the 
Accident Index (AI; Keehn,'1959); the MMPI Sui- 
cide scale (MMPIS; Farberow & Devries, 1967); 
and the Potential Suicide Personality Inventory 
(PSPI; Devries, 1966). The I-E scale was kept 
intact, but the items from the other instruments were 
placed into the final questionnaire through the use 
of a table of random numbers. 

The I-E is a 23-item (plus 6 fillers) forced-choice 
instrument reduced from an original pool of 100 
items developed on a priori grounds. For each non- 
filler item an internal belief is compared to an 
external belief, and extensive normative data have 
been obtained on various populations under various 
circumstances, _ 

The AI is a 38-item acquiescence-type instrument 
reduced from an original pool of 41 items developed 
as a listing of various accidents and minor mishaps, 
One hundred (mostly Arabic) male university stu- 
dents between 18 and 25 years of age served as the 
normative population, 

The MMPIS is a 52-item true-false scale extracted 
from the short form of the MMPI. Normative data 
were obtained on male neuropsychiatric hospital pa- 
tients separated into (a) original groups of 21 com- 
mitted suicides, 32 attempted suicides, 32 threatened 
suicides, and 32 nonsuicidal patients; (b) replication 
groups of 22 committed suicides, 47 attempted sui- 
cides, 50 threatened suicides, and 48 nonsuicidal 
patients; and (c) cross-validation groups of 50 
threatened suicides and 50 nonsuicidal patients. The 
median ages of the various groups ranged between 
32 and 37. 

The PSPI is a 50-item (plus 5 fillers) acquiescence- 
type instrument reduced from 333 critical incidents 
relevant to suicide. The original items were obtained 
from research articles on suicide, Normative data 
were obtained on male neuropsychiatric hospital pa- 
tients separated into groups of 33 attempted sui- 
cides, 55 threatened suicides, 42 attempted and 
threatened suicides, and 83 nonsuicidal patients. 
Fifty-two percent of the suicidal patients were 
between 17 and 39 years of age, whereas 45% of the 
nonsuicidal patients fell within this age range. 
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ACCIDENT AND SUICIDE PRONENESS 


TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR MEASURES OF 
EXTERNALITY, ACCIDENT PRONENESS, AND 
SUICIDE POTENTIALITY 


M SD 
Measure 7 
Males | Females | Males | Females 

Externality (I-E) 9.28 9.41 4,28 2.36 
Accident proneness| 

(AI) 14.62 | 15.91 | 4.58 | 5.30 
Suicidal potential- 

ity (MMPIS) 14.91 16.12 8.40 8.06 
Suicida! potential- 

ity (PSPI) 23.03 | 22.57 | 3.34 | 3.30 


Note.—For males, N = 114;forfemales,N = 121. 


Procedure 


Intercorrelations among the measures of external- 
ity, accident proneness, and suicide potentiality were 
based on the Pearson product-moment correlation 
and were computed separately for each sex. A one- 
way analysis of variance on extreme I-E scores was 
also computed in order to provide a further test 
of the relationship between externality on the one 
hand and AI, MMPIS, and PSPI scores on the other. 


RESULTS 


Means and standard deviations for mea- 
sures of externality, accident proneness, and 
suicide potentiality were computed separately 
for each sex and appear in Table 1. 

Intercorrelations among these same mea- 
sures appear in Table 2. 

All but two correlations reached statistical 
significance, with three being highly signifi- 
cant. As predicted in the first hypothesis, 
externality was directly related to suicide 
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potentiality as measured by the MMPIS (for 
both sexes) and the PSPI (for females only). 
In opposition to this same hypothesis, exter- 
nality was neither directly related to male 
PSPI scores nor inversely related to AI scores 
(for either sex). The second hypothesis was 
also partially confirmed, since correlations 
between the two suicide potentiality scales 
(MMPIS x PSPI) were highly significant in 
a positive direction. Contrary to this hypothe- 
sis, however, AI scores did not correlate nega- 
tively with MMPIS or PSPI scores. In fact, 
all correlations obtained in the present study 
were positive, and the third highest coefficient 
was that resulting from the correlation of 
AI and female MMPIS scores, 

The one-way analysis of variance revealed 
that the 114 “clearly” external Ss (those 54 
males and 60 females with externality scores 
greater than 9) had significantly higher 
accident- and suicide-proneness scores than 
the 65 “clearly” internal Ss (those 31 males 
and 34 females with externality scores less 
than 7), regardless of whether the scores were 
based on the AI (F = 12.80, p < .001), the 
MMPIS (F = 20.53, p < .001), or the PSPI 
(F = 10.53, p < .01). In other words, through 
the use of the extreme I-E cutoff points sug- 
gested by Eisenman and Platt (1968) from 
Rotter’s normative data (1966), the research- 
ers obtained additional confirmation for the 
results presented in Table 2. 


Discussion 


The results of the present study, taken as 
a whole, suggest that externality, accident 


TABLE 2 


INTERCORRELATIONS AMONG MEASURES OF EXTERNALITY, ACCIDENT PRONENESS, 
AND SUICIDE POTENTIALITY 


Externality Accident proneness Suicide potentiality 
(-E) (AD ____(MMPIS) 
Measure 

Males Females Males Females Males Females 

Accident pronen — = — =- 
oie ess (AI) .20* 14 F 

Suicide potentiality E .25** .18* .29** apee a2 sb 
Suicide potentiality (PSPI) 14 .18* .24* 2e l | Sore are 
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proneness, and suicide potentiality are per- 
sonality traits which vary together in the 
college population, The unexpected finding 
that AI scores correlate directly with mea- 
sures of both externality and suicide prone- 
ness—trather than inversely as had been pre- 
dicted—may reflect the lack of validity of 
the AI. Although the AI was developed on a 
university student norm group, it may not 
validly measure accident proneness either for 
present Ss (since most were Canadian rather 
than Arabic) or for any but the most honest 
accident repeater (since the AI requests 
truthful answers to obvious questions about 
a rather “touchy” problem area): On the 
other hand, perhaps the lack of I-E dif- 
ferentiation between accident and suicide 
proneness in the present research reflects the 
dissimilarity between the male and female 
normal college sample studied herein and the 
exclusively male neuropsychiatric hospital 
samples used to establish MMPIS and PSPI 
norms, Finally, the randomizing of AI, 
MMPIS, and PSPI items may have con- 
tributed in some way to the unexpected com- 
monality among all four personality measures. 

Nevertheless the question can still be asked, 
“If the four personality measures are valid, 
how can the results of the present study 
be explained?” Perhaps the accident-prone 
Ss were internally oriented as originally 
hypothesized but circumstances inflated their 
externality. First, the environment in which 
Ss function may to some extent influence 
whether they appear internally or externally 
oriented. Internals have looked like externals 
when reinforcements seemed to be chance- 
determined (Rotter & Mulray, 1965), and 
externals have looked like internals when re- 
inforcements seemed to be skill-determined 
(Lefcourt & Ladwig, 1965). According to 
Hersch and Scheibe (1967) an individual may 
describe himself as an external because he is 
in a highly competitive situation where the 
actions of others may indeed have great rele- 
vance for the success of his own efforts. 
Therefore, Ss used in the present study 
(college students) may have viewed their 
present academic reinforcements as contin- 
gent on others, and so may have responded 
as if they were externally oriented individuals, 


even though their basic world views may have 
been internal. 

Second, both accident proneness (Litman 
& Tabachnick, 1967, pp. 251-252) and sui- 
cide proneness (Shneidman, 1965, pp. 319- 
320) have been called transient personality 
characteristics rather than permanent char- 
acter traits. Therefore, accident-prone persons 
in the present study could have been inter- 
nally oriented during certain transition periods 
in their lives, at which times they reacted 
to stress by engaging in reckless and rule- 
breaking behavior resulting in accidents, 
Afterward, however, when the stress lessened 
(and they agreed to participate in the present 
study), they indicated both their previous, 
internally oriented accidents but their present 
locus-of-control philosophies, which for a large 
number of Ss could have changed to an 
external orientation. 

Third, there is evidence that accident-prone 
persons tend to “forget” their accidents 
(LeShan & Brame, 1953), and internals tend 
to “forget” their failure experiences (Efran, 
1963). This “forgetfulness” could be a factor 
if the internal individual who views himself 
as efficient, competent, deliberate, and clear 
thinking fails to maintain this image by 
having accidents (e.g., crashing cars, cutting 
himself, bumping into things, etc.). He might 
tend to forget these failures and therefore 
answer the AI as if he were really nonaccident 
prone. Such behavior on a large scale could 
reverse the relationship between internality 
and accident proneness, and could account for 
present results. 

Looking at the results from the reverse 
standpoint, perhaps the accident-prone Ss in 
the present study were, in fact, externally 
oriented. The problem then becomes one of 
explaining how the characteristics of acci- 
dent Proneness can seem to be consistent 
with the internal orientation and yet not be 
upon closer scrutiny. First, accident-prone 
persons may resemble internally oriented indi- 
viduals in many respects (e.g., independence 
and self-aggrandizement), even though they 
may have certain other qualities—more essen- 
tial to the concept of accident proneness— 
which internally oriented persons typically 
lack (e.g., recklessness and tule-breaking). 
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ACCIDENT AND SUICIDE PRONENESS 


Second, although the outward appearance 
of accident-prone persons may suggest the 
self-control and self-acceptance of internally 
oriented individuals, the inner dynamics of 
the case (Litman & Tabachnick, 1967) may 
reveal that their impressive demeanor is actu- 
ally a superficial denial of the urge to regress, 
to be passive, and to engage in escape, guilt, 
and revenge fantasies. In other words, in most 
instances they may do a superb job of hiding 
their close kinship to the suicide-prone person, 
although the present study uncovered this 
telationship. 

As a preliminary formulation, we suggest that the 
crucial factor which suicide prone and accident prone 
have in common is a weakness of the integrating and 
synthesizing functions of the ego which allows two 
distinct patterns of activity and passivity to coexist. 


In this sense, suicide-prone and accident-prone states 
are reciprocal [Litman & Tabachnick, 1967, p. 257]. 


Perhaps the various measures of accident and 
suicide proneness correlate because they tap 
this common factor of inner dynamics. 

i If this common factor is labeled “malad- 
Justment,” then measures of accident and 
suicide proneness might be expected to cor- 
relate positively with I-E scores, which them- 
selves have been found to relate directly 
to measures of maladjustment (Cromwell, 
Rosenthal, Shakow, & Kahn, 1961; Feather, 
1967; Hersch & Scheibe, 1967; Rotter, 1966). 
Furthermore, even though “general anxiety” 
may (Feather, 1967; Watson, 1967) or may 
not (Efran, 1963) correlate positively with 
measures of externality, one study (Tolor & 
Reznikoff, 1967) found that externals revealed 
a greater amount of overt “death anxiety” 
than internals. Such findings on maladjust- 
ment and death anxiety may well account for 
the positive correlations among measures of 
externality, accident proneness, and suicide 
potentiality, 

Third, although in the present study most 
accident- and suicide-prone persons may be 
externally oriented, any one-to-one pairing 
Of locus-of-control, accident-proneness, and 
Suicide-potentiality variables probably repre- 
Sents a gross oversimplification. In contrast to 
the original hypotheses, some investigators 

lave linked accident proneness to externality 
and suicide proneness to internality. For ex- 
ample, the relationship between accident 
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proneness and externality has been clearly 
made. 


The ego of the accident prone refuses to accept 
responsibility for his actions. . . . At the same time, 
there is no feeling of ego-alien “driving” toward 
accidents. They simply “happen,” and “fate” or 
“Juck” seems to be against the person [LeShan, 1952, 
footnote, p. 77]. 


Conversely, researchers in the area of suicide 
prevention have clearly indicated that some 
persons play a conscious and deliberate role 
in their own demise. Whether such individuals 
are called implementers (Farberow, Shneid- 
man, & Leonard, 1963) or psyde-initiators 
(Shneidman, 1963), they may certainly be 
described as internally oriented. 


A Psyde-iniator believes that he will suffer cessation 
in the fairly near future—a matter of days or weeks 
—or he believes he is failing and, not wishing to 
accommodate himself to a new (and less effective 
and less virile) image of himself, does not wish to 
let “it” happen to him. Rather, ke wants to play a 
role in its occurrence. Thus he will do it for himself, 
at his own time, and on his own terms. . . . We find, 
on occasion, a case in which an older person, hos- 
pitalized in a general medical hospital, in the terminal 
stages of a fatal disease will, with remarkable and 
totally unexpected energy and strength, take the 
tubes and needles out of himself, climb over the bed 
rails, lift a heavy window, and throw himself to the 
ground several stories below [Shneidman, 1963, p. 
213]. 


To further complicate the picture, accidents 
may sometimes represent self-destructive in- 
tentions (MacDonald, 1964; Menninger, 
1936) as well as attempts to solve personal 
problems in dangerous surroundings (Behan 
& Hirshfield, 1963). Similarly, so-called sui- 
cides may represent an unintentioned or 
accidental ending to one’s life as well as 
an example of deliberate self-destruction 
(Shneidman, 1963). 

As the present researchers view the complex 
interaction among the locus-of-control, acci- 
dent-proneness, and suicide-potentiality vari- 
ables, they hypothesize that, aside from a 
person’s physical condition, his death may to 
a large extent depend on (a) his desire to 
live or die, (b) his expectancy that he will 
live or die (within a prescribed period), and 
(c) his locus-of-control philosophy. Future 
studies may well consider the degree to which 
the above factors can predict fatal and non- 
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fatal accidents and attempted and completed 
suicides. In addition, through the use of the 
locus-of-control variable, further attempts can 
be made to differentiate those persons or 
conditions ripe for both accidents and suicides 
from those persons or conditions ripe for one 
but not the other (Henry & Short, 1951; 
Selzer & Payne, 1962). 

Further investigations of accident and sui- 
cide proneness may well reveal the close rela- 
tionship between these variables and that of 
locus of control. Since Lefcourt (1966a) has 
already indicated the importance of the I-E 
dimension for work in psychopathology and 
psychotherapy, perhaps the present study can 
offer a slight glimpse of the benefits for 
extending locus-of-control studies into the 
taboo areas of accidents, suicides, and death. 
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study reported by Shaw and White 
}) indicated a relationship between child- 
t identification and academic achieve- 
Shaw and White found that the high 
achiever, but not the underachiever, 
fies with the like-sexed parent. In addi- 
they suggested that an appropriate sex- 
dentification is characteristic of achievers 

ot of underachievers. 
ince this study was conducted at a single 
of intellectual capacity (IQ = 110), the 
ality of the findings has not been estab- 
. The index of identification was based 
e correlation between the perceptions of 
d and the parent, thus assuming that 
dentification related to achievement is 
ent on identification with the like- 
parent. The current study extends this 
estigation to the total IQ range of a small 
| high school and the assessment of iden- 
cation by objective test measures dependent 
On the child’s interests in and preferences 
‘sex role. On the basis of the Shaw and 
Teport, it was predicted that (a) under- 
fs will exhibit significantly lower in- 
appropriate sex-role identification 
evers, and (b) this difference will 
dless of level of intellectual ca- 


s 
The authors acknowledge their indebtedness to 
td G. Lathrop of Chico State College for his 
œ and counsel. Requests for reprints should 
t to Lois Knowles, 246 North Enright, Wil- 
California 95988, 
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CHILD-PARENT IDENTIFICATION AND ACADEMIC 
UNDERACHIEVEMENT 


ELNORE GRANLUND anp LOIS KNOWLES + 
Chico State College 


The current study extends Shaw and White’s investigation of the relationship 
of child-parent identification and academic underachievement by utilizing a 
wider range of intelligence scores and a different index of identification. Forty- 
eight male Ss were classified as either underachievers (UA) or achievers (A) 
based on the ratio of the prior year’s cumulative GPA to their California Test 
of Mental Maturity IQ score. UA and A masculinity-femininity scores derived 
from the Bell Adjustment Inventory were significantly different (p< .01); no 
such differences were found between IQ levels or the interaction of IQ X 
Achievement. The findings extend Shaw and White's suggestion that male sex- 
role identification is characteristic of A but not of UA. 


METHOD 
Subjects 


The Ss were selected from the male student popu- 
lation and divided into three equal groups (low, 
medium, and high IQ) based on IQ scores from the 
California Test of Mental Maturity. IQ ranges were: 
low = 74-89; medium = 90-105 ; high = 106-142. The 
groups were further subdivided on the basis of ra- 
tios of cumulative grade-point average (GPA) for 
the previous year to IQ score. The lower half of the 
computed ratios were classified as underachievers; the 
upper half as achievers. All Ss who were not available 
for testing or had a GPA from another school dis- 
trict were eliminated from the study. Further random 
elimination of Ss resulted in the final sample of eight 
Ss in each of six subgroups (Low, Medium, High IQ 
X Underachiever, Achiever)—a total of 48 Ss. 


Procedure 


The Bell Adjustment Inventory Revised (1962) 
Student Form (Bell, 1963) was administered to all 
Ss as a group. The Es indicated that the scores were 
to be treated in confidence and that none of the 
results would become a part of the S’s school record. 
The test administration was untimed. 


RESULTS 


Since Bell (1963) considers a high score on 
the 30-item masculinity-femininity scale to 
indicate a good identification with some male 
during infancy and childhood, scores on this 
scale were used as an index of sex-role identi- 
fication. These scores for the six subgroups 
were analyzed by analysis of variance. The re- 
sults of this analysis are presented in Table 1. 
As noted there, achievers and underachievers , 
have significantly different ($ < 01) mascu-: 
linity scores. However, no significant differ- 
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TABLE 1 
ANALYSIS OF VARIANCE: MASCULINITY SCORES 


Source df MS F 
Achievement level 1 136.69 | 11.24* 
IQ (low, medium, high) 2 8.52 oh 
Achievement Level X IQ 2 33.94 | 2.79 

Error 42 12.16 
Total 47 
*p <.01, 


ences attributable to level of IQ or its inter- 
action with achievement are apparent. Further 
inspection of the data indicated that males 
classified as achievers had a higher masculin- 
ity score (X= 21.44) than underachievers 
(2 = 18.08). 


Discussion 


The present data indicate a relationship be- 
tween underachievement and lack of sex-role 
identification as measured by the Bell Ad- 
justment Inventory’s masculinity-femininity 
scale. When these results are considered in 
conjunction with the findings of Shaw and 
White, the hypothesis that identification and 
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underachievement are related is supported. 
The current study, however, considerably ex- 
tends the generality of these conclusions by 
indicating that this relationship is manifested 
over the entire range of IQ scores and is not 
dependent upon a single method of assessing 
identification. It should be noted, moreover, 
that the Shaw and White study focuses on 
parent-child relationships, whereas Bell indi- 
cates that his masculinity-femininity scale 
measures the degree of like-sex identification 
not necessarily dependent upon parental rela- 
tionship. Further study is needed to determine 
whether an appropriate parental identifica- 
tion per se is a prerequisite for academic 
achievement and whether sex-role identifica- 
tion is a concomitant of appropriate parental 
identification. 
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WORD ASSOCIATION REPERTOIRES OF 
SCHIZOPHRENICS AND NORMALS* 


GEORGE D. FULLER ? anp SOLIS L. KATES 8 


University of Massachusetts 


The word association repertoires (of three association responses) of 20 poor 
and 20 good premorbid nonparanoid schizophrenics were not more deviant 
than those of 20 normals. Over all groups, there was significantly greater com- 
monality in the first association response than in the second response which, 
in turn, showed greater commonality than the third response. These results 
suggest that schizophrenic language disorders are not due to overall language 
deficiency and partially support the disturbance viewpoint. A third viewpoint 
is proposed, based on a two-stage process involving nondeviant associations 
with inadequate cognitive controls as differentiating schizophrenics from 
normals. The hierarchical order for frequency given by the Russell-Jenkins 


$ 


norms obtains for both discrete and extended word association responses. 


The present study investigates whether the 
word association repertoires of poor premorbid 
and good premorbid nonparanoid schizophren- 


“ics are more deviant than those of normal Ss. 


The association repertoires obtained in this 
research are made up of three consecutive 
association responses to each stimulus word. 

Several studies using the Kent-Rosanoff 


| Word association list with schizophrenics have 


found that they have associations that are 


less common than normals (Sommer, Dewar, 


& Osmond, 1960), more idiosyncratic than 
Normals (Johnson, Weiss, & Zelhart, 1964), 
and less related to the stimuli than normals 
(Moran, 1953). On the other hand, the con- 
clusion that schizophrenic patients are not 
different from normals in their verbal response 
habits and in their association responses has 


been reached by other investigators (Do- 


kecki, Polidoro, & Cromwell, 1965; Wynne, 
1963). One purpose of this study is to see 
whether schizophrenic Ss are indeed deviant 
in their word association responses. 

Tf our results point to greater deviance in 
word association repertoires for schizophrenic 
Patients, they may also offer some evidence 
about the two major viewpoints describing 

1 Base; d, 
submi 


of th 
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ased, in part, on a Master of Science thesis 
itted by the senior author in partial fulfillment 
e Master of Science degree requirements at the 
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tes, Department of Psychology, University of 
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schizophrenic language disorder. One view- 
point suggests that disturbance effects due to 
competing responses are responsible for schizo- 
phrenic deficit (Broen & Storms, 1966) 
whereas the other viewpoint offers general 
language deficiency as the responsible factor 
(Cameron, 1947). 

Broen and Storms (1966) infer that, for 
schizophrenics, competition between the more 
common and less common responses tends to 
flatten out the frequency distribution of dis- 
crete word associations. Consequently, the 
initial responses of schizophrenics may show 
less commonality (are of lower frequency) 
than those of normals. From this competition- 
disturbance viewpoint, it is reasonable to as- 
sume that common responses which are in- 
hibited as the first association in schizophren- 
ics should tend to emerge as the second or 
third association in their response hierarchy. 
If, however, there is an overall language de- 
ficiency as inferred by Cameron (1947), 
schizophrenics should offer association re- 
sponses that show significantly greater de- 
creases in commonality from the first to the 
second to the third association than the 
comparable responses of the normals, Another 
purpose of this investigation is to determine if 
there is support for one or the other of the 
two theoretical systems described above. 

In addition, it is predicted that for the 
combined schizophrenic and normal $ groups 
the hierarchical order of responses in fre- 
quency norms for discrete word associations, 
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TABLE 1 


MEANS AND STANDARD DEVIATIONS FOR THE MATCHING 
CRITERIA OF THE SUBJECT GROUPS 
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TABLE 2 


MEANS AND STANDARD DEVIATIONS FOR COMMONALITY 
AND IDIOSYNCRATIC MEASURES 


Group 
Criterion 
S Good Poor 
Normals premorbid|premorbid 
Age 
M 43.8 44.5 44.0 
SD 9.3 7.5 7.2 
Education (in grades) 
11.1 10.6 11.2 
SD 1.8 3.0 2.9 
IQ 
M 22.1 21.1 22.4 
SD 3.1 2.9 3.4 


as described in the Russell-Jenkins (1954) 
norms, should obtain when word association 
repertoires of three association responses are 
secured. Words given as first associations 
will be represented by a greater frequency on 
the discrete norms than words given as second 
associations, and second associations will be 
represented by a greater frequency than third. 
It is further predicted that these relationships 
will hold for both schizophrenic and normal 
Ss. 


METHOD 
Subjects 


Sixty male Ss in three groups (20 nonparanoid 
good premorbid schizophrenics, 20 nonparanoid poor 
premorbid schizophrenics, and 20 normal Ss) were 
matched on the basis of age, IQ (WAIS Vocabulary 
and Information), and educational level (Table 1). 

Good premorbids were defined as those schizo- 
phrenics rated between O and 12 points on the 
Phillips Scale, and poor premorbids as those be- 
tween 18 and 30 points. As a result of previous 
studies finding paranoid-nonparanoid to be an inde- 
pendent dimension in schizophrenia (Buss & Lang, 
1965), all schizophrenics diagnosed as paranoid or 
showing significant paranoid features in their case 
histories were excluded. 

Additional criteria for selection of both normal 
and schizophrenic Ss were: (a) cooperative, atten- 
tive normal and schizophrenic males able to under- 
stand and conform to the instructions; (b) between 
the ages of 21 and 60; (c) for schizophrenics, no 
other complicating pathology (e.g., organicity, men- 
tal retardation, alcoholism) and no lobotomy or 
electroconvulsive therapy within the last 6 months; 
(d) for normals, presently making an adequate and 
stable adjustment to the community with no mod- 


Response 
Group Total 
1 2 3 
Normals 
Commonality 
M 3,982 1,447 | 911 | 6,340 
SD 1,228 571 466 | 1,402 
Idiosyncratic 
M gal 6.6 9.6 19.3 
SD 2.2 3.0 2.9 6.3 
Good premorbid 
Commonality 
M 3,802 1,263 |819 | 5,884 
SD 1,196 615 563 |1,575 
Idiosyncratic 
M 2.8 8.8 | 10.2 21.8 
SD 2.7 3:2 4.1 8.5 
Poor premorbid 
Commonality 
M 3,516 1,146 | 682 | 5,344 
SD 1,202 601 | 459 | 1,638 
Idiosyncratic 
M 3.4 7.8 | 10.8 22.0 
SD 2.3 3.7 3.7 8.4 


erate or disabling emotional disturbance or alco- 
holism. 


Test Materials 


Twenty-four words were selected from the Kent- 
Rosanoff word association list which were likely to 
yield superordinate association responses (e.g., “fruit” 
as the superordinate or class response for the stimu- 
lus word “apple”). The stimulus words were pre- 
sented to Ss singly on 3 X 5 inch cards, 


Procedure 


The Ss were tested individually and the evalua- 
tion of each S’s repertoire was based on the scoring 
of three consecutive responses to each stimulus word. 

The groups were compared on (a) commonality 
scores where each association given by S was as- 
signed a score equal to its response frequency as 
given in the Russell-Jenkins (1954) norms and (b) 
the number of idiosyncratic responses (associations 
given by Ss having a frequency of zero in the above 
norms). 


RESULTS 


Analyses of variance procedures for both 
the commonality of response data and the 
number of idiosyncratic associations showed 
that the poor premorbid, good premorbid, and 
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measure for first three responses. 


normal groups did not differ significantly 
from each other. 

The three responses differed significantly 
from each other on the commonality measure 
(F = 238, dj = 2/108, p < .001) and on the 
idiosyncratic measure (F = 180, df = 2/108, 
$< .001) (Table 2). For each dependent 
Measure, the first association response dif- 
fered significantly from the second which, in 
turn, differed significantly from the third 
Tesponse. The three diagnostic groups show 
Closely parallel response curves for the com- 
Monality and idiosyncratic measures which 
are quite clearly seen in Figures 1 and 2. No 
oo interactions between responses and 

agnostic groups were noted for commonality 
and idiosyncratic scores. 


Discussion 


fag results show that nonparanoid schizo- 
Phrenics, both good and poor premorbid, 
Possess word association response repertoires 
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NUMBER OP IDIOSYNCRATIC RESPONSES 
o 


RESPONSE 


Fic. 2. Performance of groups on idiosyncratic 
measure for first three responses. 


that are very similar to those of normal Ss. 
First, no significant differences were noted 
among the groups on the measures of com- 
monality and idiosyncratic responses given to 
the 24 selected Kent-Rosanoff words. Further, 
the response curves for commonality and 
idiosyncratic responses are very similar for 
poor and good schizophrenic and normal Ss 
when their three association responses are 
compared separately. These findings question 
the viewpoint that schizophrenics have an 
overall language deficiency. On the other 
hand, our findings support the disturbance 
viewpoint, in part, and affirm the inference 
that the first, second, and third word-associa- 
tion responses in schizophrenics do not show 
a progressive decrease in commonality greater 
than the corresponding responses of normals. 

Possibly a different account of schizophrenic 
disorder may be proposed based upon the 
highly similar two-stage approach indepen- 
dently suggested by Cohen and Camhi (1967) 
and also by Schwartz and Rouse (1961). 
Cohen and Camhi (1967) reject the conclu- 
sion that schizophrenics have deviant word 
association repertoires from which they sample 


500 


responses, and assert that they make inap- 
propriate selections from these nondeviant 
repertoires, which normals are better able to 
suppress. Schwartz and Rouse (1961) divide 
the association process into associative link- 
ages and cognitive controls, holding that in 
clinical patients there is an abnormal modifi- 
cation of the cognitive controls. The associa- 
tive linkages in schizophrenic patients are 
probably similar to those of the normals, but 
it is our suggestion that there is an impair- 
ment of the cognitive controls in schizo- 
phrenics permitting associations to emerge in 
contexts where they are inappropriate. These 
associations, derived from nondeviant associ- 
ation repertoires, are consequently deviant 
when overtly expressed. 

The finding that the initial association re- 
sponse of both schizophrenic and normal Ss 
was more common and less idiosyncratic than 
the second response which, in turn, was more 
common and less idiosyncratic than the third 
response for the diagnostic groups supports 
the Russell-Jenkins (1954) norms for com- 
monality and idiosyncrasy. It further indi- 
cates that the hierarchical order of responses 
in the frequency norms for discrete word 
associations obtains for extended word associ- 
ations involving three responses. 
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INTELLIGENCE OF VOLUNTEERS AS RESEARCH SUBJECTS* 


JAY M. CUDRIN? 
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Recent inquiries into invasion of privacy 
through the use of psychological tests (Bray- 
field, 1965) and research in the social sci- 
ences (Brayfield, 1967) have made it increas- 
ingly important for psychologists to carry out 
their research with voluntary Ss. This empha- 
sis on the use of volunteers underscores the 
necessity for examining the personal charac- 
teristics of such Ss. A growing body of litera- 
ture has already begun to scrutinize this issue. 
Tn their discussion of the Kinsey study, for 
example, Maslow and Sakoda (1952) showed 
that volunteers tended to have higher self- 


"esteem than nonvolunteers. Riggs and Kaess 


(1955) learned that when students in a psy- 
chology class were asked to volunteer to take 
a psychological test, those who came forward 


_ showed more introversive thinking and moodi- 


ness than nonvolunteers. The volunteers also 
demonstrated greater cyclical trends in emo- 
tionality, higher intropunitiveness, and lower 
extrapunitiveness. This entire issue has also 
been studied and discussed by Bair and Gal- 
lagher (1960), Locke (1954), and Newman 
(1956). 

Tn view of the number of volunteer charac- 
teristics already studied, it is noteworthy that 
no one has examined the relative intelligence 
level of volunteers as a group. Investigators 
generally recognize the importance of intelli- 
gence in behavioral research, and usually try 
to equate the intellectual levels of their ex- 
perimental and control groups. It may be 
that people volunteering for a study have a 
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The personal characteristics of people who volunteer for research is becoming 
increasingly important to behavioral scientists. This study compared the intel- 
ligence of prison inmates who volunteered for medical experiments with that 
of inmates who did not volunteer. The results showed that groups of volunteers 
for two individual experiments demonstrated significantly higher scores on 
the Revised Beta Examination than did nonvolunteers. The implications of 
these findings to behavioral research were examined. 


different intelligence level than that of the 
population from which they are drawn. This 
would mean that, in spite of equating experi- 
mental and control groups, decisions would 
still have to be made about the appropriate- 
ness of generalizing results to the population 
at large. 

The purpose of this study was to learn 
whether people who volunteer for research do 
differ intellectually from the rest of the popu- 
lation. Since psychological research implies to 
the ordinary individual some kind of mental 
activity, the chances of volunteers being 
brighter than nonvolunteers is increased. That 
is, people who realistically see themselves as 
less intelligent might be reluctant to embar- 
rass themselves by volunteering for a “psycho- 
logical experiment.” This study, therefore, 
looked at volunteers who came forward for a 
project which had no implication whatever of 
examining mental behavior. The volunteers 
had offered their services for a medical proj- 
ect. We were thus able to learn something 
about whether the mere idea of volunteering 
(for some kind of research) appealed more to 
people of a certain intellectual stratum. 


METHOD 


The study was performed at a federal prison in 
the southeastern United States where public health 
physicians were engaged in two medical projects: 
one on malaria and the other on gonnorhea. Each 
required that volunteers artificially contract the 
disease, and then stay in the prison infirmary until 
cured. 

Volunteers for the projects were obtained from 
the prison’s inmate population. All inmates were told 
about these projects and the need for volunteers 
during their first week at the prison. They were told 
they could volunteer any time during their sentence. 
The only conditions they had to meet were to pass 
a routine physical examination and to have shown 
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little evidence of severe disciplinary infractions at 
the prison. The latter was an attempt to avoid be- 
havioral problems in the hospital. Few men were 
ruled out on the basis of it. 

When each man finished the project in which he 
was participating, he received a small stipend. A 
letter was also placed in his record indicating that 
he had taken part in the experiment. He was in- 
formed, though, that the letter constituted neither a 
pardon nor an assurance of parole. 

The data for this study consisted of inmates 
scores on the Revised Beta Examination. Forty men 
who had volunteered for the malaria study had 
taken this test as part of the prison’s routine ad- 
mission procedure. An equal number from the gon- 
norhea study was obtained by eliminating every 
fourth score from an alphabetical list of volunteers’ 
names (totaling 49) until 40 scores were left. 
Finally, 40 nonvolunteers’ scores were gotten by 
taking every fourth score from an alphabetical list 
of the prison population. There was no overlap in 
any group. 


RESULTS 


The mean Beta scores and standard devia- 
tions of the three groups were: for malaria, 
M = 106.65, SD = 12.84; for gonnorhea, M 
= 108.90, SD = 9.11; for nonvolunteers M 
= 100.32, SD = 11.49. 

The significance of differences between 
means of each of the three groups was de- 
termined by ¢ tests. The results of the ¢ tests 
are shown in Table 1 and indicate that volun- 
teers manifest significantly higher scores than 
do nonvolunteers. 


Discussion 


The results of this study show that inmate 
volunteers had significantly higher IQ scores 
than the population from which they came. 
There is the issue of whether the IQ differ- 
ences examined here are clinically as well as 
statistically significant. It would be somewhat 
difficult to make a case for functional differ- 
ences if only one experimental group were 
tested. This study, however, compared IQ 
means from each of two independent groups 
of volunteers with the IQ mean of nonvolun- 
teers. Each experimental group was involved 
in a separate research project. The fact that 
identical findings were obtained each time 
shows that, although the IQ differences were 
not large, volunteers consistently came from 
a significantly different population. Thus, it 
is the consistency of the findings that adds 
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TABLE 1 


SIGNIFICANCE OF DIFFERENCES BETWEEN REVISED 
BETA Means ror EACH OF THREE GROUPS 


Groups t df $ 


2.36 78 .05 
3.60 78 01 
94 78 ns 


Malaria vs. nonvolunteers 
Gonnorhea vs. nonvolunteers 
Malaria vs. gonnorhea 


weight to the conclusion that the groups were 
functionally different. 

The author can suggest two possible reasons 
for why volunteers came from a more intelli- 
gent population. One is that although inmates 
were given a certain amount of assurance 
that neither experiment would jeopardize 
their long range health, only the more intelli- 
gent men may have really accepted the idea 
they would not be harmed. An even more 
plausible hypothesis grew out of a conversa- 
tion the author had with one of the volun- 
teers. The man said he had volunteered in 
order to give himself something interesting to 
do. Experiences like these experiments, he 
said, helped break up the dullness of his sen- 
tence. If other volunteers shared this point of 
view, a desire for stimulation and unique ex- 
perience was important in motivating men to 
participate. Of course, prison life is the sort 
of experience that is apt to increase the need 
for psychological stimulation. The implicit 
correlation found here between intelligence 
and desire to participate in uniquely different 
experiences is not different from the findings 
of other researchers (Keniston, 1968; Terman 
& Oden, 1947). 

Other variables related to intelligence may 
also have been influential here. Education 
and socioeconomic level are just two exam- 
ples. The volunteers may have been men of 
higher formal education, and schooling could 
have predisposed them to taking part in re- 
search efforts. In addition, the brighter men 
may have come from a generally higher socio- 
economic level. This social background may 
have left them, in spite of their criminal be- 
havior, with more of the social awareness that 
leads to volunteering for research projects. 
Unfortunately, the only data the author had 
access to were the intelligence test scores. 
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Thus, there are several hypotheses which still 
could be profitably investigated. 

For the moment though the results do show 
a clear and consistently positive relationship 
between the volunteering of prison inmates to 
be research Ss and level of intelligence. It 
would be imprudent to automatically general- 
ize these findings to all groups of volunteer 
Ss. The conditions of the situation for which 
the men here offered themselves, however, do 
give psychologists reason for thought: if 
simply volunteering for research which im- 
plies no mental activity whatever draws on a 
somewhat brighter than average group of 
people, we have reason to be concerned over 
the intellectual representativeness of volun- 
teers for psychological studies. 
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HUMOR AND SUICIDE: 


FAVORITE JOKES OF SUICIDAL PATIENTS * 


DON SPIEGEL,? PATRICIA KEITH-SPIEGEL, JOEL ABRAHAMS 
anp LIONEL KRANITZ 
Brentwood Hospital, Veterans Administration Center, Los Angeles 


During an interview 20 patients who had attempted suicide and 20 matched 
nonsuicidal patients were asked to relate their favorite joke. Jokes were told 
by 12 attempters and 9 controls. Two raters, blind to the purpose and nature 
of the study, rated each joke for direction of punishment expressed in the 

+ themes, Based on the combined chi-square probabilities associated with ratings 
by the two judges, suicidal Ss told significantly more jokes with a self- 
punishing theme than did nonsuicidal controls. 


The technique of using jokes in an attempt 
to provide clinical insights into motivational 
states on the basis of the assumption that a 
person unconsciously identifies with the cen- 
tral figure in his favorite joke has been de- 
scribed in the literature. Brill (1940) felt 
that the favorite joke was used to express 
hostile and other tendencies for which direct 
expression is restricted or entirely shut off, 
and that the content of the joke fits the 
patient’s struggle and furnishes him a certain 
amount of pleasure—whether a certain free- 
dom from anxiety or a discharge of tension— 
by making light of his difficulties. 

More recently, Zwerling (1955) noted that 
the very themes that were most provocative 
of the patient’s anxieties were precisely the 
subjects of jokes, as though to laugh at some- 
thing is to deny that it arouses anxiety. Here 
the joke may function, as Freud (1928) wrote, 
as the loftiest of the defense mechanisms 
available to man. j 

Research studies in which the favorite joke 
technique is used to assess group differences 
appear to be nonexistent. The majority of 
humor studies have focused on the responses 
of individuals or groups to the visual or audi- 
tory presentation of humor stimuli. 

In the present study the assumption was 
made that the self-destructive tendencies of 
suicidal individuals tend to manifest them- 

1 Lisa Gray Shellberg and Barbara S. Eaton served 
as raters for the study. 

2 Requests for reprints should be sent to Don 
Spiegel, Brentwood Hospital (183 B), Veterans Ad- 
ministration Center, Wilshire and Sawtelle Boule- 
vards, Los Angeles, California, 90073. 


selves in various facets of their behavior, in- 
cluding humor productions. It was further as- 
sumed that people tend to identify with the 
central figure of their favorite jokes. Thus, it 
was predicted that the thematic content of 
favorite jokes of suicide attempters would 
more frequently involve harm to the central 
figure than would the favorite jokes of non- 
suicidal controls. 


METHOD 


The Ss were 20 patients consecutively admitted 
to a neuropsychiatric hospital over a 2-month 
period because of suicide attempts and 20 patients 
from the same facility who were matched individually 
for diagnosis, age, and sex, but who had no his- 
tory of suicide attempts, threats, or ideation. Each 
group contained 17 males and 3 females between 29 
and 56 years of age. In each group were nine schizo- 
phrenic, nine depressive, and two alcoholic patients. 
Lengths of hospitalization and acuteness-chronicity 
in the two groups were similar. 

All 40 Ss were given a standard interview and then 
asked to tell their favorite joke. Their productions 
were recorded verbatim by the interviewer. Jokes 
were told by 12 attempters and 9 controls. These 
were typed on separate sheets of paper, assigned 
code numbers, and given to two psychologists to 
rate independently for direction of aggression or 
harm expressed in the jokes, using a system derived 
from the work of Rosenzweig (1945). Raters were 
blind to the purpose of the study. If the hero (or 
general theme) of the joke was outwardly aggressive, 
the joke was classified as extropunitive. Actual jokes 
given by patients are used here to exemplify the 
categories into which jokes were sorted. (Example: 
What are the things between elephants’ toes? Slow 
natives.”) If the hero or main character(s) suffered 
harm, pain, or discomfort, the joke was classified 
as intropunitive. (Example: “What did the firefly 
say as he backed into the fire? Delighted, no end.”) 
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Jf there was no aggressive or punitive theme, and if 
no harm or discomfort was suffered by any character 
in the joke, it was classified as impunitive. (Example: 
“Why does a fireman cross the street? To get to the 
other side.”) 


RESULTS 


The following frequencies of jokes assigned 
to categories for each patient group are listed 
in the order, intropunitive, impunitive, and 
extropunitive, with frequencies for controls in 
parentheses: Judge A: 7(0), 4(6), 1(3), and 
Judge B: 5(1), 2(5), 5(3). Frequencies for 
the impunitive and extropunitive categories 
were then combined into an “other” category, 


‘and for each independent rater, the chi-square 


method was used to test the hypothesis that 


‘more suicide attempters than controls told 
"jokes with intropunitive content. The ob- 


tained chi-square values for each rater were: 
Judge A: 5.47 (p< .01), Judge B: 1.09 
(p< .15). Using the method described by 
Gordon, Loveland, and Cureton (1952), a 
combined chi-square value of 13.03 was ob- 
tained which was significant at p < .02. 


Discussion 
The fact that statistical significance was 


obtained with such small groups of joke tellers 


is impressive in itself. However, since only 
slightly over half the patients produced a 
joke on request, and since it is not always 
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possible to tell whether a joke produced has 
some personal significance for the teller (or is 
merely the most recent joke he has heard or 
the only one he can remember), this tech- 
nique can at best serve as an adjunct to more 
reliable diagnostic procedures. Clinically, it 
would appear wise to attempt to obtain as 
much information as possible about the mean- 
ing of the joke to the patient, why it is his 
favorite joke, when and how he learned it, etc. 
Thus, an individual’s humor productions, like 
any other potentially projective material, may 
be considered significant only to the extent 
that it corroborates other findings in an 
assessment procedure. 
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BRIEF REPORTS 


MOTIVATIONAL PROPERTIES OF PERSONALITY CONFLICT * 


RONALD E. SMITH 


University of Washington 


A number of S-R theorists have suggested that 
conflict induces a drive state, assumed to have 
energizing properties, over and above the original 
ones motivating conflict behavior in a given 
situation. Brown and Farber (1968) have re- 
viewed a number of supportive studies indicating 
that conflict may result in an energization of 
ongoing behavior and that escape from conflict is 
rewarding. A method for assessing personality 
conflict in a number of need areas has been pro- 
posed by Worell and Worell (1965). They assume 
that conflict is indicated when an individual’s 
stated behavior choices differ from his stated 
behavior preferences, for he is indicating that 
competing dispositions are replacing supposedly 
preferred alternatives. The relatively noncon- 
flicted individual, on the other hand, tends to use 
the behaviors that he states he prefers. The 
Conflict Scale (Hays & Worell, 1967), designed 
to measure this preference-behavior differential, 
is composed of modified scales from the Edwards 
Personal Preference Schedule. The present study 
employed this scale to assess the effects of con- 
flict in the need achievement area on the learning 
of noncompetitive and competitive paired asso- 
ciates under achievement-oriented conditions. The 
Ss were male college students divided into high- 
and low-conflict groups, each composed of 14 Ss, 
on the basis of their Conflict Scale scores. All of 
the Ss were above the median in achievement 
preference. The Ss were individually tested on 
noncompetitive and competitive lists of paired 


1An extended report of this study may be ob- 
tained without charge from Ronald E. Smith, De- 
partment of Psychology, University of Washington, 
Seattle, Washington 98105, or for a fee from the 
National Auxiliary Publications Service. Order Docu- 
ment No. 00513 from the National Auxiliary Publica- 
tions Service of the American Society for Informa- 
tion Science, c/o CCM Information Sciences, Inc., 
909 Third Avenue, New York, New York 10022. 
Remit in advance $3.00 for photocopies or $1.00 for 
microfiche and make checks payable to: Research 
and Microfilm Publications, Inc. Requests for reprints 
should be sent to the author at the above address. 


associates. They were told that the tasks mea- 
sured important concept formation skills and that 
they would be given information regarding their 
level of performance. On the basis of drive 
theory, it was predicted that if conflict increases 
drive level, the high-conflict group would out- 
perform the low group on the noncompetitive 
list while performing relatively more poorly on 
the competitive list. The mean number of correct 
anticipations for the two conflict groups on the 
nine-item noncompetitive list, which was pre- 
sented five times, were: high, 34.29; low, 28.14. 
On the eight-item competitive list, presented six 
times, the means were: high, 21.29; low, 22.64. 
Analysis of variance yielded a significant Con- 
flict X List interaction (F = 6.16, df= 1/26, 
p <.02), confirming the hypothesis. The role of 
conflict in the performance differences of the two 
conflict groups is further supported by the fact 
that the groups did not differ significantly in TAT 
n Achievement or Test Anxiety, the two motiva- 
tional variables most likely to affect performance 
in such a situation, The mean TAT n Achieve- 
ment scores for the conflict groups were: high, 
4.57; low, 7.64 (t=1.19, df =26). The mean 
Test Anxiety scores for the two conflict groups 
were: high, 69.43; low, 72.71 (t= .23, df = 26). 
The results are interpreted as supporting the 
notion that conflict, defined as the simultaneous 
arousal of incompatible response tendencies, has 
motivational (drive) properties associated with it. 
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PERCEIVED MATERNAL NURTURANCE AND CONTROL OF 
PROCESS SCHIZOPHRENICS, REACTIVE SCHIZOPHRENICS, 
AND NORMALS: 


PETER A. MAGARO 


University of Maine 


The present study attempts to compare the 
amount of maternal nurturance and control as 
measured by Heilbrun (1964) in three $ groups 
—process schizophrenics, reactive schizophrenics, 
and normals. It has been reported that mothers 
of process schizophrenics tend to be more domi- 
neering, restricting, rejecting, and demanding of 
their sons than the mothers of reactive schizo- 
phrenics or normals (Rodnick & Garmezy, 1957). 
Considering these findings and the usual the- 
orizing of differential child-rearing histories for 
normals and schizophrenics, it was predicted that 
the process schizophrenics would rate their moth- 
ers as exhibiting less nurturance and more con- 
trol than the reactive schizophrenic or normals. 
The Ss included 198 male undergraduate stu- 
dents randomly selected from introductory psy- 
chology classes, 33 male process schizophrenics, 
and 33 male reactive schizophrenics. The process- 
| reactive breakdown was made on the basis of the 
Phillips Scale of Premorbid Adjustment (Phil- 
lips, 1953). The tests administered were the 
Parent Attitude Research Instrument (Schaefer & 
Bell, 1958) measuring control and the Parent- 
Child Interaction Rating Scale (Heilbrun, 1964) 
Measuring nurturance. 
As predicted, there was a significant difference 
between the three groups on the nurturance fac- 


TAn extended report of this study may be ob- 
tained without charge from Peter A. Magaro, Psy- 
chology Department, University of Maine, Orono, 
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for photocopies or $1.00 for microfiche and make 
oe Payable to: Research and Microfilm Publica- 
ie Inc. Requests for reprints should be sent to 
eter A. Magaro at the above address. 
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Northern Illinois University 


tor (F =4.71, df = 2/261, p < .01). The process 
group exhibited the lowest nurturance scores, 
differing significantly from the reactive and nor- 
mal groups who did not differ from one another. 
A difference between the normals and schizo- 
phrenics was found on the control scale with both 
schizophrenic groups exhibiting higher control 
than normals, However, the design of the Bell 
Scale, involving a multiple selection of responses, 
was felt to be subject to a response set with 
schizophrenic Ss, Therefore, the control scale was 
not seen as being a clear indicator of maternal 
control. 

It is felt that these results support the hypothe- 
sis of nurturance differences being present be- 
tween the mothers of process schizophrenics and 
mothers of reactive schizophrenics. Process 
schizophrenics report the child-rearing period as 
devoid of affection, while reactive schizophrenics, 
who had a better premorbid adjustment, rated 
their mothers similar to normals, that is, higher in 
demonstrated affection. The results also tend to 
support the process-reactive distinction in regard 
to differences in prior family interaction. 
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PERFORMANCE CHARACTERISTICS OF SCHOOL CHILDREN ON 
THE BIP BENDER TEST* 


JERRY ADAMS 
William S. Hall Psychiatric Institute 


The Background Interference Procedure (BIP) 
as applied to the Bender Test for the detection 
of organic brain disorder has been developed for 
use with adults. However, there appears to be 
an insistent need for such a technique that may 
be used with children, The purpose of the present 
study was to determine what the “normal” 
characteristics of children’s performance would 
be on the BIP Bender. 

A total of 272 presumably normal children, 
sampled from rural and small city schools, 
ranging in age from 6 to 14 years were adminis- 
tered the BIP Bender Test and a Coding test 
based on the one in the WISC battery. Mental 
ability test scores on the children were also 
available. The BIP Bender tests were scored by 
the method proposed by Canter (1968) which 
yields four scores: error scores for both standard 
Bender and BIP test administrations, and two 
BIP decrement indexes, a D score, and “number 
positive” (NP) representing the difference in total 
errors between the two test administrations and 
the number of designs worse on BIP, respectively. 

It was found that the scores on the Bender, 
BIP, and Coding tests changed with age. The 


1An extended report of this study may be ob- 
tained without charge from Arthur Canter, Univer- 
sity of Iowa Psychiatric Hospital, 500 Newton Road, 
Iowa City, Iowa 52240, or for a fee from the Na- 
tional Auxiliary Publication Service. Order Docu- 
ment No. 00515 from the National Auxiliary Publica- 
tions Service of the American Society for 
Information Science, c/o CCM Information Sciences, 
Inc, 909 Third Avenue, New York, New York 
10022. Remit in advance $3.00 for photocopies or 
$1.00 for microfiche and make checks payable to: 
Research and Microfilm Publications, Inc. Requests 
for reprints should be sent to Arthur Canter at the 
above address. 
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Bender and BIP error scores decreased fairly 
regularly with age whereas the BIP D scores 
decreased sharply around age 8 and tended to 
vary more narrowly after age 10. The correlation 
between BIP D score and age was —.25, a sig- 
nificant (p < .01) but low-order correlation. Ex- 
tremes of the intelligence range in the sample 
were compared on BIP D scores and usual 
Bender error scores. Whereas the latter showed 
significant differences between intelligence levels, 
the BIP D scores did not. Also no significant 
relationship was found between Coding scores 
and the BIP D scores. No sex differences were 
observed on the various BIP Bender scores. 

It would appear that the ability to cope with 
the BIP effect, as measured by the D score and 
NP scores, is fairly well established by the age 
of 13 and is relatively unaffected by differences 
in intelligence. Below this age, greater allowances 
need to be made for what appears to be a normal 
decrement in performance associated with the 
BIP task. This is especially true for children 
under 8 years of age. Thus one should not apply 
Canter’s criteria for brain-damage performance 
on the BIP Bender to children below the age 
of 13. The way the procedure is used with adults 
as well as some of the types of error deviations 
measured on the Bender probably need to be 
modified to make the technique more useful for 
the evaluation of children. This is a matter for 
further investigation. 
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Brief Reports 


The Journal of Consulting and Clinical Psy- 
chology will accept Brief Reports of research 
studies in clinical psychology for early publication 
without expense to the author. The procedure is 
intended to permit the publication of soundly de- 
signed studies of specialized interest or limited 
importance which cannot now be accepted because 
of lack of space. Several pages in each issue will 
be devoted to Brief Reports, published in the 
order of their receipt without respect to the dates 
of receipt of the regular articles. Most Brief Re- 
ports appear in the first or second issue to go to 
press following their final acceptance. 


An author who wishes to submit a Brief 
Report: 

1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the National 
Auxiliary Publications Service (NAPS) to insure 
indefinite availability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
charge to all who request it as long as the supply 
lasts. 

4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 
Brief Report. The Brief Report should give a 
clear, condensed summary of the procedure of 
the study and as full an account of the results 
as space permits. 


To insure that the Brief Report will be no 
longer than one printed page, its typescript, in- 
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Th the past decade, modeling or imitation 
ning has received increasing emphasis as 
| a necessary and a real component of 
peutic behavior change. Authors using 
ly differing theoretical contexts have 
ted that the psychotherapist serves as a 
lel whose function is to demonstrate desir- 
behaviors so that his patients can learn 
rough imitation (e.g., Bandura, 1965; 
laley, 1963; Jourard, 1964; Masserman, 
7; Satir, 1964). The literature is also 
plete with evidence indicating that imitation 
ing not only should but actually does 
in traditional psychotherapeutic settings 
, Matarazzo, Wiens, & Saslow, 1965; 
eflen, 1963). After demonstrating that 
form of modeling in psychotherapy is 
th inevitable and desirable, one must ask 


paper is based on a PhD dissertation sub- 
to the Department of Psychology, University 
‘Ornia, Los Angeles, 1967. The write: wishes 
express her gratitude to the chairman of her 
ral committee, Tommy Tomlinson, and to 
ara Henker, Charles Nakamura, and Tom 
for their generous help and support during 
Course of this investigation. Thanks are also 
ill Waterman Janovici and Richard Whitney, 
whose continued efforts the data collection 
this investigation would not have been pos- 
is investigation was partially supported by 
States Public Health Service Predoctoral 
wship 5 F1-MH-20, 464-04, 

Requests for reprints should be sent to Carol 
b who is now at the Fairview State Hospital, 
Mesa, California 92626. The author is also 
© University of California, Irvine. 


EFFECTS OF A MODEL AND INSTRUCTIONS 
ON GROUP VERBAL BEHAVIORS* 


CAROL WHALEN 2 
University of California, Los Angeles 


This study assessed the relative efficacy of modeling and instructional ap- 
proaches in increasing interpersonal openness in a group setting. Groups of 
four male college students participated in a leaderless discussion session after 
exposure to one of four manipulations: a film model of interpersonal openness 
plus detailed exhortative and descriptive instructions, film model plus minimal 
instructions, detailed instructions with no film, and minimal instructions only. 
It was found that Ss exposed to both the film model and the detailed instruc- 
tions tended to engage in interpersonal openness, while groups in the other 
three conditions failed to do so, devoting most of their time to impersonal 
discussion. These findings were discussed in terms of the combined role of 
understanding and disinhibition in modifying verbal response classes. 


tematically and efficiently utilized to effect 
specific therapeutic goals. 

Although a few early studies of modeling 
in therapylike situations can be found in the 
literature (e.g., Chittenden, 1942; Jones, 
1924), most of the work has been done in the 
1960s. Metz (1965) and Lovaas, Freitas, 
Nelson, and Whalen (1967) used a combina- 
tion of imitation and reinforcement to teach 
schizophrenic children to perform a variety 
of self-care and social responses. Bandura, 
Grusec, and Menlove (1967) used a modeling 
approach to help nonpsychotic children over- 
come their fear of dogs. In their discussions 
of behavioral rehearsal techniques, Gittelman 
(1965) and Lazarus (1966) have described 
further uses of modeling to modify inappro- 
priate or deficient behavioral repertoires. 

Systematic attempts to change behavior 
through the provision of models often focus 
on verbal responses. For example, an imita- 
tion-reinforcement procedure has been used 
to reinstate simple verbal behaviors in 
mute psychotics (Sherman, 1965; Wilson & 
Walters, 1966) and to initiate speech in 
schizophrenic children (Lovaas, Berberich, 
Perloff, & Schaeffer, 1966). 

There have also been a few experimental 
attempts to modify more complex aspects of 
verbal behavior through imitation procedures. 
Schwartz and Hawkins (1965), using patients 
as models and a therapist dispensing rein- 
forcements, succeeded in either increasing or 
decreasing patient affect statements. Using 
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college student Ss, Marlatt, Jacobsen, Johnson, 
and Morrice (1966) modified the degree of 
problem admission through a similar modeling- 
reinforcement procedure. And finally, Truax 
and Carkhuff (1965) have described the use 
of a tape-recorded model to facilitate the 
learning by new patients of those verbal be- 
haviors deemed desirable in a therapeutic 
encounter. 

These studies demonstrate that imitation 
can involve the learning of entire classes of 
behaviors, such as verbal “affect” statements, 
as well as simple discrete responses, such as 
“That makes me angry.” The goals of psycho- 
therapy often include modification of such 
complex classes of verbal behaviors. For ex- 
ample, one of the major therapeutic objec- 
tives has traditionally been the development 
of insight or understanding. In the 1960s the 
focus has shifted away from the private, indi- 
vidual, and toward the social, communicative 
aspects of such insight. Bradford (1964), 
Jourard (1964), and Mowrer (1964) all af- 
firm the importance of “interpersonal open- 
ness,” the readiness to convey personal infor- 
mation about one’s life, one’s feelings, and 
one’s impressions of other individuals. 

The primary objective of the present in- 
vestigation was to compare the efficacy of 
modeling and instructional approaches in ma- 
nipulating complex verbal behaviors. As it 
appears that interpersonal openness is rapidly 
becoming a major goal of many behavior 
modification approaches, this response class 
was chosen as the major dependent variable. 


METHOD 


The study consisted of a-2X 2 factorial design 
with film model (present or absent) and instructions 
(detailed or-minimal) serving as the main variables. 
In the film-detailed instructions (F-DI) condition, 
Ss were presented with instructions describing inter- 
personal openness and directing them to engage in 
this type of interaction. In addition, these Ss viewed 
a film in which a group of individuals demonstrated 
or modeled the desired behaviors. This manipulation 
was seen as the condition of maximal influence, 
combining’ the informational and motivational cues 
provided by both models and instructions. - 

The Ss in the film-minimal instructions (F-MI) 
condition viewed the same film presented to F-DI 
Ss but received neither a description of interpersonal 
openness nor an exhortation to- engage in such be- 
havior. The intent of this manipulation was to 
provide evidence as to whether adults, like children, 
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will imitate complex and possibly embarrassing 
behaviors after merely observing models engage in 
such behaviors. In other words, can adult Ss discern 
what is expected of them and will they fulfill these 
expectations when a model is presented in the 
absence of exhortative or descriptive instructions? 

In real-life settings verbal instructions of both a 
descriptive and a directive nature are often used 
alone to effect desired behavior change. In order to 
compare this procedure with the modeling approach, 
a third condition labeled no film-detailed instructions 
(NF-DI) was included. In this condition, Ss received 
instructions similar to those presented in the F-DI 
condition but were not exposed to the film model. 

A fourth condition was included to provide a 
comparison level, delineating the verbal behaviors 
which occur in a group of individuals, meeting each 
other for the first time, when no definite structure 
is provided. The Ss in this fourth condition 
(NF-MI) received minimal instructions and no 
exposure to the film model. g 

On the basis of previous findings (Bandura, 1965) 
and pilot data, it was predicted that compared to 
the other three conditions, the F-DI manipulation 
would result in the greatest amount of interpersonal 
openness. Analogously, Ss in the F-MI, NF-DI, and 
NF-MI conditions were expected to engage in sig- 
nificantly more impersonal interactions than those 
exposed to the F-DI manipulation. No specific dif- 
ferences were predicted among the F-MI, NF-DI, 
and NF-MI conditions. 


Subjects 


The Ss were 128 male students who volunteered 
to participate in a group dynamics experiment in 
partial fulfillment of their introductory psychology 
course requirements. They participated in groups of 
four, each group consisting of members who had not 
previously met each other. The groups were ran- 
domly assigned to one of the four experimental 
conditions. 


Film Model 


The model consisted of a 12-minute, untitled, 
16-millimeter film produced for this study by the 
Academic Communications Facility at the University 
of California, Los Angeles. A film was used in pref- 
erence to live models so that all Ss would be exposed 
to an identical manipulation. Data from previous 
studies (e.g, Bandura, Ross, & Ross, 1963; Harby, 
1962) indicate that film models are as potent as live 
models in eliciting imitative behavior. 

In the experimental film a group of four male 
undergraduates conversed after implying that they 
had just. met. Two of the film participants spent 
a few seconds conveying biographical information, 
and then a third member, interrupting these inter- 
changes, maintained that the group members -were 
not using the- best method of getting to know each 
other. For the remainder of the time the participants 
talked on a more personal level, describing their 
anxieties and other feelings, and feeding back their 


whether positive, negative, or neutral— 
‘group members. 

the Ss was acquainted with any of the 
ts in the film.. Each film participant had 
attended and led. sensitivity training ses- 
thus -was experienced in the types of 
tions focused on in this study. : 
pts were made to make the film group ap- 
‘similar as possible in composition to the S 
(eg, in age, sex, and dress). Similarity 
‘Ss and models was maximized because of 
ibility that large S-model differences might 
5 to conclude that behavior matching was 
propriate, unnecessary, or impossible (Goldstein, 
T, & Sechrest, 1966; Hollander, 1958). 


‘tions 

her or not they were exposed to the film, 
received some form of verbal instruction. 
ions for each condition were tape recorded 
to control for possible intersession variabil- 


male E made the recordings, The detailed 
nimal instructions follow. 


m—detailed instructions condition: This is 
up dynamics laboratory. It’is a rare oppor- 
y to improve your ability to communicate 
others by learning how you appear to them; 
‘is, how you present yourself and what im- 
essions they form about you. To accomplish this, 
is important that you be open and honest with 
members of the group and discuss freely 
ir immediate feelings, attitudes, and opinions, 
th about yourself and others in the group. The 
Ormation used to form your impressions of the 
“group members will come from their ap- 
ice, words, and behavior. We want you to 
back your impressions to the others as soon 
ie impression is formed in your mind. This 
be difficult, but it is essential to obtain the 
um benefits from the experience. The 
unt you learn about yourself will depend on 
extent to which you participate in the group. 
) show you what we mean by open discussion 
elings and feedback of impressions, we are 
to show you a film of a group similar in 
sition to yours. The group members in this 
Provide excellent examples of desirable types 
toup behavior. You will note that they are 
talking about themselves. In ‘addition, they 
express their immediate impressions of other 
S of the group, and each member makes 
effort to accept and examine the truth 
of the impression as it applies to him. 
e film you will form your own group, 
Ould like you to try and interact in a 
i similar to that of the film group. No ex- 
Perinienter will participate in your group, but an 


4 bserver will be ‘watching through the one-way- 


Or in order to record the interaction variables 
' examining. i 

er you have seen the film, you may begin. 
Will be notified when your time is up. Re- 
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member, it is important that you openly discuss 
your feelings and immediate impressions of others 
in the group, as well as invite similar comments 
„about yourself. - ed) AEO H 

_ .Film-minimal. instructions condition: This 
_is a group dynamics laboratory. We are interested 
in examining some variables important in the de- 
velopment of group interaction. To enable us to 
study these variables we will ask you to interact 
as a group, but before you begin we are going to 
show you an example of what group interaction 
looks like. You will see subjects in a situation 
similar to the one in which you will find your- 
selves. After the film is over you may begin your 
own interaction. No experimenter will participate 
in your group, but an observer will be watching 
through the one-way mirror in order to record 
the interaction variables we are examining, You 
will be notified when your time is up. 


Instructions for the two no-film conditions were 
identical to those for the respective film conditions 
except that all references to the film were deleted. 


Procedure 


The study was conducted in two adjoining rooms. 
The Ss entered a room containing four chairs placed 
around a circular table such that when seated all Ss 
were at least partially facing a one-way observa- 
tion window. The E and raters recorded their obser- 
vations from the adjoining room. The situation was 
arranged so that the observers both saw and heard 
the Ss, while the Ss heard nothing from the adjoining 
room and saw only a mirrror. 

The Ss were met by one of four male Es when 
they entered the experimental room and asked not 
to talk to the other group members before the 
experiment began. When the four Ss arrived, E 
played the tape-recorded instructions and, if appro- 
priate, ran the film of the modeled interaction. 

Following the experimental manipulation, Æ dis- 
tributed 5X 8 inch white index cards and a grease 
pen to Ss, requesting that each S write his first name 
on the card, fold it in half, and stand it up on the 
table in front of him so that the card was visible 
to the other Ss. The Æ then left the room and Ss 
began their conversation. After approximately 4C 
minutes had elapsed, E reentered the room to termi- 
nate the interchange, to request that Ss not discuss 
this experiment with their friends, and to thank them 
for their cooperation. 


Dependent Variables 


The dependent measures consisted of continuous 
ratings of five major verbal response classes, The 
classes were selected so as to include the behaviors 
modeled in the film as well as those which typically 
occur in newly formed groups. Four of the response 
classes were further-broken down into component 
categories in order to allow a more precise analysis 
of the group verbal behaviors which occur in the 
various conditions. In all, 18 response categories 
were recorded, 
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The first two response classes, labeled “personal 
discussion” and “feedback,” include the primary 
types of interaction modeled in the film. These two 
classes together comprise the more general response 
class labeled “interpersonal openness.” The third 
response class, “impersonal discussion,” is composed 
of verbal behaviors, such as conveying and request- 
ing biographical information, which spontaneously 
occur quite frequently in newly formed groups. 
These three response classes were considered the 
primary foci of this investigation, since it was 
assumed that Ss who imitate or follow instructions 
would make utterances categorized as either personal 
discussion or feedback, while those who fail to 
imitate or follow instructions would engage in the 
conventional verbal behaviors categorized as imper- 
sonal discussion. 

In order to obtain a comprehensive picture of the 
group interaction, two additional response classes 
were included in the behavioral analysis, although 
no specific predictions were advanced regarding their 
occurrence. One of these classes, labeled “group 
process,” included all general discussions about the 
experimental manipulations and procedures. The 
remaining response class included several descriptive, 
noncontent aspects of communicative speech which 
frequently occur in a group setting. 

All verbal responses which were either inaudible, 
unintelligible, or difficult to classify were included 
in a separate category which was labeled “unscore- 
able utterances.” 

A more precise description of each response cate- 
gory follows. 

1, Personal discussion. (a) Personal self-disclosure: 
Discussion of feelings, attitudes, and behaviors un- 
related to current group interaction which are non- 
public, not ordinarily readily volunteered, and which 
may make the individual vulnerable to negative 
evaluations from others. Examples of actual com- 
ments categorized as personal self-disclosure are “I’m 
a status seeker like a lot of people,” “I might even 
try homosexuality,” or “I just broke down and 
started yelling and screaming.” (b) Immediate feel- 
ings: Discussion of feelings and attitudes related to 
current group interaction which are relatively pri- 
vate, not ordinarily volunteered by most people, and 
which may make the individual vulnerable to nega- 
tive evaluations from others (e.g., “I feel alienated 
here” or “Now I’m embarrassed”). (c) Personal 
questions: Questions relating to an individual’s opin- 
ions, feelings, or actions which are usually considered 
to be private information: “Have you always been 
heavy?” or “I wondered how you felt when we were 
so silent.” 

2. Feedback. (a) Positive feedback: The conveying 
of an individual’s positively valenced impressions of 
or reactions to the comments, appearance, or actions 
of another group member. This category includes 
compliments, flattery, etc. Examples are “I like the 
way you said that” or “You seem like an alert, 
intelligent guy.” (b) Negative feedback: Negative 
evaluation of the personality, intelligence, actions, 
etc., of another group member. This category includes 


CAROL WHALEN 


hostility, ridicule, criticism, and implications that the 
individual is not telling the truth. Examples of 
comments rated as negative feedback are “You were 
a fool,” “Big deal!” or “Your argument isn’t very 
rational.” (c) Neutral feedback: This category in- 
cludes comments about a group member that cannot 
be evaluated as either positive or negative, as well 
as direct advice from one group member to another. 
Some examples are “You seem like the silent type” 
or “You ought to go up and tell her.” (d) Accept 
feedback: Acceptance by a group member of the 
positive, negative, or neutral feedback he receives 
from other group members, for example, “Yeah, I 
guess you're right.” (e) Reject feedback: Refusal to 
accept the feedback about oneself provided by other 
group members, for example, “No, you see it’s really 
like this . . . .” (f) Request feedback: A direct re- 
quest that the group or particular members provide 
S with an evaluation of himself, for example, “What 
do you think of me?” 

3. Impersonal discussion. (a) Impersonal self- 
disclosure: Nonpersonal biographical information 
about oneself which is either generally accessible or 
readily volunteered in appropriate contexts by most 
people. Examples of this category include “I came 
from Chicago” or “I don’t know the outskirts of 
the city well.” (b) Extragroup process: Nonpersonal 
topics unrelated to the experiment or the group 
interactions—“All universities don’t accept correspon- 
dence courses” or “I know a guy who is in the 
Navy now.” (c) Impersonal questions: Questions 
regarding public, easily accessible, readily volun- 
teered information about a group member—“Have 
you ever seen the San Diego campus?” or “How 
many units are you taking?” 

4, Group process. Nonpersonal discussion relevant 
to the experiment such as comments regarding the 
experimental manipulations, the purposes of the ex- 
periment, the behavior of the Zs or film participants, 
the delineation of a plan of action, etc. Examples: 
“Are they watching us?” “I guess they just want us 
to act normal” or “Let’s all talk at the same time— 
that ought to goof them up.” 

5. Descriptive aspects of communicative speech. 
(a) Agreement: Any verbal indication of agreement, 
for example, “Uh huh” and “Yeah, that’s right!” 
Head nods are not scored. (b) Disagreement: Any 
verbal indication of disagreement, for example, “NO” 
and “That’s not true!” Head nods are not scored. (c) 
Laughter: Any amount of audible giggling or laugh- 
ter by one or more group members. Smiles and grins 
are not scored. (d) Silence: This category is scored 
only if the duration of the silent period exceeds 3 
seconds. (e) Interruption: Scored whenever one 
group member stops another from talking or when 
two or more individuals are speaking simultaneously. 

6. Unscoreable utterances. A scoring category for 
those remarks which are either inaudible, unintel- 
ligible, or too difficult to categorize immediately. 


Rating Procedure and Apparatus 


For 26 of the 32 groups, two trained raters inde- 
pendently categorized each utterance that occurred 
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during the discussion, Each rater used an Esterline- 
Angus event recorder and a 12-button operating 
panel. This apparatus provided a continuous record 
of both the frequency and the duration of each 
utterance in each category. The resultant scores 
describe total group interaction; the verbalizations of 
individual group members were not rated separately. 


_ Analysis of Data 


The frequency scores presented in the following 
analyses are actual frequency counts of each type 
of occurrence. The duration scores are not actual 
time periods. The exact number of seconds devoted 
to each response category can be obtained by multi- 
plying each duration score by a factor of 7.5. In the 
statistical analyses the “unscoreable utterances” were 
considered a separate category, thus bringing the 
total number of categories to 19. 

There was some variation among groups in total 
interaction time. In order to make total scores com- 
parable, the groups were equated by using the length 
of the shortest group as a cut-off criterion. Thus 
only data derived from the first 36 minutes of each 
group interaction were entered into the analyses. 

Reliability. In order to assess the relationship 
between each rater’s scores, a reliability analysis was 
performed on data from those groups for which 
two sets of ratings were obtained. Pearson product- 
moment correlation coefficients were computed from 
total scores for each group. The analysis was done 
separately according to condition, category, and type 
of score (frequency or duration), giving a possible 
total of 4X 19X2 or 152 analyses. However, in 
those cases in which there were fewer than two 
pairs of scores, the correlation coefficient was not 
computed since it would have no meaning. This 
criterion resulted in the elimination of 16 or ap- 
proximately 10% of the possible correlation co- 
efficients, leaving a total of 136 reliability measures. 

After the correlation coefficients were computed, 
they were transformed, using Fisher’s s coefficients, 
so that they could be averaged across conditions 
within each category. The s scores were weighted 
according to the number of groups in each condition, 
and then the arithmetic mean of the 3’s across condi- 
tions was found, Finally, the mean 2’s were con- 
Verted into their corresponding correlations (Guil- 
ford, 1950). These final correlations were then used 
in evaluating the reliability of the rating system. A 
Mean correlation coefficient of .75 was selected as 
the requisite minimum for acceptable reliability. Data 
from any category for which the mean correlation 
failed to reach this level were excluded from subse- 
quent analyses, 
ister, transformation and averaging, the inter- 
More Geen coefficients ranged from —.11 to 1.0. 
83% e an 60% of these coefficients reached 90, and 
agin Te .74. After transformation and aver- 
1 ee ae correlation coefficients ranged from .10 
92% 3 ixty-six percent were at or above .90, and 
Gat Ta .75, Frequency data from the category 
the E ance of feedback” and duration data from 

Wo categories “group feelings” and “agreement” 


513 


were not considered. These data were omitted be- 
cause interrater reliability did not reach the pre- 
established criterion of r=.75 for these three 
categories. 

In summary, these data clearly indicate that the 
system of content analysis devised for and used in 
this investigation can be reliably applied by trained 
raters. For 15 of the 18 response categories, the 
interrater reliability indexes reached the preestab- 
lished criterion of r=.75 for both frequency and 
duration scores (see Whalen, 1967). 

Mean proportions. In order to provide a descrip- 
tive comparison of the differential occurrences of 
each category in the four conditions and in the 
actual film, mean proportions were computed. For 
a given condition, the means for all reliable cate- 
gories were first totaled, and then each individual 
category mean was divided by this total mean. This 
analysis was performed separately on the data from 
the film model as well as from each of the four 
experimental conditions. Separate analyses were done 
for each rater and for each type of score. Overall 
proportions for each response class were computed 
in a similar manner. 

Statistical analyses of group differences. The effects 
of the film and instructions manipulations were more 
formally assessed using Winer’s (1962) procedure for 
two-way analyses of variance with unequal cell fre- 
quencies. Separate analyses were performed for each 
rater, for each type of score (frequency and dura- 
tion), and for each reliable category. The main 
reason for separately analyzing the data from each 
rater instead of including the rater variable as a 
third factor in the design was that the two raters 
did not always rate the same S groups. 

Method of analysis. In order to facilitate these 
analyses, the frequency and duration data from the 
event recorders were coded into a digital form and 
punched onto computer input cards. Fortran pro- 
grams were written for the reliability and factorial 
analyses. The data were processed on an IBM 1410. 


RESULTS 


Figures 1 and 2 illustrate the proportion of 
total verbalization frequency and duration 
(averaged across raters) devoted to each 
verbal response class in the actual film and 
by the groups participating in each of the 
four experimental conditions. They indicate 
that certain quantitative and qualitative as- 
pects of group verbal behavior are strongly 
influenced by the joint presentation of in- 
structions and a film model. It can be seen 
that F-DI groups engaged in more personal 
discussion and feedback than did Ss in any 
of the other three conditions. The proportions 
of total verbalization frequency and duration 
devoted to these two response classes by Ss 
in the F-DI condition approached the actual 
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F10, |. Frequency scores: Relative proportion of total interaction devoted to each response class, (Code: 
Film = film model; F-DI = film-detailed instructions; F-MI = film-minimal instructions; NF-DI = no film- 
detailed instructions; NF-MI = no film-minimal instructions; PERS DISC = personal discussion; IMPERS 
DISC = impersonal discussion; DESCRIP = descriptive aspects; UNSCORE = unscoreable utterances.) 


film proportions, while the corresponding 
statistics for the remaining three conditions 
were markedly lower. These three latter con- 
ditions differed little among themselves in 
total time spent in personal discussion and 
feedback. 

The proportion of total interaction time 
devoted to impersonal discussion by the F-DI 
groups also approached the analogous propor- 
tions for the actual film model. As can be seen 
from Figures 1 and 2, F-MI, NF-DI, and 
NF-MI groups engaged in markedly more 
impersonal verbalization than did the actual 
film group or Ss in the F-DI condition. Again, 
there was little variation in these scores 
among the F-MI, NF-DI, and NF-MI 
conditions, 

In summary, relative to the other three 
conditions, the F-DI groups more closely 
matched the actual film model in the three 
response classes for which experimental pre- 
dictions were advanced, namely, personal dis- 
cussion, feedback, and impersonal discussion, 
In contrast, only minor differences were noted 
among the actual film and the four experi- 


mental conditions regarding the proportion of 
total frequency and duration devoted to the 
remaining response classes, that is, group 
process and descriptive aspects, and to the 
unscoreable utterances category, 


Statistical Analysis of Film and Instructions 
Effects 


Tables 1-4 present for each condition the 
mean frequency and duration scores ascribed 
to the individual categories by each of the 
two raters. These tables also summarize the 
tests of statistical significance, The following 
discussion is based on these means and the 
results of 12 separate F tests for each cate- 
gory: tests on both main effects (film and 
instructions) as well as the interaction for 
each rater and for each type of score. 

Personal discussion. As shown in Table 1, 
the overall personal discussion frequencies 
and durations of the F-DI groups were two 
to eight times as large as those for groups 
in the other three conditions. Analyses of 
variance revealed a significant instructions 
variable in all but one case, regardless of 
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SCORES 


DURATION 


PERS FEEDBACK IMPERS GROUP DESCRIP UNSCORE 
DISC DISC PROCESS 


. 2, Duration scores: Relative proportion of total interaction devoted to each response class. (Code: 

film model; F-DI = film-detailed instructions; F-MI = film-minimal instructions; NF-DI = no film- 
led instructions; NF-MI = no film-minimal instructions; PERS DISC = personal discussion; IMPERS 
= impersonal discussion; DESCRIP = descriptive aspects; UNSCORE = unscoreable utterances.) 


TABLE 1 


MEAN FREQUENCY AND DURATION SCORES AND ANALYSES OF VARIANCE 
FOR THE PERSONAL Discussion RESPONSE CLASS 


Condition means Analyses of variance 
Score Rater Film ‘ eactthnn FXI 
F-DI F-MI | NF-DI | NF-MI | on E ea 
E lope Flo] Fle 
Frequency 1 12.3 3.0 6.2 27 1.87 | ns | 7,65 | .05) 1.55] ns 
2 18.6 3.1 5.7 37 4.13} 10} 8.35] .01] 4.86 | .05 
Duration 1 23.9 2.6 7A 2.9 3.81 | .10| 9.58|.01| 4.01 |.10 
2 34.2 4.1 79 10.4 2.80 | ns | 5.34] .05| 7.40) .05 
‘Ammedi Frequency 1 2.5 A al 0.0 | 18.30 | .01 | 18.30 | .01 | 14.82 | .01 
À 8 2 4.0 A 0.0 0.0 8.61| .01| 5.60] .05| 5.60) .05 
p Duration 1 Omitted because of unreliability 
» y 2 Omitted because of unreliability 
: Frequency 1 4.2 1.1 2.0 A | 3.92) 10} 10.17 | 01} 1.03 | ns 
“Westion 2 6.6 6 1.0 ja 5.19 | .05| 6.93 |.05| 5.56 | .05 
aN Duration 1 2.0 A 8 .1 | 7.22 | 05 | 16.54) 01) 1.94) ns 
2 25 4 A 2 3.72 | 10} 4.00 | .10| 2.86) ns 
| 


NENTE E:DI = film-detailed instructions; F-MI = film-minimal instructions; NF-D] = no film-detailed instructions; 


= no film-minimal instructions, 
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TABLE 2 
MEAN FREQUENCY AND DURATION SCORES AND ANALYSES OF VARIANCE 
FOR THE FEEDBACK RESPONSE CLASS 
Condition means Analyses of variance 
Category Score Rater Film {Instructions} F XI 
F-DI | F-MI | NF-DI | NF-MI 5 
F |? F HIF |p 
Positive Frequency 1 8.7 6 1.6 0.0 5.72 | .05 | 9.09 | .01 | 4.00 | .10 
feedback 2 8.7 7 =} 0.0 | 14.02 | .01 | 11.33 | .01 | 10.01 | .01 
Duration 1 9.7 3 ual 0.0 4.68| .05| 6.62] .05| 4.04] .10 
2 9.3 5 Y 0.0 8.08 | .01 | 7.08 | 05| 6.57; .05 
Negative Frequency 1 7.0 1.0 A 7 | 21.91} .01 | 14.70 | .01 | 18.44 | .01 
feedback 2 7.0 13 a) 8 | 10.43] .01| 17.99 | .01 | 7.76 | .01 
Duration ci 7.2 6 oh 6 | 10.38] .01} 7.36] .05 | 10.32 | .01 
2 9.2 1.3 a0) 5 9.62| .01| 6.51] .05] 6.76] .05 
Neutral Frequency 1 17.7 3.6 4.6 4.0 8.70 | .01 | 11.67 | .01 | 9.76 | .01 
feedback 2 18.9 4.9 6.7 7.0 3.80|.10| 7.24|.05| 7.77} .01 
Duration 1 18.4 21 3.4 3.3 9.18 | .01 | 13.10 | .01 | 12.66 | .01 
2 18.6 3.1 5.2 5.3 4.33 | .05 | 8.24] .01| 8.45] .01 
Request Frequency 1 3.0 AS 0.0 =p 16.73 | .01 | 14.04 | .01 | 17.13 | .01 
feedback 2 2.6 0.0 0.0 «3 | 13.62} .01 | 13.62 | .01 | 22.70 | .01 
Duration 1 2.1 Bf 0.0 .1 | 12.82} .01 | 11.29 | .01 | 14.16 | .01 
2 1.7 0.0 0.0 -2 | 12.50] .01 | 12.50 | .01 | 18.69 | .01 
Accept Frequency 1 Omitted because of unreliability 
feedback 2 Omitted because of unreliability 
Duration 1 1.2 iL aA -1 | 13.59 | .01 | 11.94 | .01 | 12.32 | .01 
2 21 aL Si -1 | 17.63 | .01 | 19.63 | .01 | 18.06 | .01 
Reject Frequency 1 1.5 4 SE 0.0 9.59 | .01 | 4.90) .05] 3.13 | .10 
feedback 2 1.9 3 A 5 3.12 | .10 | 4.04|.10| 5.54] .05 
Duration 1 1.1 aA A 0.0 9.83 | .01 | 7.07 | .05 | 5.38 | .05 
2 1.1 a$ A 3 3.71) 10] 3.85] .10] 7.24] .05 


Note.—F-DI = film-detailed instructions; F-MI = film-minimal instructions; NF-DI = no film-detailed 


NF-MI = no film-minimal instructions. 


category, score, or rater. The one exception 
was that for the personal question duration 
scores of Rater 2, differences in the instruc- 
tions dimension only approached significance. 
The film variable was significant only for 
group feelings frequency scores of both 
raters, personal question frequency scores of 
Rater 2, and personal question duration 
scores of Rater 1. Rater 2’s frequency and 
duration scores for the personal self-disclosure 
category led to a significant interaction, as 
did both raters’ frequency scores for the 
group feelings category. The only other sig- 
nificant interaction effect was found for 
Rater 2’s personal question frequency scores, 


instructions ; 


Feedback. As with the personal discussion 
categories, feedback occurred markedly more 
often in the F-DI condition than in any of 
the three remaining conditions (see Table 2). 
The F-DI Ss engaged in feedback as much as 
eight times as often and for 12 times as long 
as did Ss in the F-MI, NF-DI, and NF-M1 
groups. 

For the film variable, 19 of the 22 F tests 
were significant, and the remaining three ap- 
proached significance. Similarly, 20 of the 22 
F tests for the instructions variable and 19 
of the 22 F tests for the interaction were 
significant, while the results of the remaining 
tests approached significance. 
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TABLE 3 


MEAN FREQUENCY AND DURATION SCORES AND ANALYSES OF VARIANCE 
FOR THE IMPERSONAL Discussion RESPONSE CLASS 


| Condition means Analyses of variance 
| 
Category | Score Rater Film {Instructions} F X I 
| F-DI | F-MI | NF-DI | NF-MI 
nepal le F b F |p| F | > 
Impersonal Frequency 1 20.8 44.1 32.2 35.6 .08| ns | 6.72] .05| 3.78 | .10 
self- 2 25.9 46.3 33.4 34.7 16] ns | 4,53) 05} 3.51] .10 
disclosure | Duration 1 25.7 62.5 62.0 49.3 | 1.22| ns | 1.33 | ns | 5.60 | .05 
| 2 32.9 67.8 65.4 49.5 -30 | ns .53 | ns | 3.80 | .10 
Extragroup | Frequency 1 13.3 39.2 33.9 43.0 8.42 | .01 | 17.52 | .01 | 4.02 | .10 
process | 2 18.5 38.0 35.0 38.8 | 5.22|.05| 9.46 |.01| 4.26) .05 
| Duration 1 32.2 142.6 | 153.7 | 155.0 |19.15 | .01 | 13.34 | .01 | 12.72 | .01 
| 2 42.7 138.8 | 151.1 | 160.9 | 18.60} .01 | 12.25 | .01 | 8.13 | .01 
Impersonal | Frequency 1 11,2 24.6 17.6 11.3 1.02 | ns | 1.09| ns | 8.46 | .01 
question 2 11.5 25.3 17.1 13.0 | 1.12| ns | 2.35| ns | 8.07 | .01 
Duration 1 3.5 17 5.7 3.2 | 1.08] ns 55 | ns | 9.07 | 01 
We 3.0 6.5 44 3.6 -76 | ns | 2.43| ns | 6.29) .05 
| 
Note.—F-DI = film-detailed instructions; F-MI = film-minimal instructions; NF-DI = no film-detailed instructions; 
NF-MI = no film-minimal instructions. 
Impersonal discussion. As can be seen from Differences due to the film variable reached 


Table 3, means for this response class were a significant level only for the extragroup 
generally the reverse of those for the personal process category (for both types of scores and 
discussion and feedback categories. Relative both raters). The instructions variable was 
to the F-DI groups, F-MI, NF-DI, and also significant for all four tests in the extra- 
NF-MI Ss engaged in impersonal discussion group process category. In addition, the im- 
about twice as often and for about three times personal self-disclosure frequency scores for 
as long. Most of the comments made by Ss both raters were significantly different on the 
in the F-MI, NF-DI, and NF-MI conditions instructions dimension. Tests of the inter- 
were classified as impersonal discussion, while action revealed a significant effect for the 
fewer than a third of the utterances of F-DI impersonal question category regardless of 
Ss were scored in this response class. rater and score. The interaction effect for 


TABLE 4 


MEAN FREQUENCY AND DURATION SCORES AND ANALYSES OF VARIANCE 
FOR THE GRouP Process Response CLASS 


Condition means Analyses of variance 


Category Score Rater Film [Instructions|. F X I 
F-DI F-MI | NF-DI | NF-MI 
| SF) e A E a AA aiaga pari 


| 
Group F. 7 1 20.8 | 10.7 6.6 9.9 | 4.70|.05| .97 | ns | 3.65 | .10 
piocs Sie “bis 2 18.9 11.0 44 9.0 | 6.90|.05| .27 | ns a 10 
i 493 | 246 | 19.1 | 161 | 4.68] .05| 2.42 | ns | 1.48 | ns 
fate , 470 | 249 | 11.3 | 147 | 11.08] .01) 1.83 | ns | 3.42 | 10 


Note.—F-DI = film-detailed instructions; F-MI = film-minimal instructions; NF-DI = no film-detailed instructions; 


NF- 
NF-MI = no film-minimal instructions. 
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the extragroup process category was also 
significant for Rater 2’s frequency scores and 
the duration scores of both raters. All tests 
of the interaction which did not reach a 
significant level did approach significance. 

Group process. It can be seen from Table 4 
that F-DI Ss engaged in somewhat greater 
amounts of group process verbalization than 
did Ss in the other three conditions. The 
film model variable was significant for the 
frequency and duration scores of both raters; 
the interaction analysis of frequency scores 
from each rater and duration scores from 
Rater 2 approached significance. 

Descriptive aspects, This response class in- 
cludes about 20-30% of all interactions in 
each condition. The mean frequency and 
duration scores for the four conditions are 
generally quite similar. Only one of the 54 
analyses of the data from the five categories 
comprising this response class reached a 
significant level (Rater 1’s frequency scores 
for the agreement category), indicating that 
neither the film nor the instructions influ- 
enced this response class. 

Unscoreable utterances. According to the 
scores of Rater 1, this relatively rare cate- 
gory occurred more often and for a longer 
period of time in the F-DI condition than 
in the other three conditions. Significant dif- 
ferences were found in the instructions analy- 
sis of Rater 1’s frequency and duration data. 
None of the other analyses of data from 
either rater revealed significant differences in 
this category. (See Whalen, 1967, for the 
means and F values for the descriptive and 
unscoreable response classes.) 

In summary, for all 12 categories com- 
prising the three critical response classes 
(personal discussion, feedback, and imper- 
sonal discussion), some of the F tests reached 
significance. However, the categories varied 
as to whether the significant effect rested in 
the instructions, the film, the interaction, or 
some combination of these three factors. As 
previously pointed out and illustrated in the 
tables, the major group differences appear 
primarily due to the F-DI condition alone. 
Compared to the relative magnitudes of F-DI 
scores, differences among scores in the other 
three conditions seem unimportant, 


Discussion 


The data obtained from this experiment 
indicate that exposure to a filmed model of 
interpersonal openness, when preceded by 
detailed exhortative and descriptive instruc- 
tions, facilitates the expression of interper- 
sonal openness and inhibits impersonal dis- 
cussion in the observers. When presented 
without the other, neither the film nor the 
detailed instructions has these facilitative and 
inhibitive effects. In fact, the performances 
of Ss who were exposed to the instructions 
with no film and to the film with only mini- 
mal instructions were markedly similar to 
those of control Ss who were exposed to 
neither the film nor the detailed instructions. 

In addition to the predicted effects, it 
was found that F-DI as well as F-MI Ss en- 
gaged in more group process than did Ss in 
the remaining two conditions (see Table 4). 
It is likely that this effect resulted partially 
from the novel and unexpected stimuli pro- 
vided by the film, leading Ss exposed to the 
film to spend more time discussing the ex- 
perimental manipulations than Ss receiving 
only the more traditional tape-recorded in- 
structions. Stated simply, Ss viewing the film 
had more to talk about regarding the 
experiment. 

It is interesting, however, that the group 
process engaged in by Ss in the two film 
conditions differed in quality. It appeared 
that F-MI groups spent significant portions 
of time criticizing the film participants and 
justifying their own failure to imitate, while 
the F-DI Ss seemed to be concerned with 
checking their own behavior against that of 
the film participants in order to see how 
close a match they were achieving. Unfortu- 
nately, no objective measures of these two 
types of group process were taken. 

According to accepted definitions of imita- 
tion (e.g., Bandura & Walters, 1963), it can 
be said that F-DI Ss imitated—they matched 
the behavior of a model. The fact that F-MI 
Ss failed to imitate raises many interesting 
questions. A survey of the imitation litera- 
ture (e.g., Bandura et al., 1963) reveals that 
experimental manipulations similar to those 
of the F-MI condition are usually sufficient 
to elicit modeling. One can use either a 
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motivational or a cognitive framework, or a 
combination of the two, for explaining the 
differences between F-DI and F-MI Ss. 


Motivational Framework: The Inhibition 
Hypothesis 


There are three alternative hypotheses that 
could be advanced within a motivational 
framework: (a) The act of imitation itself 
may have been inhibited; Ss could have felt 
that they would be cheating or unmanly if 
they matched the demonstrated behaviors. 
(b) F-MI Ss may have failed to engage in 
interpersonal openness because impersonal 
discussion is the customary verbal behavior 
in a new group, and the Ss had no reason to 
exert the extra energy needed to change these 
behaviors (“Law of Least Effort”). (c) The 
expression of interpersonal openness may 
have been perceived as risky and potentially 
embarrassing. 

The first two explanations, although fea- 
sible, seem unlikely on both logical and 
empirical grounds (see Whalen, 1967, for 
a further discussion of these notions). The 
embarrassment explanation appears more 
plausible. The assumption is that interper- 
sonal openness is often perceived as risky 
because it depicts the individual as weak and 
inadequate (Lakin & Carson, 1966). Because 
of this perceived risk or fear of punishment, 
the expression of interpersonal openness is 
inhibited. One could postulate that both the 
detailed instructions and the film model 
Served to partially disinhibit interpersonal 
openness. The instructions depicted this be- 
havior as highly desirable; they suggested 
that interpersonal openness was not only 
Permissible, but also expected in this setting. 
The film further disinhibited Ss by demon- 
Strating that the consequences of engaging 
in self-disclosure and feedback are positive 
rather than negative. It could be that, while 
neither of these two “disinhibitors” was itself 
potent enough to counteract the fear of 
Punishment, the combined effect of the two 
was sufficient for this purpose. 

The F-MI Ss, lacking the external induce- 
ment provided by detailed instructions, ap- 
parently created their own standard of behav- 
lor within the experimental session. Indeed, 
it appeared that in the F-MI groups, Ss often 
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spent some time criticizing the behavior of 
the film participants and developing argu- 
ments contraindicating the expression of 
interpersonal openness. These conversations, 
which generally occurred early in the group 
session, appeared to be attempts on Ss’ parts 
to convince themselves and each other that 
imitative behavior in this instance was in- 
advisable. This speculation regarding explicit 
and implicit inducements is consistent with 
Berelson and Steiner’s (1964) conclusion that 
the less definite the standards external to 
the group, the more control the group itself 
can exercise. - 


Cognitive Framework: The Ignorance 
Hypothesis 


Rather than focusing on the motivational 
aspects of imitation, one could view the re- 
sults of this study within an attention- 
information context. Thibaut and Kelley 
(1959), for example, have detailed a norm- 
sending process consisting of three compo- 
nents: rule statements, surveillance-evalua- 
tion, and application of sanctions (rewards 
and punishments). In the present study -the 
film-instructions manipulation could be con- 
sidered the rule statement; the presence of 
the other group members and the knowledge 
that the Es were observing from the adjoin- 
ing room might serve an implicit surveillance- 
evaluation function; and the other group 
members as well as S himself could apply the 
sanctions, 

This conceptualization bears a marked 
resemblance to a simple learning-reinforce- 
ment framework. Perhaps the primary dif- 
ference is that it focuses on the learning of 
rules or strategies rather than discrete acts. 
As previously stated, it stresses the cognitive 
as well as the overt motor components of 
behavior. Although such a focus is not new 
in the study of imitation (Asch, 1952; 
Koffka, 1925), little emphasis has been 
placed on problems of attention and under- 
standing in the recent research on modeling. 
Perhaps cognition has been ignored because 
data derived from the tasks typically. em- 
ployed in imitation experiments can be more 
parsimoniously explained in simple stimulus- 
response terms. A weighty cognitive frame- 
work seems unnecessary when describing dis- 
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crete motor acts or two-choice decisions, since 
Ss imitate these simple behaviors even when 
problems of attention and understanding are 
ignored in the experimental manipulations 
(e.g., Kimbrell & Blake, 1958). 

The present experiment differed from the 
usual imitation study in that complex verbal 
response classes rather than simple discrete 
acts were modeled; successful imitation in- 
volved matching general verbal categories 
rather than simply mimicking the exact be- 
havior observed in the film model. Perhaps 
in this latter situation some attention must 
be paid to orienting S toward the relevant 
variables if imitation is to occur. 

It is interesting to note that this “cogni- 
tive” view of behavior change has recently 
been advocated by numerous experimental as 
well as clinical psychologists (e.g., Breger & 
McGaugh, 1965) and has formed the basis 
of several systems of psychotherapy (e.g., 
Haley, 1963). Moreover, despite a general 
disregard for cognitive components in empiri- 
cal studies of imitation, recent studies and 
theories of modeling behavior (e.g., Bandura, 
1965) reveal an incipient trend toward con- 
sidering these aspects. 

In summary, it would appear that the 
norm-sending notion (which stresses the im- 
portance of attention and information) and 
the disinhibition notion (which assumes inter- 
personal openness is viewed as potentially 
embarrassing or dangerous) provide feasible 
frameworks for discussing the present findings. 
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EFFECTS OF MODELING PROCEDURES IN 
HELPING RELATIONSHIPS 


KENNETH HELLER? 
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The clinical utility of modeling techniques is discussed in terms of a growing 
trend in which psychotherapy is viewed as part of a broader psychology of 
behavior change. Modeling techniques can be important facilitators of change, 
but their application to clinically helpful relationships is impeded by a paucity 
of research investigating the operation of modeling procedures in clinical-like 
settings. Intermediate steps or “bridging” research is needed before experi- 
mentally derived hypotheses can be applied clinically. The Whalen study, 
which is one such “bridge,” describes the effort needed for modeling pro- 
cedures to be effective in adult groups and illustrates the need for constant 
attention to setting and context factors if a systematic psychology of behavior 


change is to be developed. 


Throughout the history of psychotherapy, the 
major source of ideas about its conduct has been 
reports of experiences of practicing therapists. 
This is as true today as it was some 70 years 
ago when Breuer and Freud (1937) developed 
their theoretical notions about psychopathology 
derived from experiences with five cases of 
hysteria, 

Most recently, the experimental laboratory has 
become a serious source of competing hypotheses 
about the nature of therapeutic interventions. 
Psychologists in seemingly divergent areas of 
experimental research have been trying, for some 
time, to understand the factors responsible for 
behavior change. The literature on the conditions 
that facilitate psychological change has been 
growing in such fields as learning, education, 
social influence, attitude change, persuasion, and 
cognitive restructuring. Clinical psychologists 
trained in academic settings have begun to 
wonder whether changes in therapeutic technique 
might be possible if, taking advantage of the 
experimental literature, psychotherapy became 
part of a broader psychology of behavior change. 

The work of Whalen (1969) is one attempt 
to test, in a clinical-like setting, hypotheses de- 
rived from the experimental literature on obser- 
vational learning and to suggest a clinical utility 
to modeling procedures. In discussing the Whalen 
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Heller, Department of Psychology, Indiana Univer- 
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study, several points will be covered: the cur- 
rent research evidence concerned with the effects 
of models in personal interviews; models as 
facilitators of change in helping relationships: 
the limits of modeling procedures; and the func- 
tion of laboratory analogue research in the acqui- 
sition of knowledge about behavior change in 
clinical settings. 


Modeling Techniques in Personal Interviews 


There have been a number of studies demon- 
strating the facilitating effects of modeling pro- 
cedures on the discussion of personally revealing 
topics in interviews. Several of these studies, 
conducted at Indiana University, have been 
described elsewhere (Heller, 1968; Heller & 
Marlatt, 1969). In summary, this research has 
found that modeling procedures provide an ef- 
fective means of teaching interview role behav- 
iors, particularly when task instructions are am- 
biguous (Marlatt, 1968b). Modeling procedures 
were found to be more effective than verbal re- 
inforcement in eliciting and sustaining self- 
disclosure (Marlatt, 1968a) and were also more 
effective when consequences to the model were 
either positive or neutral (Marlatt, Jacobson, 
Johnson, & Morrice, 1966). Teaching role expec- 
tations through modeling produces greater inter- 
view participation for chronic hospitalized psy- 
chotics as well as for more typical college 
undergraduates (Zerfas, 1965). However, for 
college student volunteers, instructions to “talk 
about your problems” without models can also 
produce high levels of self-disclosure in individual 
interviews (Jacobson, 1968). 

The study by Whalen (1969) found that in a 
group setting neither instructions nor models 
alone could produce high levels of self-disclosure. 
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tion of detailed, exhortative instruc- 
modeling were required. Her results 
the importance of context and setting 
research of this type. The meaningful 
of personal concerns and fears is often 
as potentially risky and embarrassing. 
the context of an individual interview, 
models or detailed instructions appear 
; to elicit self-disclosure. In a group 
in which S faces complete strangers, the 
appearing weak and inadequate is greater, 
al openness is inhibited, In the Whalen 
ly, neither modeling nor instructions were, by 
ves, capable of removing that inhibition. 
dy reminds us that therapeutic tools are 
universal panaceas, indiscriminately effective 
all circumstances. 


“as Facilitators of Change in Helping 
ships 


@ observational learning is so common- 
normal socialization, it is a wonder that 
jotherapists did not use it earlier as part 
ir therapeutic procedures. Undoubtedly, 
ist ideology had an important role in its 
sion. The model of patient psychopathology 
h which therapists traditionally oriented their 
primarily involved emotional and cognitive 
ons. Even today, most of our current 
jes assume that what went wrong initially 
he patient’s life is that anxiety engendered 
e disapproval of others has blocked self- 
sion. Inhibitions lead to anxiety which in 
Provides fuel for further inhibition. The 
mt becomes caught in a destructive cycle 
h Ag him unable to think, feel, or act 
riately. 
inhibition model of pathology predicts 
mpts to foster observational learning 
because the patient’s ability to perform 
ly is blocked by anxiety. Indeed, there 
tal evidence to substantiate the belief 
patients will be unresponsive to 
vior that is too “healthy” or bold. 
ple, Bandura, Grusec, and Menlove 
ied the effects of modeling procedures 
g fear of dogs in nursery school chil- 
work revealed that when a peer 
hed the dog boldly from the begin- 
layed an early close interaction 
, the fearful children actively 
at the model’s performance and 
to participate in subsequent 


ighly fearful patients, the therapist has 
f | One alternative available to reduce 
tear. He can adopt the more typical pro- 
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cedure of working toward the extinction of fears 
by first removing verbal inhibitions, on the as- 
sumption that indirect, symbolic disinhibition of 
fears plus verbal understanding of them will lead 
to overt behavior change. However, the therapist 
can also deal with the patient’s fears by encour- 
aging approach responses more directly. The 
Bandura et al. (1967) study illustrates this point. 
Dog-phobic nursery school children were exposed 
to one of four treatment conditions. One group 
observed a peer model exhibit progressively 
stronger approach responses to a dog within the 
positive context of a party. A second group ob- 
served the model’s behavior in a neutral context. 
A third group observed the dog during a party 
with the peer model absent, while a fourth group 
participated in a party with both the model and 
the dog absent. Results indicated that Ss who 
observed the model interact nonanxiously with 
the dog in either a positive or a neutral context 
displayed significantly greater approach responses 
to the dog, which persisted through a 1-month 
follow-up. A further extension by Bandura and 
Menlove (1968) revealed that filmed models 
interacting with dogs can be used in place of live 
models, Greatest effectiveness for the filmed 
models is achieved by a broad sampling of 
models and aversive stimuli. In other words, 
observation of different models varying in age, 
as they interact with various dogs, produces 
greater approach behavior in fearful children than 
observation of single-model single-dog interaction. 

When the patient’s primary problem does not 
involve areas of neurotic inhibition, verbal 
insight-oriented therapy alone may be insufficient. 
For example, many patients do not possess within 
their response repertoires the behaviors neces- 
sary for successful therapy or for productive 
real-life behavior. Limited experience makes 
them appear unsuitable for therapy; they need 
opportunities to learn prosocial behavior. If a 
therapist believes that he has a role in maxi- 
mizing behavior change for patients with im- ` 
poverished behavioral repertoires, he might wish 
to turn to the literature on imitation learning 
in his search for new therapeutic procedures (see 
Goldstein, Heller, & Sechrest, 1966, pp. 197-201 
and 261-275). 

As a therapeutic tool, modeling procedures can 
be used to weaken or remove inhibitory re- 
sponses in the observer, to aid in teaching new 
behaviors, to help establish the timing of re- 
sponses, and to provide a means of learning 
inhibitory controls. With the current emphasis 
on community mental health, psychologists are 
being called upon to serve in a variety of helping 
relationships besides psychotherapy. In these 
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new settings, similar uses of modeling procedures 
might be expected. For example, some consul- 
tants teach interviewing techniques to consultees 
(e.g., nurses, case aides, para-professionals) by 
having the latter observe the work of more 
experienced professionals. In a manner similar 
to that used by Whalen (1969), filmed and 
taped examples of encounter groups have been 
used to stimulate interpersonal openness in sensi- 
tivity training groups. We are now at a stage in 
which the potential utility of modeling procedures 
in psychologically helpful relationships is just 
beginning to be realized. 


Limits of Modeling as a Therapeutic Technique 


The mental health fields are pervaded by a 
philosophy of “oversell,” in which techniques are 
put forth as panaceas for all types of problems. 
Inevitably, what follows each new fad is dis- 
illusionment resulting from inappropriate, over- 
generalized application. With reference to model- 
ing techniques, faddishness might be retarded if 
we can clearly specify what we know about the 
conditions that affect the strength of observa- 
tional learning. One way of beginning would be 
to specify why those in need of psychological 
help have not been able to profit from observing 
the behavior of others before entering psycho- 
therapy. If observational learning is so wide- 
spread in our society, why are patients unable to 
resolve their difficulties on their own by observing 
the adequate solutions of others? 

For some individuals, appropriate pro-social 
models are not available. For example, delin- 
quents raised in a deviant subculture may have 
few opportunities to observe socially approved 
patterns of adjustment. Other individuals may 
be too inhibited to copy behaviors they see per- 
formed successfully, while still others may be too 
withdrawn to notice those behaviors. Included in 
this latter group are those chronically disturbed 
persons whose defensive style of reacting to stress 
removes them from normal social interaction. 
They illustrate the secondary effects of early 
emotional disturbance, namely social impoverish- 
ment associated with lack of experience. 

For the groups cited above, either environ- 
mental circumstance or their own pathology pre- 
vents them from taking full advantage of the 
adaptive experiences of others. It might be ex- 
pected that if they were provided with opportu- 
nities for observational learning in a benign 
therapeutic environment, these deficiencies might 
be overcome. The problem is that our knowledge 
of the factors facilitating observational learning 
in therapeutic settings is scant. Most of the re- 
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search on observational learning deals with so- 
cially acceptable behaviors and a good share of 
this work uses children as Ss. What is needed is 
more research examining the conditions necessary 
for adults to copy behavior which for them is 
personally unacceptable, inhibited, or socially 
disapproved. The Whalen (1969) study of self- 
disclosure in a group setting is one such attempt 
and reveals the difficulties in overcoming societal 
sanctions through modeling procedures alone. 


Laboratory Research and Clinical Application 


The transposition of the findings of laboratory 
research to areas of application quite different 
from its original setting is not accomplished 
without difficulty (Heller & Marlatt, 1969). For 
example, Hilgard and Bower (1966) describe six 
types of research which should be undertaken 
before principles of learning developed in the 
experimental laboratory can be adapted to school 
classrooms. Hovland (1959) and McGuire 
(1968), in social psychology, have also called for 
a more careful analysis of person and situation 
variables to resolve contradictory findings in 
studies of social influence and persuasibility. In 
the field of psychotherapy, Ford and Urban 
(1967) describe a three-stage process required 
to translate basic principles of behavior change 
to therapeutic practice. The point is that one 
should not expect to move directly from theo- 
retical hypothesis to clinical application without 
prior information concerning the interaction of 
mediating variables such as personality, task, and 
setting. “Bridging” steps are required between 
hypothesis and application. 

As illustrated in Figure 1, therapeutic hypothe- 
ses may be derived from clinical experience 
(their traditional source) or from behavior 
change research in other areas of psychology. 
Regardless of their source, factors thought to be 
responsible for change should be identified and 
studied in controlled laboratory settings that 
approximate clinical interactions. The purpose of 
clinical laboratory research is to determine what 
factors produce change, under what conditions 
they operate best, and how they should be com- 
bined to produce an effective therapeutic pack- 
age. The therapeutic agents thus identified can 
then be studied in clinical field research to obtain 
information about the interaction of therapeutic 
ingredients with personality and setting charac- 
teristics that are part of actual treatment. 
Finally, the practitioner is in a position to 
apply this knowledge to individual patients with 
some degree of confidence in the expected out- 
come. Also, he can use the information gained 
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f om practice to develop further hypotheses 
about the relevant factors in the change process. 
Needless to say, the above description does not 


Tepresent the usual manner in which research in 


move directly from clinical hypotheses to clinical 
trials. Little attention is given to determining the 


{ herapeutic procedures, or to the conditions under 
Which each therapeutic ingredient is optimized. 
Therapeutic strategies are often expected to ap- 
ply regardless of patient, therapist, or setting 
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Only after a therapeutic approach has become 
entrenched as a “school” with disciples com- 
tted to its perpetuation, do researchers begin 
(0 question the value of some of its therapeutic 
ingredients. An illustration of this point can be 
seen in recent developments in behavior therapy. 
After years of application and professional 
growth, the crucial ingredients in systematic 
desensitization are only now being given careful 
Scrutiny, 
| The sequence in which the research steps 
between hypothesis and application are carried 
Out is not at issue. What is important is that 


i 


in creas lication interviews (Sarason & Ganzer, 
act with’ and (c) teach parents how to inter- 
lice their emotionally disturbed children 
Eng Y, 1964). Studies investigating the condi- 
; under which modeling effects are optimized 


(Analogue Research) 


(Process and outcome 
studies) 
> a 


es 
> CLINICAL TRIALS £ 


The research steps needed between the discovery of psychological 
change factors and their clinical application. 


in clinical-like laboratory settings are less well 
represented in the research literature. The 
Marlatt (1968a, 1968b), Marlatt et al. (1966), 
Jacobson (1968), and Whalen (1969) studies 
stand as rare exceptions. 
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| Recent interest in the use of verbal operant 
nditioning (VOC) as a research paradigm 
f isolating and manipulating variables 
ought to be important to the therapeutic 
teraction has led to a number of studies of 
e effects of interviewer interventions in 
atively unstructured interview analogues 
Kennedy & Zimmer, 1968; Merbaum, 1963; 
lerbaum & Southwell, 1965; Powell, 1968; 
askow, 1962). While emission rates of vari- 
interviewee responses have been success- 
lly manipulated, post hoc explanations of 
erential intervention effectiveness have 
t been subjected to adequate experimental 
ting. 

One such explanation (Merbaum, 1963) 
ggests that the reinforcement value of an 
tervention is dependent on how effective it 
0 providing cues which allow S to dis- 
minate the response-reinforcement contin- 
eney. Such discriminative cues are thought 
Include the interruption of S’s conversation, 
me presence of additional associative labels, 
nd characteristics which orient S to such re- 


Warding properties of Æ as attention, interest, 
and warmth, 
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CONDITIONING AND TRANSFER OF AFFECTIVE SELF- 
REFERENCES IN A ROLE-PLAYED COUNSELING 
INTERVIEW * 


ROBERT J. HOFFNUNG 2 
University of Cincinnati 


The differential effects of five forms of therapylike intervention (Um-Hmm, 
Echoic, Paraphrasic, combined Um-Hmm-Echoic, and combined Um-Hmm- 
Paraphrasic) in the conditioning and transfer of affective self-references (ASRs) 
were tested in a 40-minute role-played initial counseling interview. The Ss 
were 72 female student volunteers. Conditioning of ASRs and transfer to a 
TAT story telling task were found, although hypothesized relationships between 
discriminative cue potency of the interventions and performance were only 
partially supported. An explanation involving the presumed interaction of S 
| awareness of her own self-disclosures with intervention type, degree of threat, 
and conditioning performance was proposed. 


Concomitant with the application of the 
VOC paradigm to the interview have been at- 
tempts to demonstrate transfer of conditioned 
changes in rates of responding to new settings. 
Since transfer or generalization of changes 
occurring within the therapeutic interview to 
other life situations is an important goal of 
most therapies, demonstration of transfer of 
conditioning within the VOC paradigm is 
highly desirable if not essential if the model 
is to have continued utility. With the excep- 
tion of Dicken and Fordham (1967) who 
conditioned positive self-reference statements 
and statements of positive affect and reported 
transfer effects for selected CPI scores, there 
has been little evidence that conditioning 
achieved in an unstructured interview situa- 
tion can be transferred to new settings 
(Lanyon, 1967; Moos, 1963; Rogers, 1960). 
In no case have the differential effects of 
interviewer intervention on both the condi- 
tioning and transfer of a response in an 
unstructured interview situation been reported. 

The present study systematically explored 
the differential effects of five forms of therapy- 
like intervention on the conditioning and 
transfer of affective self-reference (ASR) re- 
sponses in a full length, role-played, initial 
counseling interview. Interventions were de- 
signed to provide differing levels of dis- 
criminative cue potency, and were, from least 
to most potent, as follows: Condition I: 
Um-Hmm; Condition II: Echoic, in which - 
E repeated/reflected the content and mood of 
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S’s ASR; Condition III: Paraphrasic, in 
which Æ restated/rephrased S’s ASR re- 
sponse, substituting an appropriate synonym 
for each “feeling” word uttered, while retain- 
ing both the meaning and feeling of S’s state- 
ment; Condition IV: Combined Um-Hmm— 
Echoic; and Condition V: Combined Um- 
Hmm-Paraphrasic, which represented the 
highest level of discriminative cue potency. 

Role-playing instructions were designed to 
facilitate realism, and the ASR response class 
was selected because of the widely held view 
that a client’s verbalizations of his personal 
feelings are a key component of the thera- 
peutic relationship. Finally, the transfer task 
was arranged to maximize continuity in cues 
with the conditioning interview setting. 

The following predictions were made: (ca) 
With the exception of the Um-Hmm interven- 
tion condition (assumed to be the least 
potent), all interventions would produce 
conditioning of ASRs, and that these effects 
would parallel the order of predcited inter- 
vention potency. (b) Transfer would occur 
as a direct function of the degree of condi- 
tioning in each intervention condition. 


METHOD 


The 72 female University of Cincinnati introduc- 
tory psychology students who served as Ss were 
randomly assigned to six treatment groups of 12 
each, five experimental conditions, and one control 
condition. 

Role-playing interviews were individually con- 
ducted and consisted of a 10-minute operant period 
during which only a small number of noncritical 
interventions were employed to maintain satisfactory 
conversational rapport followed by a 30-minute 
acquisition of conditioning period during which a 
single Æ employed one of the five interventions im- 
mediately following each ASR uttered by S. The 
ASR reponse class is described in detail by Salzinger 
and Pisoni (1960) and included any statement by 
S describing or evaluating her own state, other than 
intellectual or physiological, and included only those 
affect statements which begun with the pronoun 
“TD or “We.” 


Transfer Task 


Immediately preceding and following the role- 
playing, S was required to tell three 2-minute TAT 
stories, three different cards serving as pretransfer 
and three as posttransfer stimuli. Cards were equated 
for their ability to elicit affect responses and presen- 
tations were counterbalanced. The criterion response 
for the transfer task included both “other” and self- 
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referred affective reference statements based on the 
assumption that affective references (ARs) to others 
reflect projected references to self. 

To control for the influence of training and elapsed 
time on AR responding, independent of conditioning, 
Ss in Condition VI received the same pre- and post- 
role-playing tasks but in lieu of the conditioning 
interview and participated in a 40-minute “self- 
interview” during which they discussed 10 general 
topics related to their personal experiences, recording 
themselves in the absence of E. 


Posttransfer Interview 


Following the transfer task, each S was asked to 
complete a self-administered questionnaire (similar to 
that employed by Merbaum & Southwell, 1965) for 
another psychologist interested in reactions to this 
experiment. She was then asked to state the purpose 
of the interview and her evidence for this; to list 
all of the feeling words she could remember express- 
ing about herself during the interview and all of 
those she remembered E expressing; to judge the 
pleasantness of £’s interventions on an 11-point 
Likert-type scale; and finally, to list all younger 
and older siblings. 


RESULTS 


Typed transcripts of the final 2 minutes of 
the first (operant), second, third, and fourth 
10-minute segments of each recorded inter- 
view were coded and independently scored by 
E and two assistants. Reliability for the 
three scorers was .889 as estimated by the 
Ebel (1951) variance ratios for intraclass 
correlations. 


Conditioning Data 


An analysis of variance yielded a signifi- 
can segments effect (F = 2.28, df = 3/165, 
< .05) indicating that ASRs increased for 
all experimental conditions. Neither treatments 
nor Treatments X Segments effects were sig- 
nificant. Quadratic trends in main effects for 
segments were significant (p < .01), both for 
oyerall treatments and for individual treat- 
ment groups, and from Figure 1 it can be 
seen that acquisition did not postively in- 
crease over all segments, but rather peaked 
and declined. 

Due to differences in operant response level | 
among the five treatment groups, an analysis 
of variance based on change scores was per- 
formed. A significant treatments effect (F = 
1.68, df = 4/55, 05 < p < .10) indicated that 
with operant levels held constant, interven- 
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{ ions were to some degree differentially effec- 
tive. A Newman-Keuls test of differences 
between treatment means revealed that the 
Combined Um-Hmm-Echoic, the Paraphrasic, 
and the Um-Hmm interventions produced 
significantly greater conditioning than did the 
Echoic intervention, (p < .01), and that the 
combined Um-Hmm-Echoic and the Para- 
phrasic interventions were also superior to 
the combined Um-Hmm-Paraphrasic inter- 
vention, (p < .05). 


Transfer Data 


I A transfer score was derived for each S by 
d Subtracting the number of affective references 
a given in the first set of TAT stories 
e number given in the second set and 
E orming to eliminate negative signs. The 
ans and standard deviations for the five 
experimental and one control groups are 
j} Presented in Table 1. 
st aa length and verbalization time per 
and emained fairly constant for pretransfer 
ni fic EE tasks and did not differ sig- 
s ly between conditions. An analysis of 
| ce revealed a significant treatments 
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Fic. 1, Total ASR scores summed over scorers, by segments, for each of five E intervention conditions. 


effect (F = 2.38, df = 5/66, p<.05), and 
comparison of all experimental means with 
that of the control by Dunnett’s test (Winer, 
1962) showed that all treatment conditions 
produced greater transfer of ARs (p < .05) 
than did the control condition. The superior- 
ity of the Paraphrasic intervention over the 
Echoic intervention represented the only sig- 
nificant difference in transfer effects between 
treatments (t = 4.86, df = 22, p < .05). Al- 
though total conditioning effects (ASRs) 
were not significantly correlated with transfer 
effects (ARs), a rank-order correlation of 
.70 between mean conditioning and transfer 
scores was obtained. 


TABLE 1 


MEANS AND STANDARD DEVIATIONS OF AFFECTIVE 
REFERENCE TRANSFER SCORES 


Condition 
Statistic 
I | mu | E EIO NoN 
M |1483 | 12.75 | 16.50 | 13.50 | 14.25 | 9.75 
sp | 531| 421| 545| 5.38 4.44 | 4.37 
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Postexperimental Interview 


No relationships were found between birth 
order, family size, ratings of intervention 
pleasantness or conditioning or transfer per- 
formance. Only five Ss were able to verbalize 
the nature of the reinforcement contingency 
and those were distributed throughout the five 
intervention conditions. A gross rating of Ss’ 
involvement in role playing indicated that ap- 
proximately 34 were highly involved, % mod- 
erately, and 4% minimally involved. 


Discussion 


As predicted, conditioning effects occurred 
in all experimental conditions. While predic- 
tions of differential intervention effectiveness 
based on hypothesized differences in their 
discriminative cue potency were supported in 
part, the absence of more clear-cut findings 
suggests that an alternative explanation is 
needed. A number of researchers have 
proposed that awareness of the response- 
reinforcement contingency may influence 
VOC performance, although there is consider- 
able controversy about whether awareness is a 
learned prerequisite for conditioning or merely 
a behavioral concomitant of, but not a neces- 
sary condition for, its occurrence (e.g., Dixon 
& Oakes, 1965; Spielberger & DeNike, 1966). 

Although few Ss were able to report aware- 
ness during the brief postexperimental inter- 
view, the accuracy with which they recalled 
both their own ASRs and the interventions 
of E suggested that more were aware and that 
this would have been confirmed by a more 
detailed interview procedure. The effects of 
awareness on consequent performance may 
have been influenced by the nature of S’s re- 
sponses (the things she heard herself saying), 
the type of intervention employed (what she 
heard E saying), and such situational factors 
as S expectations, instructional set, and de- 
gree of situational threat. Most Ss apparently 
accepted as valid the role-playing instructions 
and appeared willing to talk about personal 
feelings from the outset. With the exception 
of the Paraphrasic intervention group, ASR 
responding increased for all conditions during 
the first conditioning segment. - oe 

Awareness would have exposed S to her 
own ASRs and, with the exception of Ss in 
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the Um-Hmm condition, confronted her with , 


E’s reflections or restatements of these same 
self-disclosures. Awareness of such self- 
disclosures is often threatening particularly 
when negative or self-critical feelings are 
involved. Categorization of ASR data into 
positive, neutral, or negative responses re- 
vealed that negative self-disclosures predomi- 
nated, the ratios being %, %, %, respectively. 
For the aware S, such feedback would prob- 
ably have been threatening and led to sup- 
pression or partial extinction of ASR 
responding. If interventions most effective in 
providing cues for discriminating the response- 
reinforcement contingency lead to the most 
rapid conditioning, and if awareness and con- 
ditioning are concomitant, awareness and its 
consequences should have occurred most 
rapidly for the most potent interventions. 
From Figure 1 it can be seen that Um-Hmm 
took the longest to produce peak conditioning 
while acquisition was more quickly achieved 
with the other more potent interventions. For 
the Um-Hmm, Echoic, and Um-Hmm-Echoic 
interventions, peak conditioning was followed 
by one or more decreases in response rate, 
suggesting that at that point, something inter- 
fered with the acquisition process. Anecdotal 
reports of Ss in the two Echoic conditions 
indicated that they found it annoying and 
unpleasant to hear Æ just repeat and parrot 
back the things they had said about them- 
selves. This supports the notion that S’s 
awareness of her own responses, or of E’s 
interventions was threatening. The possibility 
that reflections may have aversive qualities 
was raised by Merbaum and Southwell 
(1965) who found Echoic interventions to 
be ineffective as reinforcers. The relative 
stability of conditioning effects in the Para- 
phrasic and Um-Hmm-—Paraphrasic interven- 
tion groups suggests that they were at least 
threatening in terms of the affects of aware- 
ness on performance. In both, the parroting 
quality of the Echoic interventions was 
mediated by a rephrasing of S’s content, and 
in one, by the addition of Um-Hmm, 
Turning to the transfer data, we find that 
transfer occurred for all experimental groups 
but not for the comparison control group, 
strongly supporting the second experimental 
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“hypothesis. The finding also supports the 
i notion that transfer is largely dependent on 
continuity between cues in conditioning and 
transfer settings. Such continuity was maxi- 
mized by having the same Æ administer both 
the conditioning and transfer tasks, in the 
same room, with minimal delay between tasks. 
The relative insensitivity of transfer effects 
the variability in conditioning perform- 
ace may have been due in part to the re- 
iricted range of the transfer scores and in 
rt to the projective nature of the TAT 
isk which allowed S to attribute her feelings 
) characters in her stories and thereby 
fend against the threat of self-disclosure. 
Variability in performance due to aware- 
and threat is characteristic of other life 
‘ions. For example, in counseling where 
tation of client expression and awareness 
personal feelings is often a desired goal, 
blocking and resistance frequently en- 
tered may be analogous to the shifts in 
responding found in the present study. 
re research should investigate the pro- 
d relationships between intervention po- 
, response class, awareness of threat, and 
nditioning in situations resembling the 
unseling interview. 


REFERENCES 


EN, C., & Forpnam, M. Effects of reinforcement 
f self-references in quasi-therapeutic interviews. 
pet of Counseling Psychology, 1967, 14, 145- 
Dixon, P. W., & Oaxes, W. F. Effect of intertrial 
ivity on the relationship between awareness and 
ee Operant conditioning. Journal of Experi- 
PAR Psychology, 1965, 69, 152-157. 


531 


Eset, R. L. Estimation of the reliability of ratings. 
Psychometrica, 1951, 16, 407-423. 

Kennepy, J. J., & ZIMMER, J. M. Reinforcing value 
of five stimulus conditions in a quasi-counseling 
situation. Journal of Counseling Psychology, 1968, 
15, 357-362. ba; 

Lanyon, R. I. Verbal conditioning: Transfer of 
training in a therapy-like situation. Journal of 
Abnormal Psychology, 1967, 72, 30-34. 

Mersaum, M. The conditioning of affective self- 
references by three classes of generalized reinforcers. 
Journal of Personality, 1963, 31, 179-191, 

Merpaum, M, & SourHwett, E. A. Conditioning of 
affective self-references as a function of the dis- 
criminative characteristics of experimenter inter- 
vention. Journal of Abnormal Psychology, 1965, 
70, 180-187. 

Moos, R. H. The retention and generalization of 
operant conditioning effects in an interview situa- 
tion, Journal of Abnormal and Social Psychology, 
1963, 66, 52-58, 

Powe, W. J., Jr. Differential effectiveness of inter- 
viewer interventions in an experimental interview. 
Journal of Consulting and Clinical Psychology, 
1968, 32, 210-215. 

Rocers, M. J. Operant conditioning in a quasi- 
therapy setting. Journal of Abnormal and Social 
Psychology, 1960, 60, 247-252. 

Satzincer, K., & Pisoni, S. Reinforcement of verbal 
affect responses of normal subjects during the 
interview. Journal of Abnormal and Social Psy- 
chology, 1960, 60, 127-130. 

SPIELBERGER, C. D., & DeNrxe, L. D, Descriptive 
behaviorism versus cognitive theory in verbal 
operant conditioning. Psychological Review, 1966, 
73, 306-326. 

Wasxow, I. E. Reinforcement in a therapy-like situ- 
ation through selective responding to feelings or 
content. Journal of Consulting Psychology, 1962, 
26, 11-19. 

Winer, B. J. Statistical principles in experimental 
design, New York: McGraw-Hill, 1962. 


(Received August 21, 1968) 


of Consulting and cs, Psychology 


Journal 
1969, Vol. 33, No. 


5, 532-53 


ADJUSTMENT, MENTAL HEALTH OPINIONS, AND 
PROFICIENCY OF CHILD CARE PERSONNEL * 


JOHN W. GIEBINK ? DONALD O. STOVER 


University of Wisconsin 


AND 


MMPI validity and K-corrected diagnostic scale scores, responses to a mental 
health opinion questionnaire, biographical data, and job performance ratings 
of 52 child care personnel were intercorrelated. There were significant (p < .05) 
relationships between 11 MMPI scales and measures of job proficiency, but 
only 4 of the 40 mental health opinion items were related to proficiency. 
A principal components analysis of the entire correlation matrix yielded two 
major factors. Items related to adjustment and job proficiency were strongly 
loaded on the first factor. The second factor reflected the biographical vari- 
ables, The remaining factors accounted for small portions of the total variance 
and seemed to be relatively specific in nature. Since mental health opinions 
in contrast to personal adjustment were seemingly unrelated to job proficiency, 
it was suggested that their “mental illness” determined content was irrelevant 
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to performance on a job requiring a “learning” orientation. 


An assumption often made but seldom 
tested is that personal adjustment affects the 
performance of persons caring for emotionally 
disturbed children and adults. Both adjust- 
ment and mental health attitudes are fre- 
quently considered important criteria in se- 
lecting staff to care for emotionally disturbed 
persons. These two qualities would seem to be 
especially related to the job proficiency of 
child care workers in an intensive residential 
treatment center, for their work requires the 
ability to enter into close therapeutic rela- 
tionships with children whose objectionable 
behavior has resulted in separation from home 
and community. 

Although studies suggest that certain per- 
sonality and attitudinal attributes may be 
altered by contact with persons confined in 
mental hospitals (e.g., Holzberg & Gewirtz, 
1963; Holzberg, Gewirtz, & Ebner, 1964), evi- 
dence concerning the relationships between 
existent personality and attitudinal character- 
istics and the behavior of care personnel is 
sparse. After comparing psychiatric aides 
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with normative groups on the Edwards Per- 
sonal Preference Schedule and the Leary In- 
terpersonal Checklist, Lawton (1964) at- 
tempted to relate these measures to their job 
performance. Although performance ratings 
were “unstable” and interjudge agreement was 
low, he described the better aides as less 
authoritarian, more naively benevolent, and 
as seeing themselves as less discrepant from 
the patients than the poor aides. Less than 
the chance number of significant correlations 
were obtained in a study relating opinions 
about mental illness, interpersonal values, and 
some biographical variables to factor scores 
and total scores of performance ratings of 
psychiatric aides (Tollefson, 1964). A factor 
analysis of all the variables in this study 
yielded no factors with strong loadings from 
both predictor and criterion measures, Pryer 
(1967) found no relationship between three 
objective personality measures on psychiatric 
aides and ratings of their job performance. 
There are some possible explanations for 
the lack of empirical support for the assump- 
tion that there would be a relationship be- 
tween personality and/or attitudes and job 
performance in caring for disturbed persons. , 
It is quite likely that persons with markedly 
deviant personality attributes as well as those 
who hold unusual opinions about mental 
health will be screened out prior to employ- 
ment, thus restricting the Tange of scores on 
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these attributes and consequently decreasing 
the chances of finding significant relationships. 
Instability of the personality and attitudinal 
measurements and the lack of reliability of 
ratings of performance would also limit the 
correlations obtained. 

The rather meagre results reported previ- 
ously along with the methodological difficulties 
noted above suggested the use of procedures 
that would enhance the chances of finding 
relationships among adjustment, opinion, and 
performance measures. This could be done by 
obtaining reliable performance ratings and 
using a correlational approach to maximize 
the opportunity of finding a statistical rela- 
tionship among MMPI scale scores, responses 
to a mental health opinion questionnaire, cer- 
tain biographic variables, and performance 
ratings. 

METHOD 


The Minnesota Multiphasic Personality Inventory 
(MMPI) and a mental health opinion questionnaire 


533 


(Nunnally, 1961) were routinely administered to all 
nurses and child care workers within 2 weeks of 
their employment at the Wisconsin Children’s Treat- 
ment Center (a facility for research and treatment 
for severely disturbed children). Although the mental 
health opinion questionnaire consisted of 60 items, 
only those 40 items that had been factored by Nun- 
nally were used in the present study. Each of these 
items was rated on a 7-point “agree—disagree” scale. 
The Ss’ performance on these measures was not 
available to the staff at the Center, and conse- 
quently could not enter into decisions about selec- 
tion, evaluation, or retention of employees. 

Three evaluations of job performance were avail- 
able for the 52 Ss in this study. One of these, the 
Specific Proficiency Rating (SPR) consisted of 98 
true-false items. After 6 months’ employment, the 
SPR was completed on each staff member by his 
nursing supervisor. The Hoyt (1941) internal con- 
sistency estimate of reliability for this scale was 
87. The other two performance measures consisted 
of global ratings made on 5-point scales by the 
director of nursing. The Retention Rating was a 
judgment of the amount of effort that either was 
expended or would potentially be expended to retain 
the employee. The Global Proficiency Rating was 
a retrospective judgment of the overall effectiveness 


TABLE 1 
CORRELATIONS BETWEEN PREDICTOR AND CRITERION VARIABLES 
i i A 4 Global proficien: 
Variables enai Retention rating aH s 
MMPI 
L —.789 —.708 
K 7 —.426 
E — .684 —.722 
Hs —.795 —.644 
D -408 
Hy —.707 —.602 
Pa —.557 —.441 
Pa —.805 —.690 
Pt —.748 —.583 
Se —.472 — .326 
Ma —.792 —.714 
Si —.690 — .630 
Biographic 
Child care worker—nurse 329 
Mental health opinions 
omen have no more emotional problems 
than men do. k —.383 
X rays of the head will not tell whether a z 
Person is likely to become insane. —.302 
à person concentrates on happy memories, 
€ will not be bothered by unpleasant 
things in the present. —.295 
© good psychiatrist acts like a father to 
S patients, —.400 
most any disease that attacks the nervous 
System is likely to bring on insanity. —.301 


Note—Onty those correlations which were significant (p < .05, r = .273, df = 50) are reported, 
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TABLE 2 
Trem Loapines ror Factors I anb II 
FOLLOWING VARIMAX ROTATION 
Item Factor loading 
Factor I 
MMPI 
Pa —.961 
Hs = .949. 
Pi —.938 
jA —.926 
Hy —.920 
Ma —.918 
F —.897 
Specific Proficiency Rating .842 
MMPI 
Si —.784 
Se —.748 
Global Retention Rating TAL 
MMPI 
PD —.703 
Factor II 
Male-Female 896 
Position (child care worker, nurse) -886 
Degree (BA or RN) —.839 
Education beyond high school .810 
MMPI 
MF —.762 
Age —.477 
Mental health opinion items 
The mentally ill pay little attention 
to their personal appearance. —.347 
Mental health is one of the most 
important national problems. —.347 
You can tell a person who is men- 
tally ill from his appearance. —.346 
Helping the mentally ill person with 
his financial and social problems 
often improves his condition. —.317 


of the employee at the end of his first 6 months’ 
employment. 

The biographic variables of sex, age, position 
(nurse or child care worker), and education, along 
with the three proficiency ratings, three MMPI 
validity scale scores and the 10 K-corrected diag- 
nostic scale scores, and 40 mental health opinion 
items were included in the total analysis. 


RESULTS AND Discussion 


An initial analysis consisted of the inter- 
correlations of all variables. The relationships 
among the three criterion variables of job 
proficiency were examined first. The Reten- 
tion Rating correlated .70 with the Specific 
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Performance Rating, and .54 with the Global 
Performance Rating. The Global Performance 
Rating correlated .05 with the Specific Per- 
formance Rating. Thus, the Global Perform- 
ance Rating appeared to be the least related 
to the other measures and consequently least 
satisfactory. 

The correlation matrix was next inspected 
to determine the strengths of the relation- 
ships of biographic, MMPI, and mental health 
opinion scores to the job proficiency mea- 
sures. Those relationships which resulted in 
statistically significant (p < .05) correlations 
are presented in Table 1. Most of the MMPI 
scales are clearly related to the Specific Per- 
formance and Retention Ratings, while only 
a few of the mental health opinion items were 
so related. The strength and direction of the 
correlations between the personality variables 
and job performance indicates that adjustment 
is related to job proficiency. Few significant 
correlations between the Global Proficiency 
Rating and other variables were obtained. 

In order to further clarify the relationships 
among the variables, the correlation matrix 
was subjected to a principal components 
analysis. Nineteen components with eigen- 
values greater than one were subjected to 
orthogonal rotation according to Kaiser’s 
(1958) Varimax procedure. These 19 factors 
accounted for 83.23% of the total variance. 
The first factor accounted for 15.09% of the 
total variance, the second accounted for 
10.18%, and the third for 5.09%. The remain- 
ing 16 factors accounted for less than 5% 
each. Items loading .30 or higher on the first 
two factors are presented in Table 2. 

The first factor is composed of the major 
MMPI scales, the Specific Proficiency Rating, 
and the Global Retention Rating. The mag- 
nitude of the loadings of the proficiency scales 
and the MMPI scores is impressive. Eleva- 
tions on any of these MMPI scales would 
indicate psychological disturbance. Conse- 
quently, their combination with the proficiency 
scores on the first factor again supports the 
hypothesized relationship between. general 
psychological adjustment and competence in 
caring for severely disturbed children, 

The second factor appears to be determined 
by the S sampling. The item loadings indicate 
that Ss were comprised of nurses and child 
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care workers, and reflects the fact that child 
care workers were males working toward col- 
lege degrees, while the female nurses already 
had their degrees and were somewhat older. 
Four mental health opinion items also ap- 
peared on this factor. 

Each of the rest of the several factors ac- 
counted for a small percentage of the total 
variance, and for the most part they were 
comprised of mental health opinion items. 
Only a few items loaded very heavily on each 
of these factors and they consequently seemed 
to be relatively specific in nature. This multi- 
plicity of factors is consistent with the fact 
that the mental health opinion items were 
contained in 10 factors in Nunnally’s original 
work, The Specific Proficiency Rating did not 
load on any of these remaining 17 factors, 
while the Retention Rating and Global Pro- 
ficiency Rating each loaded slightly on a cou- 
ple of them. 

In contrast to the results of previous stud- 
ies, the present findings support the assump- 
tion of a relationship between general psycho- 
logical adjustment and child care proficiency 
in a setting demanding the formation of 

“meaningful relationships with severely dis- 
turbed children. Demonstration of this rela- 
tionship may be a consequence of using stable 
Measures of job performance and personality. 
Admittedly the use of a large number of 
variables would enhance the chances of finding 
significant relationships, and the factor ana- 
lytic procedure would make maximum use of 
minimal individual differences. Nevertheless, 
the clarity of the relationship is impressive, 
for the S sample represented a restricted ad- 
Justment range because of preemployment 
Screening, 

Although any correlational study is limited 
by the items that have been selected for use, 
the resulting correlations, and in this case the 
factor analytic solution, is informative of the 
Kinds of items that are most associated with 
one another. Thus, it is important to note 
those items that did not seem to be related to 
pe Proficiency as well as those which were 
rent be related.. It does seem that the 
if ae health opinions held by these persons 
„~ € time of their employment were rela- 


tively unrelated to their ultimate job pro- 
ficiency, 
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The use of individual item scores which 
would be expected to have lower reliability 
than scores based on several items may have 
contributed to the low correlations. However, 
another possible explanation for this lack of 
a relationship derives from the content of the 
mental health opinion questionnaire itself. 
An examination of the questionnaire reveals 
that it is more concerned with “mental ill- 
ness” than with mental health; many of the 
items deal with the etiology and treatment of 
“mental illness.” As has been pointed out fre- 
quently (e.g., Sarbin, 1967), this model may 
not be pertinent. Attitudes based on the medi- 
cal model could be irrelevant to the kinds of 
behavior which are important for the psycho- 
logical treatment of disturbed children. This 
point may have been underscored in the pres- 
ent study since the nurses and child care work- 
ers were rated for their performance in a fa- 
cility which is little influenced by the mental 
illness concept, where personnel are highly 
aware of “behavior,” and where learning 
principles are used to guide treatment pro- 
cedures. 
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This therapy analogue study dealt with the interaction of client personality 
(dominance—Dom) and therapist directive versus nondirective techniques in 
producing client resistance. Forty High-Dom and 40 Low-Dom undergraduate 
male Ss, while role-playing client problems, were required to make choices 
between 32 pairs of high- and low-resistant client responses following thera- 
pist statements. Results showed that nondirective techniques elicited most 
client resistance, Low-Dom Ss tended to give more resistant responses, and both 
effects were most pronounced when Ss assumed hostile roles. The S anxiety 
was also found indirectly to be related to resistance. 


Familiar to virtually all therapists is pa- 
tient behavior during the therapy interview 
which appears to be uncooperative and even 
negativistic with regard to the goals of ther- 
apy. Generally conceived, this is one aspect 
of the classical problem of “resistance,” a 
concept born within the framework of tradi- 
tional psychoanalytic theory (cf. Fenichel, 
1945, p. 26) but which has come to possess a 
variety of etiological connotations, including 
clients’ fear of social disapproval and thera- 
pists’ resentment about patients’ noncoopera- 
tion. Regardless of one’s theoretical allegiance, 
the significance of the resistance phenomenon 
in therapy is hard to overestimate. Indeed, 
Schwebal (1960) has remarked that “The 
various investigations of the correlates of suc- 
cess or failure, of continuing or terminating 
in counseling and therapy, may well be sepa- 
rating the high and low resisters [p. 482].” 
The position taken here is that understanding 
of resistance in therapy may be advanced by 
conceptualizing this behavior as a function of 
client personality variables, and of differential 
client responses to various therapeutic tech- 
niques. 

Classified broadly, empirical studies of re- 
sistance have come from two directions which 
could probably benefit from integration: (a) 
those that relate resistive client behavior to 
client and therapist characteristics defined 

_Ppsychometrically (e.g., Heilbrun, 1961a, 
1961b; Taulbee, 1958) and (b) those which 
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relate resistive client behavior to directive 
and nondirective therapist interview behavior 
(e.g., Bergman, 1951; Carnes & Robinson, 
1948; Frank, 1964; Frank & Sweetland, 1962; 
Truax, 1961, 1963). While these studies are 
important and provocative, they do not per- 
mit evaluation of the interactions of the com- 
bined roles of client personality and therapist 
interview behavior in producing client resistive 
behavior. Based on findings of these earlier 
investigations, two major hypotheses were 
generated: Hypothesis I: High-dominance Ss 
would offer more resistance in a therapy ana- 
logue situation than would low-dominance Ss. 
Hypothesis II: Directive therapist statements 
would elicit more resistant client responses in 
a therapy analogue situation than would non- 
directive therapist statements. 


METHOD 
Subjects 


High-dominance Ss. Forty Ss were selected from a 
larger pool of approximately 240 male undergradu- 
ates at the University of Texas, who were given the 
Adjective Check List (ACL; Gough & Heilbrun, 
1965), Those with highest scores (T of 60 or above) 
on the Dominance scale (Dom) of the ACL repre- 
sented the High-Dom group (High Dom). 

Low-dominance Ss. Forty Ss were similarly se- 
lected from the same pool on the basis of having 
the lowest Dom scores (T of 44 or below) on the 
ACL (Low Dom). 


Test Stimuli 


Typescripts. During the course of the experiment, 
Ss were asked to assume the roles of eight hypo- 
thetical clients (two each with academic, interper- 
sonal anxiety, homosexual, and hostile impulse con- 
trol problems) who had sought professional help. 
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All Ss were instructed to respond to the therapist 
statements in a manner consistent with what they 
believed their own behavior would be in each situa- 
tion and were informed that there were no “right” 
or “wrong” answers. 

A short description of the problem leading the 
client to seek professional help was followed by four 
typical therapist statements, each of which was 
followed by a choice of two client responses, one 
resistant, the other a nonresistant, cooperative re- 
sponse. The Ss responded to directive therapist 
statements in one of the two cases in each problem 
area, and to nondirective statements in the other 
case of the same problem area. Orders of presenta- 
tion of therapist statements and the problem areas 
were counterbalanced to minimize sequence effects. 

Similarly, the order of the resistant-nonresistant 
client alternative responses was alternated in each 
case. 


Therapist Statements 


As in other studies of therapist directiveness-non- 
directiveness (Bohn, 1965; Carnes & Robinson, 
1948; Finesinger, 1948; Frank, 1964; Parker, 1967; 
Parsons & Parker, 1968), directive therapist state- 
ments were defined as those which would tend 
clearly to lead, direct, or control the verbal activity 
during the therapy session, and included direct ques- 
tions and forcing the topic directively. Nondirective 
therapist statements were defined as those which 
would tend to give responsibility of decision for 
choice of area and direction of verbal activity largely 
to the client, as well as those responses which reflect 
or clarify the client’s affect. Nondirective leads, 
clarification of feeling, and restatement of content 
Were the three types of nondirective therapist state- 
ments used. The classification of these statements in 
the analogue material was assessed by asking three 
Judges (experienced therapists) to categorize each 
therapist statement. Interjudge agreement with re- 
gard to the major directive-nondirective categories 
was high (94%). 


Client Responses 


3 Resistant client responses were defined as those 
indicating overt or covert opposition to the thera- 
pist and the therapeutic process and/or an unwilling- 
ness to explore problems, to establish rapport, or to 
TA collaborative efforts. Here also, a similar group 
a are judges was asked to indicate which alterna- 
whi In each pair of client responses was resistant and 
which was nonresistant. One hundred percent inter- 
Judge agreement was achieved. 


Ability of Ss to Assume Roles 


Debriefing indicated that Ss generally had little 
culty assuming the client roles. However, in order 
ity pean a more systematic assessment of Ss’ abil- 
th roe Day, a postexperiment questionnaire was 
Gain, fe focused on Ss’ feelings of nervousness 
sane e experimental task, and was evaluated in 

ton to $ Dom level, as described subsequently. 


to 
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TABLE 1 


ANALYSIS OF VARIANCE OF RESISTANT RESPONSES AS 
A Function OF “CLIENT” DOMINANCE, THERAPIST 
DIRECTIVENESS, AND TYPE OF 
CLIENT PROBLEM 


Source df F $ 
Between 79 
Dominance (A) 1 3.15 08 
Error 78 
Within 560 
Directiveness (B) 1 7.28 .008** 
AXB 1 225 13 
Error 78 
Client Problems (C) 3 1.21 .30 
AXC 3 36 49 
Error 234 
BXC 3 3.24 .02* 
AXBXC 3 24 | 87 
Error 234 
Total 639 
* 
#3 Si 
RESULTS 


Hypotheses I and II were tested with a 
Type VI analysis of variance (Lindquist, 
1953). The between-groups variable was 
dominance while directiveness and type of 
client problem were the two within-groups 
factors. Results of the analysis are summarized 
in Table 1. 

Surprisingly, the overall results were con- 
sistent with the reverse of both hypotheses. 
As shown in Table 1, there was a nonsignifi- 
cant tendency ( < .08) for Low-Dom Ss to 
give more resistant responses than High-Dom 
Ss. In relation to Hypothesis II, the results 
were highly significant (p< .008) but di- 
rectly contrary to those predicted, with more 
resistant responses following nondirective 
therapist statements than after directive state- 
ments. 

A third finding which was not predicted 
was the significant interaction (p < .02) be- 
tween type of client problem and therapist 
directiveness. This interaction is shown in 
Figure 1, from which it can be seen that non- 
directive therapist statements were followed 
by more resistant responses than were direc- 
tive statements in all client problem areas 
except academic. 

For academic problems, while not statisti- 
cally significant (¢ = 1.33, p > .05), this re- 
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X resistant 


responses 
per case 


Interpersonal 


Academic 
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X resistance following: 
nondirective therapist statements 
directive therapist statements 


Homosexual Hostility 


Client Problem 


Fic. 1. Interaction of therapist directiveness with problem type. 


lationship was reversed. The mean numbers 
of resistant client responses per case following 
directive therapist statements were compared 
with those for nondirective statements for each 
of the four problem areas. The difference was 
significant only for hostility control problems 
(t = 5.85, p < .001). Following this finding, 
an independent sample of 20 undergraduate Ss 
was asked to rate on a 9-point scale the de- 


TABLE 2 


Nervousness PLAYING CLIENT ROLES: 
Hicu- anp Low-Dom AVERAGES 


Group Not bothered 


. Felt uneasy . 
High Dom ` 30.1 9.9 
23.6 16.4 


Low Dom 


Note.—x? = 6.32,p <.02. 


gree of each problem’s seriousness (1 = mildly 
serious; 5=quite serious; 9 = extremely 
serious). Seriousness was defined as the extent 
to which the problem would (a) cause the 
client to experience interpersonal difficulties, 
(b) reduce his effectiveness in reaching per- 
sonal goals, (c) interfere with his achieving 
a reasonable degree of personal satisfaction, 
and (d) create the need for professional 
therapeutic intervention. The mean ratings 
obtained were academic, 2.07; interpersonal, 
5.88; homosexual, 6.65; hostility, 8.14. By ¢ 
test comparisons, all differences in’ average 
ratings of seriousness were. significant. (p < 
:05) except between interpersonal and homo- 
sexuality: problems. The results: of this analy- 
sis suggest that resistance, particularly to non- 
directive therapy techniques, is related to the 
perceived severity of the problem, 


ek, 


i 


THERAPY RESISTANCE 


Results of the postexperiment question- 
naire summarized in Table 2 indicated that, 
averaging over all client roles, significantly 
more Low-Dom Ss than High-Dom Ss re- 
ported feeling uneasy while performing the 
task. As described below, it seems clear that 
this relationship may have considerable bear- 
ing on explanation of the Dom-resistant find- 
ings being generally the reverse of prediction. 


Discussion 


While the results were not consistent with 
the specific predictions, the more general hy- 
pothesis that resistance may be a function of 
therapist technique and of client personality 
characteristics has received support. The main 
effect showing significantly fewer resistant re- 
sponses to directive than to nondirective 
therapist statements must of course be quali- 
fied in the light of a significant interaction be- 
tween directiveness and type of client prob- 
lem. This interaction suggests that resistance 
may be reduced if therapist technique is varied 
according to the type of client problem. Closer 
examination of the analogue data indicated 
that nondirective statements elicited signifi- 
cantly more resistance than directive state- 
ments with the “most severe” (hostility con- 
trol) problems and the least resistance (less 
than directive—a reversal of the main effect) 
with the least severe (academic) problems. 

Therefore, while the main effect would sug- 
gest that these male Ss, most of whom had not 
had counseling or therapy, would favor a more 
directive, controlling therapist to one who 
Places the responsibility for direction of the 
Interview onto the client, this may be least 
likely to be true with milder complaints (i.e., 
academic or study problems). Considering the 
Similarity of the present results to Klein’s 
(1967) finding that directive advice-giving 
‘erapy was strongly preferred over nondirec- 
_ tive, reflective therapy by naive adult Ss in an 
€xperimental situation, another major impli- 
Cation is that in an ambiguous situation such 
_ 4S an initial therapy interview, it is easier, 
= demanding, and perhaps less threatening 
O respond to direct questions and themes 
Chosen by the therapist than to nondirective 
Statements, 

Other possible explanations of these results 
ear on problems related to the analogue situ- 
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ation itself. The Ss here were not actually 
seeking help for the problems they role-played. 
With the single exception of academic prob- 
lems, the problem areas here were probably 
not personally significant to many of the Ss. 
Consequently, it could be argued that in the 
analogue situation, Ss were not only respond- 
ing to therapists’ statements but were also 
attempting to play a largely foreign role. Of 
course, in some respects, this is not unlike the 
situation in which an actual client finds him- 
self when he comes for an initial interview. 
Any directive therapist remark that would 
help S in his task by specifying what he 
should talk about (as opposed to nondirective 
cues that would leave S shouldering responsi- 
bility largely on his own) would doubtless 
tend to be followed by “cooperative” client 
statements. It is not altogether clear that this 
kind of confounding has been successfully 
dealt with in most analogue studies. It is 
clear, however, that it is difficult to account 
precisely for the contribution of these factors 
when generalizing from the analogue to actual 
clinical practice. 

The finding of a tendency toward a nega- 
tive relationship between S Dom and resist- 
ance, although in reverse of prediction, is 
consistent with the psychoanalytic notion 
that greater resistance is associated with 
greater anxiety. The postexperiment question- 
naire data in this study indicated that Low- 
Dom Ss felt more uneasy playing these client 
roles than did the High-Dom Ss and signifi- 
cant negative correlations have been reported 
between the Dom scale and the MMPI Pt 
scale as well as the Welsh Anxiety scale 
(Gough & Heilbrun, 1965). Perhaps the basic 
implication of this finding is that there is a 
hierarchy of factors affecting client responses 
in the therapy situation and these may be 
considered in terms of their prepotency. The 
evidence here suggests that client anxiety is a 
more powerful influence than client Dom in 
accounting for client behavior, at least early 
in the therapy relationship. 
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TEMPORAL CORRELATES OF THE ACHIEVEMENT VALUE 
AND MANIFEST ANXIETY ' 


THOMAS J. COTTLE? 


relatedness between time zones. 


From the point of view of time, it is the 
} concept of the future that distinguishes most 
Clearly the achieving person from the anxious 
| one. Whereas the achiever involves himself in 
mastering destiny (Strodtbeck, 1958), the 
anxious person seems overly apprehensive or 
indeed dreading the future’s arrival (Persky, 
Grinker, Mirsky, & Gamm, 1950). Moreover, 
whereas achievement values reinforce the 
urge to keep moving forward in time (McClel- 
land, 1961), anxiety increases one’s appetite 
for the safety of the familiar and actually 
| Serves as a danger of impending events 
| (Freud, 1936). 
| Several psychologists have demonstrated 
| Systematic variations in time perception 
| mong achieving and anxious persons (Epley 
g: Ricks, 1963; McClelland, 1961; Wallace & 
Rabin, 1960; Wohlford, 1964). McClelland 
(1961), for example, reported that high 
: achievers used more future tense sentence 
Construction than low achievers of a compa- 
table sample, Anxiety, on the other hand, has 
been shown to deflect individuals away from 
e future and increase sentiments that suc- 
a in personal action has a low probability 
ot occurrence. Wohlford (1964) observed: 
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A study of the relationship between achievement values, manifest anxiety, 
and orientations to time was reported. Controlling for the effect of intelligence, 
it was found in a sample of respondents, ages 18-23, that whereas achievement 
is related to accepting the possibilities of the future and to perceiving an 
“integration” between the past, present, and future, anxiety is associated with 
apprehensions about the future, and functions in decreasing the sense of 
In addition, anxiety is associated with fan- 
tasies of temporal recovery and preknowledge. It was suggested that the 
achievement value enhances a realistic acceptance of time’s unalterable proper- 
ties, while anxiety encourages fantasy resolutions of realistic temporal problems. 


The farther into his future a high anxiety individual 
roams on a pre-conscious level, the more he is 
frightened by what he sees there, and this causes 
him to draw back more into the present and past 
in his conscious thought [p. 82]. 
Wohlford’s words are reminiscent of Freud’s 
(1936) definition of anxiety as “the reaction 
to a situation of danger; [usually] circum- 
vented by the ego’s doing something to avoid 
the situation or retreat from it [p. 65]. 
Accordingly, it is not surprising that anx- 
ious persons would attribute a heightened 
degree of potency to the future (MacKinnon, 
1944). In so doing, they may unwittingly be 
confessing an inability to confront the un- 
known or simply cope with their own appre- 
hensions. Achievers on the other hand, seem 
to be governed by a sense of personal respon- 
sibility (Strodtbeck, 1958) and even vital 
self-preoccupation (Henry, 1949). Thus, if 
anxiety cause an exaggeration of the future’s 
dangers, achievement values reinforce an exag- 
gerated sense of man’s ability to control these 
dangers. ‘ 
One way to understand the variations in 
time perceptions of anxious and achieving 
persons is to examine the extent to which 
individuals relate the past, present, and 
future. There is, for example, a constant 
forward thrust in the achiever’s present which 
results perhaps from his sense of the unat- 
tained or his frequent feelings of unfulfillment 
(McClelland, 1961). It is not surprising that 
his involvement with chronological time 
should be one of such eager anticipation that 
he sets his watch a few minutes fast (Cortes, 
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1960, cited in McClelland, 1961). Through 
active planning, achievers unite the present 
and future. As McClelland (1951) wrote, 

It is as if the [achievement] need has served to relate 
present achievement experiences to future ones, to 
promote understanding of the present in terms of a 
wider context. Motives seem to tie the present to 
the future, the specific to the general and long run 
[p. 486]. 

In contrast, anxious persons appear less 
able to establish this union of time zones. In 
many of their perceptions, they reflect a 
temporal as well as personal disunity (May, 
1950) or what some have called ego disinte- 
gration. For them, the temporal horizon is 
muddled, futures are not adequately differen- 
tiated from presents (Melges & Fourgerousse, 
1966), and the fantasy of regaining the past 
is common. 

A connection between time zones, and par- 

ticularly between the present and future, sug- 
gests a sense of competence in one’s ability 
to shape the future through personal effort 
(Muller, 1967). If this is true, achievers 
should portray the connection more often than 
anxious persons. Equally important, confi- 
dence in affecting outcomes, being the agent 
of causation (Bakan, 1966) or otherwise in- 
fluencing one’s environment (White, 1959), 
should lessen the need for fantasy solutions 
to temporal problems. The past now may be 
relinquished and the future left unknown until 
that proper moment when the feeling of ef- 
ficacy described by William James (1912, 
cited in Muller, 1967, p. 9) comes to be 
experienced: 
Sustaining, persevering, striving, paying with effort 
as we go, hanging on and finally achieving our 
intention—this is pure action, this is effectuation in 
the only shape in which, by a pure-experience phi- 
losophy, the whereabouts of it anywhere can be 
discussed . . . here is causality at work. 

The present study examines relationships 
between achievement values, manifest anxiety, 
and six areas of time perception. In so doing, 
11 major hypotheses are generated. 

The first two variables explore an experi- 
ential time orientation and establish basic 
temporal definitions of the present. It is 
hypothesized that the achiever’s forward look 
will display itself (a) in an emphasis on ex- 
pectation, rather than on reflection, and (b) 
in a demonstration that the future border 
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of the present, essentially the time between 
what an individual defines as now and the 
commencement of his future, will seem more 
important, hence, longer than the past border 
of the present, essentially the time between 
now and the conclusion of one’s past. On the 
other hand, anxiety, which has been defined 
in terms of fear of the unknown (Freud, 
1936) (c) should reduce the significance of 
expectations but (d) increase the degree to 
which past experiences become important. 
This past orientation should reflect in a dem- 
onstration that the present’s past border, that 
is, the time of completed events still within 
the present, seems more salient, hence longer. 

The notions governing these hypothesized 
associations, therefore, are, first, that salience 
and extension are equated, and, second, that 
the salience of time zones, as in past and 
future orientations, reflects in the extensions 
of the present’s boundaries corresponding to 
the orientation. 

Because of their belief in personal control 
or the sense that they will affect the future, 
as they have the present, achievers (e) should 
demonstrate temporal relatedness or the sense | 
of a connection between the past, present, — 
and future. In contrast, as a way of disen- 
tangling the confusion of time zones (Melges — 
& Fourgerousse, 1966), anxious persons (f) 
should demonstrate temporal unrelatedness, 
which symbolizes, moreover, their lack of a 
sense of personal efficacy and control (Cottle, 
1967; White, 1959). 

Anxiety about the pace of life and appre- 
hension about the future (g) should increase 
as a function of manifest anxiety but (4) de- 
crease as a function of valuing achievement. 
This last hypothesis is advanced with the 
rationale that the constant establishment of | 
new aspirations characteristic of achievers i 
makes their future seem less ambiguous and 
foreboding. Furthermore, because of achiev- 
ers’ preference for moderate risk-taking situa- 
ations (McClelland, 1961), achievers should | 
manifest a belief that sufficient time exists to 
complete present activities and conceive future 
plans. Thus, future events should appear more 
manageable to achievers, 

Anxious concern with the future (i) should 
cause an exaggeration of the sense of its po- f 
tency (MacKinnon, 1944) and, in an almost » 
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wishful manner, its goodness as well. But the 
responsibility for personal management of the 
future, so paramount in the achiever’s out- 
look, (j) should lessen the potency and evalu- 
ation ratings of the future. Once again, pre- 
occupation with an unmanageable stimulus, in 
this case the future, produces heightened 
estimations of its power and goodness. 

Finally, anxiety (&) should encourage re- 
treats to fantasy explorations of the past and 
future. Rationale for this hypothesis comes, 
first, from Freud’s (1936) observation that 
anxiety evokes temporal retrieval fantasies 
which permit one to rearrange prior action 
or redo or undo unsatisfactory experiences. 
Second, in light of the apprehension generated 
by future unknowns, preknowledge fantasies 
should seem especially exciting as they offer 
arather protected way of dealing with these 
unknowns and uncertainties. 

Predicting relationships between achieve- 
ment values and temporal fantasies is more 
| difficult. Central to achievement is its mini- 
mization of prior involvements, particularly 
| with family members (Strodtbeck, 1958). 
While this feature may reflect in the related- 
hess variable, that is, in the disconnection of 
the past from the present and future, it is not 
cleat whether commitment to achievement 
Permits the luxury of temporal retrieval 
through fantasy. 

Fantasies of preknowledge also may cause 
achievers conflict as their many ideals and 
heightened investment in the future may make 
them more eager to learn how the future 
Works out. Yet, in a more realistic framework, 
_ the mystery produced by unknowns may pro- 

vide the very energy igniting the achievement 
drive in the first place. Thus, while anxiety 
should support fantasy resolutions of prob- 
lems, reactions of achievers to temporal 
fantasies are not easily predicted. 

A serious question arising in the present 
research was to what extent does intelligence 
influence Perceptions of time? Although the 
data are not always consistent, intelligence 
as been shown to increase future extensions 
on various time measures and Thematic Ap- 
Torception Test responses (Epley & Ricks, 
963; Murray, 1959; Wohlford, 1964). Fur- 
aries studies demonstrating interrelation- 

"PS between intelligence, achievement, and 
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anxiety (Lavin, 1965) indicate that intel- 
ligence should be examined for the effect it 
might have on the two independent variables. 


METHOD 
Subjects 


The Ss were 332 men and 98 women from a mid- 
west Naval Training Station. All were students in 
a voluntary medical corpsman program consisting 
of elementary medical education and patient care. 

Given their parents’ occupations and completed 
education levels, Ss were judged to be from the 
lower middle class. All had finished high school but 
few had been to college. Most came from moderate 
sized cities, rarely from rural areas. Although ranging 
in age from 18-23, less than 4% were married, and 
in the main, all were Protestant or Catholic. Finally, 
based on tests to be described, these students were 
found to be significantly more intelligent than their 
regular Navy colleagues. 

One advantage of using this rather homogeneous 
population is that all students were engaged in 
practically the identical activities during the time of 
testing. Thus, if the program or Navy environment 
affected temporal perceptions, the effects should have 
been similar on all 430 students. 


Instruments 


Achievement values. Achievement values (VAch) 
were measured with an 8-item inventory (Strodtbeck, 
1958), derived from the theory that achievement 
implies individual and autonomous action, personal 
control of destiny, and minimization of dependencies 
on parents. 

Items are presented as statements with which 
respondents are either to agree or disagree. Examples 
of items include, “Planning only makes a person 
unhappy since your plans hardly ever work out 
anyway”; “When a man is born the success he’s 
going to have is already in the cards, so he might 
as well accept it and not fight against it”; “Nowa- 
days, with world conditions they way they are, the 
wise person lives for today and lets tomorrow take 
care of itself.” The maximum VAch score is eight. 

Anxiety, Anxiety was measured with the Taylor 
Manifest Anxiety (MA) scale (Taylor, 1953, 1956). 
A shortened form consisting of 21 items was used 
as it was found in a matched sample to correlate 
highly with the complete MA scale (for males, 
r= 87; for females, r=.85).8 The MA scale in- 
structs Ss to agree or disagree with statements 
regarding self-perceptions of anxiety symptoms. 
Maximum score is 21 with the higher the score, the 
greater the manifest anxiety. 


8 Corpsmen in the same program but not partici- 
pating in the study served as the matched sample. 
These Ss also received all the remaining tests, but in 
no instance did they vary significantly from those 
students actually selected for the study. This com- 
parison of samples was undertaken in order to con- 
firm the fact that Ss chosen were a random sample 


of corpsmen. 
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Intelligence. Intelligence was measured with the 
Army General Classification Test, (1945). Adminis- 
tered to entering military personnel, the test consists 
of two aptitudes, verbal (GCT) and mathematical 
(ARI). Intelligence represents the sum of the two 
subtests each given equal weight. In the population 
studied, the correlation between subtests was .74. The 
maximum intelligence value is 150. 

Experiential time orientation. The Experiential 
Inventory (Cottle, 1968) explores predispositions to 
reflection (past orientation), momentary encounter 
(present orientation), and expectation (future orien- 
tation). Temporal designations are determined by 
listing important life experiences and assigning these 
experiences to a specific time zone. 

The first half of the inventory’s instructions are 
as follows: t 


Please list the 10 most important experiences of 
your life. These may be experiences you have had, 
you are having, and experiences you expect to 
have, You only need to write a few words for 
each experience and you may list your experiences 
in any order you wish. 


At this point temporal orientations are implicit. 
In the second half of the instructions, however, time 
becomes the central operating variable: 


Now that you have listed 10 experiences, please 
study the time zones below. 
Time Zones 

1. Distant Past 

2. Near Past 

3. Present 

4. Near Future 

5, Distant Future 
Now take each experience and decide if it has 
occurred, is occurring, or will occur. Then choose 
the number of the time zone that best represents 
the time of the experience and write this number 
in front of the experience. Do this for all 10 
experiences. 


An experiential mean is calculated by summing 
the numbers of the listed time zones and dividing 
the number of experiences by 10. Essentially this 
value indicates where an individual locates himself 
in time: the higher the score, the more future 
oriented the S. 

The mean is misleading, however, particularly in 
the center of the distribution of scores. For example, 
a mean of 3.0, theoretically a perfect present orienta- 
tion, may result from locating all experiences in the 
present or from dividing experiences equally between 
the distant past and distant future. Thus, a numeri- 
cal present orientation may mask an avoidance of 
present experiences, 

To control for this, and for the purpose of estab- 
lishing experiential orientation categories, respondents 
were divided into three groups. To a certain extent, 
the constitution of these groups is a function of the 
frequency distribution of experiences in the five time 
zones, : 

The first group consisted of those persons reporting 
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no near or distant future experiences. While they 
were called “past-present orienters” (pa-pr) they 
also may be thought of as being “future avoidant.” In 
that all remaining persons listed at least one future 
experience, they were further divided into those 
listing three or more near and/or distant future 
experiences, so-called “future orienters,” and those 
listing less than three but at least one future experi- 
ence. This last group was labeled “samplers” as it 
turned out that they generally tended to distribute 
their 10 experiences throughout the five zones more 
widely than the other two groups. 

It should be pointed out that while pa-pr orienters 
by definition list no future experiences, future orient- 
ers typically list some near and/or distant past 
experiences. This finding is anticipated somewhat by 
the rather low or pa-pr oriented experiential mean 
values (for males, X = 2.34, SD = .654; for females, 
X =2.38, SD= .624). 

Present duration. The Duration Inventory (Cottle, 
in press a) inquires into the temporal units selected 
for defining the duration of the past and future 
borders of the present, as well as the near and dis- 
tant past and future. 


Instructions for the Inventory are very simple: 


Read the sentences below and fill in the blank spaces 
with one of the listed words. Indicate your choice 
by placing the number of the word in the blank 


space. Select only one word for each blank space. 


1. Seconds; 2. Minutes; 3. Hours; 4. Days; 
5. Weeks; 6, Months; 7. Years. 

I. The PRESENT as I think of it extends from 
ago to from now. 
II. The DISTANT PAST as I think of it includes 
things and events which occurred ago, 
while the NEAR PAST includes things and 
events which occurred ago. 
The DISTANT FUTURE as I think of it, 
includes things and events which will occur 
from now, while the NEAR FUTURE 
includes things and events which will occur 
from now. 


Ill, 


It should be noted that while an artificial distinc- 
tinction exists between clock (short) and calendar 
(long) units, a questionable scale incorporating the 
seven options nonetheless is established: the higher 
the number, the longer the duration. In this paper, 
only duration of the past and future borders of the 
present will be considered. 

Temporal relatedness. The degree to which time 
zones are perceived as relating is examined in the 
Circles Test (Cottle, 1967). The Test’s instructions 
are to 


think of the past, present, and future as being in 
the shape of circles. Now arrange these circles in 
any way you want that best shows how you feel 
about the relationship of the past, present, and 
future. You may use different size circles, 


Completed circle configurations normally conform | 


to one of four designs: 
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1. Atomicity (Whitrow, 1961). Circles fail to touch 
and by demonstrating no spatial relatedness earn 
zero points on the dimension. 

2. Continuity. By touching at peripheries only, 
circles depict a flow of units with each unit 
occupying its own finite “space.” Two points 
are earned for each peripheral touch. 

3. Integration (Murray, 1959; cf. Lessing, 1967; 
Wessman & Ricks, 1966). When circles overlap 
one another so that a particular space is shared 
by at least two zones, four points are earned. 

4. Projection (Gurvitch, 1964). Circles may be 
drawn within (or projected into) one another 
or perfectly contiguous. That is, the space of 
the present and future may be identical. Either 
configuration receives a maximum of six points. 


Total relatedness scores represent the sum of indi- 
vidual circle relationships: past-present, past-future, 
present-future. The summing operation poses a prob- 
lem in interpretation, however, for as many as three 
different designs may be drawn in one configuration, 
Tn these instances assignment to a particular typol- 
ogy is made. The exact location of relatedness is 
determined by computing individual circle scores. 

Other variables are generated in the Circles Test 
but they have been omitted in the present study 
(Cottle, 1967). 

Temporal anxiety. The Temporal Anxiety (TA) 
Scale consists of 12 statements dealing generally with 
the pace of life, fear of the future, and temporal 
orientations. The statements were randomly dispersed 
among a 39-item Time Attitude Inventory which 
instructed Ss to register agreement-disagreement on 
a 7-point scale. 

To determine whether Ss judged time-anxiety items 
as belonging to a single cluster, a factor analysis 
of the 39 items was performed. A principal axis 
Solution and Kaiser Varimax rotation operation was 
employed. Results of the analysis revealed all 12 
anxiety items loading on the first factor. The factor 
consisted of the following items where the number 
ìn parentheses refers to the item’s position in the 
Time Attitude Inventory: (32) Im afraid I won't 
be able to lead a full life; (31) The pace of life is 
too fast for me; (19) I’m often so worried about 
what is going to happen, I forget about right now; 
(33) It’s hard for me to work on a task when I 
know there's not enough time to finish; (34) I live 
ìn the past; (39) A minute seems too small an 
amount of time to be of any use; (23) I dislike 
change ; (18) I live in the future; (20) More than 
knowing what's happening in the world, I like to 
now why things happen; (1) Sometimes I feel I’m 
Hoe time and losing; (11) Often I think how nice 
t would be to stop time; (15) I live in the present. 

All the items load positively except for No. 15. 
a e average loading value is 48.2 and the percentage 
Š aaa variance is 30.4. Individual TA scale 
alues were mechanically computed factor scores: 

a Values indicate high temporal anxiety. 
ed potency and evaluation. Potency and evalu- 
; pate Tatings were measured on Semantic Differential 

€s (Osgood & Suci, 1955; Osgood, Suci, & 
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Tannenbaum, 1957). The Semantic Differential in- 
structs respondents to use adjective pairs for express- 
ing, in this case, potency and evaluative attitudes 
toward the future. Each of the two factors consisted 
of five pairs of polar adjectives displayed on 7-point 
scales. The evaluation factor was comprised of the 
pairs: clean-dirty; good-bad; kind-cruel; fair-unfair; 
and pleasant-unpleasant. Potency adjectives included: 
hard-soft; large-small; strong-weak; brave-cowardly ; 
and rugged-delicate. In each of the above pairs, the 
first word listed is the high-evaluation or potency 
word. In actual presentation, scales were intermixed 
with high words randomly assigned to the left- or 
right-hand side of the scale. 

The presence of potency and evaluation factors 
was confirmed by factor analyzing results of the 
Semantic Differential. For this analysis, an orthogonal 
rotation was employed. In the present study, how- 
ever, raw means of the two factors have been used. 

Temporal fantasy. Interest in temporal recovery 
and preknowledge were measured on the Money 
Game (Cottle, in press b). In this task, fantasy is 
made explicit by transforming time allotments into 
purchaseable commodities. Important to notice in 
the instructions is Z’s acknowledgment of the 
fantasy: 


Pretend that you had a lot of money, more money 
than you could possibly use. Pretend also that 
someone had the power to sell you time, any time 
that you would want, and you would have this 
time given to you right now, knowing what you 
now know. Furthermore, having this time you 
could do whatever you wanted with it. 


The Ss then are asked to consider the following 
money options which they may spend for the pur- 
chase of an hour, a day and a year from their per- 
sonal past and future: (a) $0.00, (b) $10, (c) $100, 
(d) $1,000, (e) $10,000. Interest is operationally de- 
fined as the amount selected. The Money Game also 
includes allotments from time before birth and fol- 
lowing death, but these sections are not considered 
here (Cottle, in press b). 

The various time allotments of the game were 
included as it was felt that while some persons might 
desire an hour’s peek at their future, they might 
find a year too much or too threatening. Still others, 
eager to purchase a year might consider an hour 
too brief to be of worth. However, as the inter- 
correlations between allotments and payments were 
so high (average r = .86), the idea of examining indi- 
vidual allotments was dropped, and instead, the 
three payment amounts for each epoch were summed. 
Thus, fantasy scores ranged from no payments for 
any allotment to the full investment of $30,000, 
$10,000 for each allotment. 

The issue of reality constraints has been purposely 
ignored in the Money Game. Nothing in the instruc- 
tions states that obtaining premature glimpses of the 
future will in any way disturb “more normal” 
eventualities. Yet, the effect of money cannot be 
assessed as there is no way of knowing the degree 
to which playing with money influences the fantasy. 
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TABLE 1 
CORRELATIONS OF TEMPORAL VARIABLES WITH VAcH, MA SCALE, AND INTELLIGENCE 
VAch MA scale Intelligence 
Temporal variables 
M F M F M F 
Experiential time orientation -07 :19 —.04 .07 -06 Aly 
Temporal relatedness 254e .26* —.26* Al ci bagi .20* 
Temporal anxiety —.28** —.28* Hiper .48** —.01 —.01 
Present past duration —.02 -18 —.02 —.07 —.10 —.09 
Present future duration —.02 -20* 01 —.21* 10) —.18 
Future potency —.08 —.05 .07 05 —.00 15 
Future evaluation .06 16 —.04 —.06 10 08 
Past recovery —.10 —.08 .26* 12 = 11 —.09 
Future preknowledge —.07 02 01 16 —.02 —.08 
* 
mb Sat 
Procedure a correlation between temporal recovery and 


Testing was done in group paper-and-pencil fashion 
during scheduled class periods under the supervision 
of a civilian examiner. Time instruments were ad- 
ministered in the second week, achievement and anx- 
iety scales in the fourteenth week. GCT and ARI 
scores along with personal and family information 
were taken from Navy records. 

To check stability of the time instruments, each 
was readministered in the twelfth week to all respon- 
dents, Reliability coefficients ranged from .73 on the 
Semantic Differential evaluation scale to .91 on the 
preknowledge section of the Money Game. Reliability 
values for the VAch scale and MA scale were not 
computed.* 


RESULTS 
Correlates of Achievement and Anxiety 


The correlations shown in Table 1 indicate 
that valuing achievement is positively corre- 
lated with temporal relatedness (Circles Test) 
and negatively correlated with temporal anxi- 
ety (TA scale). The prediction that achieve- 
ment orientations would be associated with 
an extended future border of the present is 
found, however, only among women. Mani- 
fest anxiety on the other hand, as hypothe- 
sized, is inversely related to temporal related- 
ness among men only and positively related 
to TA scale scores of both sexes. The hy- 
pothesis that anxiety correlates with temporal 
fantasies is partially confirmed as men show 


*In addition to retesting, respondents were asked 
to describe their reactions to the temporal instru- 
ments and to tell what they had intended to com- 
municate in the various tasks. In this manner, some 
measure of face validity was established. 


the MA scale (r = .26, p < .05). 

In terms of simple correlations, intelligence 
relates significantly only to temporal related- 
ness (for males, r = .21, p < .01; for females, 
r= .20, p< .05). Thus, for the most part, 
performance on the time instruments occurs 
independently of intelligence. However, while 
remaining independent of anxiety, intelligence 
relates positively to valuing achievement. 
Achievement and anxiety are themselves nega- 
tively correlated. These findings are sum- 
marized in Table 2. 


The Effect of Intelligence 


The means and standard deviations of the 
temporal variables shown in Table 3 provide 
some indication of performance on the tests. 
In the cases of experiential orientations and 
temporal relatedness, the frequency distribu- 
tions in the various categories are better per- 
formance indicators. These distributions are 
noted later. 

Several findings in Table 3 deserve men- 
tion. First, women tend to expand both pres- 
ent borders more than men (past border ¢ = 
2.54, p < .05; future border ¢ = 1.34, ms) 
who, in turn, show higher temporal recovery 
and preknowledge scores than women (recov- 
ery £ = 1.97, p < .10; preknowledge ¢ = 5.60, 
$ < .05). Regarding this last finding, men re- 
ported that recovered time would be used for 
redoing such that present and future condi- 
tions might be altered. Thus time retrieved 
would be time remade. Women, however, re- 


per 
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TABLE 2 
INTERCORRELATIONS OF VAcH, MA SCALE, AND INTELLIGENCE 
1Q VAch MA scale 

M F M F M F 
Intelligence a z 
VAch .24** .23* = ma 
MA scale —.04 —.07 —.14* ener = in 
ERN 


ported that time retrieved would be time re- 
lived. 

The high TA scale scores demonstrate that 
anxiety about time is common among both 
sexes. In contrast, given the limitless wealth 
offered in the Money Game, fantasy scores 
are low.’ That payments for small allotments 
do not adequately account for the depressed 
summed means is demonstrated in the low 
payments for years alone (combined male- 
female past X = 2.81; future X = 2.90). 

Tn order to test the remaining hypotheses 
and control for the effect of IQ, men and 
women were divided at the combined mean 
into high- and low-intelligence groups. VAch 
Scores also were divided this way into high 
and low categories. Male MA scale scores 

SAs a matter of interest, more than 70% of all 


respondents failed to pay anything for time of the 
historical past and future. 


were trichotomized in order to locate possible 
curvilinear relationships. The smaller number 
of women in the sample, however, forced the 
use of a simple mean dichotomy. 

Looking only at the statistically significant 
findings, an association is found between 
achievement value scores on the one hand, 
and experiential future orientations and low 
future potency ratings on the other. How- 
ever, the former association is significant only 
among lower IQ men (x? = 6.43, df= 1, p< 
.05) and the latter only among higher IQ 
men (x? = 4.58, df = 1, p < .05). An addi- 
tional discovery is that intelligence functions 
in raising the percentage of pa-pr orienters 
(35%) but in lowering the percentage of 
future orienters (36%) and samplers (29%). 

High achievement among women tends to 
be associated with extended (i.e., above the 
mean) future borders of the present, but the 


TABLE 3 
MEANS AND STANDARD DEVIATIONS OF ALL VARIABLES 
Males Females 
Variables 
M SD M SD 
__ Ie 
Independent 

Intelligence 113.4 11.72 109.6 10.22 

VAch 5.62 1.11 5.41 1.03 

MS Scale 8.73 414 9.39 4.09 

€mporal 

Experiential time orientation 2.34 -654 2.38 -624 

\ Past duration of present 2.92 2.30 3.51 2.46 

uture duration of present 3.19 2.43 3.96 2.44 

p! 

Temporal relatedness 2.61 3.56 2.12 3.21 

© TAscale 15.15 6.29 14.48 6.19 

Futuré potency 5.12 1:03 5.11 1.15 

Future evaluation 5.18 1.36 5.47 1.55 

ast recovery 6.84 1.62 6.38 1.58 

uture preknowledge* 6.74 1.70 6.29 1.50 


* Values summed for three allotments. 


548 


tendency reaches significant proportions only 
among the high-IQ group (x? = 4.70, df = 1, 
p< .05). Moreover, as the correlations be- 
tween the variables suggested (Table 1), 
temporal relatedness is enhanced by valuing 
achievement independent of intelligence (x? 
= 4.95, df=2, p< .10). Indeed, just high 
VAch women demonstrate integrated or pro- 
jected designs which have been combined for 
the purpose of computing these chi-square 
values, 

Whereas among lower intelligence men, 
both moderate and high anxiety levels are 
associated with temporal atomicity (x? = 
13.68, df = 4, p < .01), among higher intelli- 
gence men, they are associated with high fu- 
ture evaluation scores, that is, scores above 
the mean (x? = 5.68, df = 2, p < .10), and 
high payments, that is, above the mean for 
time of the past (x? = 8.32, df = 2, p < .02) 
and future (x? = 5.39, df=2, p<.10). 
Temporal anxiety, however, is related to mani- 
fest anxiety among low-intelligence men (x? = 
11.09, df = 2, p < .01), high-intelligence men 
(x? = 26.84, df= 2, p< .01), and high-in- 
telligence women (x?=6.24, df=2, p< 
.01). 

A common characteristic of all the above 
findings is that where insignificant values ob- 
tain, the direction of the percentages normally 
is the same as in the significant cells. Hence, 
while intelligence in fact is sufficiently power- 
ful to reduce associations to levels of statisti- 
cal insignificance, in no instance does it re- 
verse the effects of achievement and anxiety 
on time perceptions. 


DISCUSSION 


Results of the study may be summarized as 
follows: Achievement values in men and 
women encourage temporal relatedness on the 
Circles Test but function in lowering temporal 
anxiety (TA scale). Manifest anxiety, on the 
other hand, increases temporal anxiety and 
atomicity on the Circles Test. All of these 
findings occur independently of intelligence. 

Among high-intelligence groups, achieve- 
ment is associated with experiential future 
orientations in men and with extended future 
border durations of the present (Duration In- 
ventory) in women. Manifest anxiety relates 
positively to evaluations of the future (Se- 
mantic Differential) and to increased interest 
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in recovering the past and foreknowing the 
future (Money Game). These last two find- 
ings, however, obtain for men only. 

Among low-intelligence males, achievement 
functions in lowering potency and evaluation 
ratings of the future. Holding aside the effects 
of sex and intelligence, 8 of the 11 hypotheses 
are in some way confirmed. However, as anzi- 
ety failed to produce experiential past orien- 
tations, increase the present’s past duration, 
or elevate perceptions of the future’s potency, 
Hypotheses c, d, and i remain unconfirmed. 

The universal finding that temporal anxiety 
increases with manifest anxiety but decreases 
through valuing achievement seems to reflect 
the dominant perceptions of the ways in 
which individuals incorporate the future. 
Whereas the achiever accepts the challenge of 
the unknown and turns the current of time 
into energy for activity and control, the 
anxious person finds the current constricting 
and overpowering, and ultimately seeks a re- 
turn to the familiar and secure. Even with 
the variations attributed to intelligence levels, 
the image of the anxious man is one of overly 
swift time passage coupled with a desire to 
retrieve prior moments and foresee future 
ones. In addition, he appears unable to tack 
these moments onto present experiences. His 
world is one of clutching at a rushing time in 
order, perhaps, to still it or, at least, to keep 
a timeless past near him as long as possible: 
“Like a man on shipboard looking out over 
the stern at the foaming wake, he has turned 
his back on the future and sees time as a 
wave rushing backward into the past [Bar- 
rett, 1967, p. 362].” 

The dilemma of turning back or facing for- 
ward has been articulated by psychologists 
and philosophers alike. Heidegger’s descrip- 
tion of the authentic man, for example, as one 
“whose basic possibilities have been disclosed 
to him including the possibilities of death, 
and who stands resolutely up to those possi- 
bilities [cited in Barrett, 1967, p, 361]” 
surely resembles descriptions of the achiever’s 
special belief in personal control of outcomes 
(McClelland, 1961; Strodtbeck, 1958). These 
notions of possibility and personal control 
Tepresent, moreover, the antithesis of Freud’s 
(1936) notion that anxiety expresses helpless- 
ness, or Melges and Fourgerousse’s (1966) Y 


TEMPORAL CORRELATES OF ACHIEVEMENT AND ANXIETY 


perhaps severe proclamation that “if an indi- 
vidual has no plans or cannot deal with the 
present in terms of the past, then behavioral 
disorganization will result [p. 138].” 

This last passage, however, may help to 
explain the temporal relatedness variable and, 
for that matter, the synthesizing force inher- 
ent in the spread of the temporal horizon 
(Fraisse, 1963; Heidegger, 1962). If past- 
present connections ate made through the 
sense of personal efficacy, control of activity 
or, more simply, autonomy, then present-fu- 
ture connections become the inferred exten- 
sion of this autonomy. Inference gives life 
and meaning to the future. Prior achievement, 
therefore, breeds possibility and reinforces 
credible planning. Thus, indeed, “it is as if 
the [achievement] need has served to relate 
present achievement experiences to future 
ones [McClelland, 1951, p. 486].” 

In philosophical terms, it is precisely in the 
action of planning that future possibilities or 
the sense of possibility is first confronted 
and ultimately actualized. But planning 
transcends feelings of competence, lists of 
significant expectations, or clinical admoni- 
tions, for according to Barrett (1967), “not 
only do we plan specific projects for today or 
tomorrow, but our life as a whole is a project 
in the sense that we are perpetually thrown- 
ahead-of-ourselves-toward-the-future[p.361].” 

Here, finally, is a key to understanding the 
characteristic responses of achieving and 
anxious persons to time. At one unequivocal 
level, time is only forward moving. We ex- 
Perience this as the sense of being-ahead-of- 
Ourselves (Barrett, 1967; McClelland, 1961), 
as a forward look (Epley & Ricks, 1963), or 
future directedness (Kummel, 1966). Whereas 
the achiever accentuates this future directed- 
Ness even to the point of seeking change and 
unfamiliarity (Winterbottom, 1958), anxiety 
leads one away from the future and thereby 
Urges a magical undoing of an intractable 
reality. The undoing, which Freud (1936) as- 
Sociated with reactions to danger situations, 
May be the present tense and, therefore, the 
functioning mechanism of which temporal re- 
Covery (redoing) and preknowledge (pre- 
doing) are fantasy analogs. All three mecha- 
nisms fail, of course, as they either avoid or 
deny the linear flow of time with its irretriev- 
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able past, its unknowable future, and, most 
importantly, its finite limits of existence 
(Cohen, 1962, Ch. 5; Heidegger, 1962). 

However, the sense of being-ahead-of-one- 
self must be qualified by the fact that men 
and women need not experience it in exactly 
the same way. The data indicate that valuing 
achievement affects frequencies of expecta- 
tions and assessments of the future among 
men only. In women the value leads not to 
teverberations in future perceptions, but in 
future aspects of the present. Whereas mascu- 
line anticipation implies an active or agentic 
(Bakan, 1966) insertion into the future, the 
implication for women is an incorporation of 
now, or a continuous adding of moments to 
now. Achievement and anxiety, then, initiate 
systematic transformations of attitudes about 
the future but primarily among men (Bakan, 
1966; McClelland, 1961), for, as a function of 
achievement, women lean more to a percep- 
tion not unlike Gurvitch’s (1964) “present 
[italics added] in advance of itself” (cf. Cot- 
tle, in press a). 

Considering the sophistication of contempo- 
rary philosophical and psychological litera- 
ture and particularly the stress on “the sys- 
tematic description of what is given in experi- 
ence [Barrett, 1967, p. 368],” commonly 
known as phenomenology, it is ironical that 
for the most part, definitions of past, present, 
and future remain unstudied. Future research 
must examine the kinds of primitive chrono- 
logical differentiations made in the Duration 
Inventory, for these establish truly pheno- 
menological temporal parameters. As they 
normally are conceived, temporal “orienta- 
tions” become restricted in their applicability 
and even invalid if such dramatic variations: 
are demonstrated in the determination of 
“simple” chronological “durations” (Cottle, im 
press a). Can one meaningfully discuss a fu- 
ture orientation, for example, if for some the 
future commences seconds from now, while 
for others an expanse literally of years ap- 
pertains to what they themselves call present? 

We must begin to speculate, furthermore, 
on the temporality inherent in social-psycho- 
logical processes like achievement and anxi- 
ety, as both, in their way, manifest character- 
istic approaches to the future. Achievement 
strivings and anxiety are themselves experi- 
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ences of and in time, experiences which, pre- 
sumably, the sexes do not equally share. And, 
while they remain negatively correlated in 
statistical operations, the complexity of that 
period of time known as the future reconciles 
many of their seeming disparities, both philo- 
sophical and psychological, as this passage 
reveals: 


I am prepared for the unexpected, I lunge toward it 
with hunger, even though I’ve seen so much I 
seldom encounter a surprise; however, I am most 
near death when I begin believing I’ve seen it all. 
So I continuously doubt the future. My faith is 
behind me, not out in front. ...I am pushing at 
crashing speeds into the unknown. ... My power 
is in me, in all of us. Life is this power [Cartwright, 
1967, p. 90].” 
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DIFFERENT PSYCHOLOGICAL EFFECTS OF 
LATERALIZED BRAIN DAMAGE? 
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Conflicting evidence exists concerning the differential psychological effects of 
lateralized brain damage on the Wechsler scales. Consideration of various 
reasons for discrepant results led to the selection of the WAIS Vocabulary and 
Block Design subtests as the best measures of verbal. (language) and per- 
ceptual organization (visuoconstructive) factors. Comparison of mean scores 
of these subtests in unilateral lesion, bilateral lesion, and control groups of two 
separate experiments confirm hypothesized differences: right-hemisphere damage 
results in impaired visuoconstructive performance; left-hemisphere damage 
affects language abilities. The results could not be accounted for by variables 
such as age, education, and emotional disturbance, or localization, diagnosis, 


acuteness, and severity of brain disorder. 


Behavior suggestive of a functional asym- 
metry of the two cerebral hemispheres in man 
has been the subject of numerous studies in 
the past 100 years. Accumulated clinical evi- 
dence has suggested that the left hemisphere 
is of primary importance in the control of 
language and symbolic processes, while the 
control of nonverbal, visuoconstructive abili- 
ties appears to be localized in the right hemi- 
sphere (Millikan & Darley, 1967). 

Much of the experimental and clinical work 
leading to these hypotheses has involved the 
study of patients with unilateral cerebral dam- 
age. When quantification of verbal and non- 
verbal abilities and deficits has been at- 
tempted, one of the most frequently used in- 
struments has been either the Wechsler-Belle- 
vue or Wechsler Adult Intelligence Scale 
(WAIS). A number of studies have reported 
that left-hemisphere damage is accompanied 
by a decrement in Verbal IQ (VIQ), while 
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right-hemisphere dysfunction often results in 
a lowered Performance IQ (PIQ) (Guertin, 
Ladd, Frank, Rabin, & Hiester, 1966). Reitan 
(1962), in particular, has stated that such 
patterns of impairment are found regardless 
of the specific location of the lesion within 
the hemisphere. 

Such findings, however, are by no means 
universal since equivocal or contradictory re- 
sults have been reported, particularly with re- 
spect to PIQ. As Benton (1962) states, 

The prediction that patients with lesions of the 
left hemisphere will show lower verbal scale scores 
than patients with lesions of the right hemisphere is 
hardly an exciting one, since we have been aware of 
this fact for decades. It is the prediction that pa- 
tients with lesions of the right hemisphere will show 
greater deficit on non-verbal spatial tasks than will 
patients with lesions of the left hemisphere which is 
of particular interest. But it is precisely with respect 
to this prediction that the findings of the psycho- 
metric studies have been weakest [p. 258]. 

Smith (1966a, 1966b) probably represents 
the most outspoken critic of the notion of the 
differential organization of verbal and non- 
verbal abilities, at least as measured by the 
Wechsler scales. Studying rather large popula- 
tions of brain-damaged Ss, he has failed to 
support the claim of differential effects of 
lateralized lesions, and suggests that sampling 
or population differences may be responsible 
for the findings of Reitan and his colleagues. 
Satz, Richard, and Daniels (1967) using 
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multivariate covariance analysis to adjust 
for age, education, and intelligence, reported 
differential VIQ decrements in Ss with left- 
hemisphere lesions, but no differential impair- 
ment in PIQ of Ss with right-hemisphere les- 
ions when these groups were compared with 
other populations. These authors also stress 
the importance of sampling differences. 

When these negative findings with the 
WAIS are considered in relation to the many 
positive indications of the role of the right 
hemisphere in perceptual and visuoconstruc- 
tive functions (Bogen & Gazzaniga, 1965; 
Teuber, 1962), it would appear that careful 
attention should be given not only to the 
characteristics of the population studied, but 
also to the nature of the dependent variable, 
that is, the Performance scale. 

Benton (1962) and Heilbrun (1956) have 
called attention to the fact that the Wechsler 
Performance subtests are not truly “non- 
verbal,” nor were they originally designed to 
be. To the extent that language and symbolic 
processes contribute to adequacy of perform- 
ance on these subtests, patients with verbal- 
symbolic deficits might perform poorly 
whether or not they are displaying visual- 
constructive deficits. There are many indica- 
tions that the Performance scale is comprised 
of a more heterogeneous group of subtests than 
the Verbal scale. For example, the average 
correlation of subtests of the Performance 
scale with the total Performance score is 
lower than the comparable relationship on the 
Verbal scale (Wechsler, 1958). Further, Per- 
formance subtests do not have as high load- 
ings on the nonverbal factor as do Verbal sub- 
tests on the verbal factor (Berger, Bernstein, 
Klein, Cohen, & Lucas, 1964). In short, 
compared to the Verbal scale, the Performance 
scale is not as good a criterion measure for 
testing the hypothesis of differential hemi- 

spheric control of psychological functions. 

A more straightforward approach to the 
analysis of the differential effect of left- versus 
right-hemisphere pathology on verbal and 
what Guertin et al. (1966) term “perceptual 
organization” abilities would be to examine 
those WAIS subtests which best reflect these 
abilities. The results of a number of factor- 
analytic studies of the Wechsler scales have 
consistently shown that the Vocabulary and 
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Block Design subtests have the highest load- 
ings (respectively) on the verbal and percep- 
tual organization factors (Berger et al., 1964). 
Further, these subtests have the highest cor- 
relations with the Verbal and Performance 
IQs and are generally the most frequently 
chosen subtests when the full WAIS is not ad- 
ministered. For these reasons, the Vocabulary 
and Block Design subtests were used in the 
present studies. The general hypothesis to be 
investigated was that left-hemisphere damage 
leads to impaired language performance (Vo- 
cabulary score) and right-hemisphere damage 
results in impaired visuoconstructive per- 
formance (Block Design score). 

An important methodological variable 
which has been emphasized by a number of 
investigators is that of the comparison groups. 
Hospitalized, non-brain-damaged patients fre- 
quently manifest a lower PIQ than VIQ 
(Guertin et al, 1966; Parsons & Kemp, 
1960). Accordingly, inferences about the effect 
of damage to the right hemisphere on Per- 
formance subtests must take these findings 
into consideration. 

Still another aspect of the sampling prob- 
lem is that of equivalence of the severity and 
acuteness of the unilateral lesion groups. Arri- 
goni and De Renzi (1964) note that right- 
hemisphere cases are usually more severely 
brain-damaged than left-hemisphere cases be- 
cause the symptoms of the latter lead to 
earlier identification. This suggests that se- 
verity of disorder is an important variable to 
be considered before inferences of differen- 
tial impairment can be made, Similarly, 
acuteness or recency of brain disorder may 
lead to greater differential psychological im- 
pairment than long-standing or chronic condi- 
tions (Reitan, 1962). 

With the above considerations in mind, 

two populations of brain-damaged patients 
from different experiments were separated 
into subgroups with lateralized damage in 
order to test the following specific hypothe- 
ses: 
1. Within groups: patients with left-hemi- 
sphere damage will perform more poorly on 
the Vocabulary than on the Block Design 
subtests and the tight-hemisphere-damaged 
group will manifest the opposite pattern. 
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2. Between groups: the patients with left- 
hemisphere damage will perform more poorly 
on Vocabulary than the right-hemisphere-dam- 
aged group, and the latter will perform at a 
lower level on the Block Design than the 
left-hemisphere-damaged group. 

In each experiment, for comparative pur- 
poses, a group of bilateral and diffuse brain- 
damaged Ss and a group of non-brain-dam- 
aged, but hospitalized, patients were included. 


METHOD 


Data were collected in two experiments. The first 
experiment served as an initial test of the hypothesis; 
the second and more extensive study provided a 
cross-validation. 


Subjects 


Experiment I. Forty-four male veteran Ss with 
definite diagnoses of brain damage were rated by a 
neurologist experienced in using research rating 
scales, and placed into three groups: 20 Ss with 
predominately left-hemisphere damage (LHD), 16 Ss 
with predominately right-hemisphere damage (RHD), 
and 8 Ss with bilateral or diffuse damage (BHD). 
Forty-five male Ss without medical evidence of brain 
damage (NBD) served as a comparison group. These 
Ss came from medical, surgical, and psychosomatic 
wards, There were no significant differences among 
any of the groups on age (overall M=45.5 years) 
and education (overall grade M = 10.2). LHD and 
RHD groups did not differ in the percentage of Ss 
falling into four main classifications of neurological 
disease. The Ss were distributed as follows: Trauma, 
23%; CVA, 42%; Tumors, 25%; and Degenerative 
disorders, 10%, Further, there was no significant dif- 
ference between the locations of the lesions within 
the hemisphere for the two groups: in the LHD 
group, the number of times the frontal, temporal, 
Parietal, and occipital lobes were involved was 17, 9, 
10, and 1, respectively. Comparable data for the 
RED group were 10, 5, 10, and 0. All Ss were right- 
handed. 

Experiment II. Ninety-eight Ss with unequivocal 
evidence of brain damage were rated by neurologists 
one to the same criteria as used in Experiment 
$ Twenty-five Ss were placed in the RHD group; 
24 Ss in the LHD group, and 49 Ss in the BHD 
group. The NBD group consisted of 50 Ss drawn 
from medical, surgical, and neurological wards. The 
8toups were comparable on age (overall M = 45.9) 
and education (overall M = 9.53), with the exception 
ct the LHD group whose mean age (52.0 years) was 
Significantly higher than the RHD, BHD, and NBD 
eure. However, since age-corrected scaled scores 
ae used in the analysis, this difference was not 
=o critical. Again, as in Experiment I, LHD 
SE RED did not differ in percentage of Ss falling 
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into the four neurological disease categories. The Ss 
were distributed as follows: Trauma, 14%; CVA, 
53%; Tumors, 22%; and Degenerative diseases, 11%. 
For the LHD group, the number of occurrences of 
frontal, temporal, parietal, and occipital lobe in- 
volvement was 15, 14, 13, and 3, respectively; for 
RHD, the comparable figures were 13, 12, 14, and 7. 
Approximately 25% of the Ss in each of the groups 
were female. The bulk of the Ss were right-handed; 
one S in the LHD, one S in RHD, and 8 Ss in the 
BHD were left-handed. 

As noted earlier, the S characterization of emo- 
tional disturbance and recency and severity of brain 
disorder may also lead to a differential deficit. If not 
taken into account, they may obscure the hypothe- 
sized relationships. The role of these variables is 
examined in the Results section. 


Tests 


It is important to note that Ss had to be suffi- 
ciently intact and communicative to undertake a 
rather extensive battery of tests over a period of 
several hours, 3 

In both experiments, Ss were given the Vocabu- 
lary and Block Design subtests of the WAIS. In 
Experiment I, these were the only subtests from the 
WAIS used, but they were administered in the con- 
text of a larger battery of psychological procedures 
in a concept-identification study. All Ss were tested 
by one E (J. B.). In Experiment II, the full WAIS 
was given by different but experienced Es. Age-cor- 
rected scores were used in all analyses and one-tailed 
t tests were used to test the hypotheses. 


RESULTS 

Within-Groups Analysis 

In Table 1 and Figure 1, the mean age-cor- 
rected Vocabulary and Block Design scaled 
scores are presented for the various groups. 
Within LHD and RHD groups, differences 
between the mean scores are in the predicted 
direction and significant: for LHD, Vocabu- 
lary is significantly lower than Block Design; 
for RHD, the opposite holds true. Within the 
BHD groups there were no significant differ- 
ences between Vocabulary and Block Design 
scores. In the control group of Experiment I, 
Vocabulary was significantly lower than 
Block Design, an unexpected finding that 
would raise a question concerning interpreta- 
tion of the LHD results were it not for Ex- 
periment II, in which the control group dis- 
played no difference between means on the two 
subtests. The latter finding is in accord with 
previously reported results for control Ss from 
this general population (Parsons, Morris, & 
Denny, 1963). Therefore, the Vocabulary- 
Block Design difference for controls in Experi- 


554 


O. A. Parsons, A. VEGA, JR., AND J. BURN 


TABLE 1 TABLE 2 
MEAN VOCABULARY AND Brock DESIGN AGE- t TESTS FOR BETWEEN-GROUPS DIFFERENCES 
CORRECTED SCALED SCORES AND ¢ TESTS FOR = 
WITHIN-GROUP DIFFERENCES Group Vocabulary Block Design 
x Experiment I 
Group x Block t LHD-RHD 1.82* (LHD) | 2.34* (RHD) 
Vocabulary |” Design LHD-BHD | 0.32 0.95 
RHD-BHD 1.10 0.80 
Experiment I LHD-Control | 2.49** (LHD) | 2.16* (LHD) 
LHD 6.80 8.45 2.24* RHD-Control | 0.07 5.19** (RHD) 
RHD 8.94 6.06 Pb iry BHD-Control | 1.25 2.61** (BHD) 
BHD 7.25 7.12 18 Experiment II 
Control 8.87 10.24 3.31** LHD-RHD 2.20* (LHD) | 1.90* (RHD) 
Experiment IT LHD-BHD 1.13 1.21 
LHD 5.80 7.62 Tise RHD-BHD 2.15* (BHD) | 0.83 
RHD 8.28 5.80 3.03** LHD-Control | 4.16** (LHD) | 2.92** (LHD) 
BHD 7.12 6.53 1.18 RHD-Control | 1.88* (RHD) | 5.49** (RHD) 
Control 9.72 9.86 0.11 BHD-Control | 3.97** (BHD) | 5.00** (BHD) 
*p <.05. Note.—Groups in parentheses have the lower mean score, 
*D <.01. (Means are reported in Table 1.) 


ment I is considered to be due to sampling 
error. 


Between-Groups Analysis 


Again, the results were similar for both 
experiments (Table 2). The predicted rela- 
tionships were found: LHD scored signifi- 
cantly lower than RHD on Vocabulary while 
RHD scored significantly lower than LHD on 
Block Design. Comparisons of LHD and 
RHD with the BHD group were nonsignifi- 
cant with one exception: in Experiment II, 
BHD scored significantly lower than the RHD 
group on Vocabulary. 

Comparisons of brain-damaged groups with 
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AGE-CORRECTED SCALED SCORES 


VOCABULARY BLOCK VOCABULARY 


DESIGN 


=p Sai 
controls gave rise to results consistent with 
expectancies. The RHD group in both experi- 
ments received significantly lower scores than 
controls on Block Design and on Vocabulary 
in Experiment II but not on Vocabulary of 
Experiment I (the latter probably due to the 
unusually low Vocabulary score of controls in 
Experiment I), The LHD groups were sig- 
nificantly lower than controls on both Vocab- 
ulary and Block Design. The BHD groups 
were lower on three of the four comparisons 
with the controls, the exception again being 
the Vocabulary scores in Experiment I. 


CONT. 1 
= === <@ CONT. 2 


DESIGN 


Fic. 1. Mean Vocabulary and Block Design age-corrected scaled scores for all 
groups in both experiments, 
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Correlational Analysis 


Another approach to the hypothesized dif- 
ference in hemispheric functions is possible 
through correlational analysis. If the hypothe- 
sis is valid, lower correlations between Vocab- 
ulary and Block Design would be expected in 
the unilateral lesion groups than in BHD and 
controls. In Table 3, Pearson product-moment 
correlations are presented for the two experi- 
ments separately and for the combined sam- 
ples, It can be seen that correlations for the 
controls and RHD groups are remarkably 
consistent in the two experiments while cor- 
relations for the LHD and BHD are more 
variable. Using the combined sample as the 
best estimate of the correlation, RHD is the 
only group in whom the correlation is not 
significant. Tests for differences among these 
correlations indicate no significant difference 
among them, although the difference between 
RHD and BHD groups approached signifi- 
cance (z = 1.80, p= .07). 


Additional Test of Hypothesis 


In order to provide an additional test of 
the hypothesis, two other subtests of the 
WAIS, Similarities and Object Assembly, 
which have high loading on the verbal and 
perceptual organization (visuoconstructive) 
factors, respectively, were selected. The mean 
Scores and correlations for these subtests for 
the Ss of Experiment II are presented in 
Table 4. One-tailed £ tests for differences 
between means within groups and be- 
tween groups were as follows: within groups 
between Similarities and Object Assembly, for 
LHD, t = 2.52, p< .02; for RHD, t = 1.93, 
$ < .05. Between groups: LHD versus RHD 
on Similarities, £ = 2.48, p < .02; on Object 
Assembly t= 1.31, p= .10. Thus the same 
Significant relationships were obtained as re- 
Ported above for Vocabulary and Block De- 
Sign, with the exception of the nonsignificant 
difference on Object Assembly. Correlational 
analysis (Table 4) indicates that the RHD 
group has a significantly lower correlation 
between Similarities and Object Assembly, as 
tested by the r to z transformation (two- 
tailed test), than each of the other groups. 
The LHD group differed significantly only 
from the RHD group, and the BHD. group 
differed significantly from RHD and controls. 
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TABLE 3 


CORRELATIONS BETWEEN VOCABULARY AND BLOCK 
DESIGN AGE-CoRRECTED SCALED SCORES 


Prenna eenen Combined 
Group 
N r N r N r 
LHD 20 |.491* | 24 | .272 44 | .369* 
RHD 16 | .203 25 | .209 41 | .235 
BHD 8 | .217 49 | .S81** | 57 | .549** 
Controls | 45 | .465** | 50 | .461** | 95 | .442** 
* 205. 
=b $o 


Other S Characteristics which May Have 
Influenced the Results 


It is important to examine the possible 
effects of emotional disturbance, severity of 
psychological impairment, and recency of 
brain damage on the hypothesized differential 
impairments resulting from left- versus right- 
hemisphere damage. In several previous 
studies from our laboratories, hospitalized, 
non-brain-damaged Ss sampled in the same 
manner as in the present studies have had 
MMPI profiles similar to the brain-damaged 
Ss (Parsons et al., 1963; Parsons & Stewart, 
1966). MMPIs were available only for the 
Ss of Experiment II. Since it is the RHD 
group in whom it is crucial to eliminate the 
possibility of the emotional disturbance per se 
giving rise to decrement on Block Design 
(Parsons et al., 1963), the MMPIs of this 
group were compared to those of the control 
group. The mean profile was similar for both 
groups: a heightened “neurotic triad” with 
secondary peaks on Pt and Sc. The mean T 
scores for the clinical scales (including Mf 


TABLE 4 


MEANS AND CORRELATIONS FOR SIMILARITIES AND 
OBJECT ASSEMBLY AGE-CoRRECTED 


SCALE SCORES 
E AN bject 
Group Similarities ee bly r 

LHD 5.33 7.04 SaR? 
RHD 8.09 5.91 =.235 
BHD 6.91 6.88 657** 
Controls 9.94 9.44 -310* 
oh Si: 
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and Si), a measure which correlates highly 
with degree of emotional disturbance (Parsons 
et al., 1963), were 58.8 for the RHD and 
60.7 for the controls (¢ = 1.00, ms). Thus the 
RHD and control populations appeared to 
have similar psychological disturbances both 
in degree and kind. For the LHD the mean T 
score was 62, slightly (but not significantly) 
higher than the other two groups. 

The two unilateral lesion groups were not 
significantly different as regards recency of 
brain disorder. Dividing the groups into those 
who tested within 6 months of onset and 
those who were tested after, there were 9 and 
13 Ss, respectively, for RHD and LHD in the 
more recent damage groups. The average 
period of time between onset and testing for 
all Ss in both groups was 18.3 months. The 
pattern of differential impairment between 
LHD and RHD within the recent and chronic 
groups was the same as reported above for 
the overall analysis. 

Finally, the important question of equiva- 
lence of severity of psychological impairment 
due to brain damage could be assessed in the 
LHD and RHD of this experiment. The 
Halstead Battery of neuropsychological tests 
had been administered to all Ss. Data have 
been reported which indicates that the Bat- 
tery reflects presence and degree of severity 
of brain damage (Reitan, 1962). In our lab- 
oratory, the mean T score for all tests in 
the Battery has proved to be a potent dis- 
criminator between brain-damaged and non- 
brain-damaged populations (Vega & Parsons, 
1967). The mean T score for the LHD was 
37.9; for the RHD 40.0, a nonsignificant 
difference (¢ = 1.14). 

It is concluded that the variables of emo- 
tional disturbance, severity, and recency of 
brain damage cannot account for the differen- 
tial pattern of impairment observed in the 
LHD and RHD. 


Discussion 


In two separate groups of patients with 
unilateral left- and right-hemisphere brain 
damage, differences in scores on the Vocabu- 
lary and Block Design subtests of the WAIS 
supported the hypothesis that left-hemisphere 
damage results in an impairment of language 
abilities and that right-hemisphere damage 
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results in a decrement in visuoconstructive 
ability. Further confirmation for the hypothe- 
sis was found in an additional analysis per- 
formed on the Similarities and Object Assem- 
bly scores of the Ss in Experiment II. 

The consistent relatively clear-cut findings 
are attributable in part to the use of age- 
corrected scaled scores for each S. The well- 
known relationship between age and Perform- 
ance subtest scoring is handled more effec- 
tively this way, in small samples of approxi- 
mately the same age, than by the more tradi- 
tional method of equating groups for mean 
age. Undoubtedly the use of the two WAIS 
subtests which best represent the hypothesized 
ability on the Performance scale, Block De- 
sign and Object Assembly, also contributed to 
the stability of the results. Support for this 
latter statement may be found in a compari- 
son of results cited above (Experiment II) 
with findings based on IQs derived from using 
all subtests on each scale. Compared to RHD, 
LHD had a lower VIQ (M = 89.1 and 75.8, 
t = 2.79, p < .01), but RHDs were not sig- 
nificantly different from LHDs on Perform- 
ance IQ (M = 78.7 and 83.3, t = 1.13, ns). 
Using the complete Performance scale attenu- 
ates differences in the RHD group and gives 
rise to nonsignificant results similar to those 
discussed by Benton (1962). 

The correlational analyses provided addi- 
tional evidence for differential hemispheric 
control of function. The correlations between 
tests representing a verbal factor and those 
representing the visuoconstructive factor in 
the RHD groups were consistently low and 
nonsignificant. In the LHD groups the cor- 
relations were significant and did not differ 
statistically from the controls or BHD. In 
accord with comments by Benton (1962) and 
Heilbrun (1956), it may well be that lan- 
guage factors contribute to a greater degree 
to performance on Performance scale sub- 
tests than the visuoconstructive abilities enter 
into the Verbal scale scoring. 

There are at least three studies in the litera- 
ture directly comparable to the present one in 
the problems to which they are addressed and 
in the use of Vocabulary or Similarities and 
Block Design or Object Assembly, The 
Present results confirm those of Costa and 
Vaughan (1962) who found differences on 
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Vocabulary (Mill Hill) and WAIS Block 
Design as a function of lesion lateralization. 
The results also confirm the findings of 
Heilbrun (1956) in that Similarities was sig- 
nificantly lower in his LHD compared to 
RHD group. Heilbrun did not find overall 
significant differences between his lateralized 
lesion groups on a spatial battery of tests 
but did find differences on the Block Design 
when the severe dysphasics were removed 
from the analysis. 

Finally, the findings provide partial con- 


_firmation of the results reported by Arrigoni 


and De Renzi (1964). These investigators, 
in a large-scale study of the effects of lateral- 
ized lesions, used the Verbal Scale of the 
Wechsler-Bellevue I and the Block Design 
and Object Assembly. They found the LHD 
group lower than RHD on the Verbal sub- 
tests, but the two groups were not significantly 
different on Block Design and Object Assem- 
bly. Arrigoni and De Renzi suggest that the 
positive results found by others could be due 
to the greater severity of damage in the RHD 
patients. Such a criticism does not hold for 
the present study in that the LHD and RHD 
groups of the major study (Experiment II) 
had the same level of impairment on the 
Halstead Battery. On the other hand, Arrigoni 
and De Renzi did not control for level of 
emotional disturbance in their Ss, a variable 
Which has been shown to affect Performance 
Scale subtests (Parsons & Kemp, 1960; 
Parsons et al., 1963). 

It is clear that research in this area, espe- 
cially that which attempts to demonstrate the 
Tole of the right hemisphere in visuoconstruc- 
tive deficits, is fraught with difficulties. How- 
ever, the present results, in which the vari- 
ables of age, education, emotional disturbance 
and localization, diagnosis, severity, and re- 
cency of brain damage have been controlled 
Or otherwise accounted for, give positive sup- 
Port to the growing body of evidence for the 
differential role of the hemispheres in psycho- 
logical processes. 
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RELATIONSHIP OF ACADEMIC UNDERACHIEVEMENT 
TO INCIDENTAL LEARNING 


MICHELE S. MONDANI anp THOMAS A. TUTKO1 
San Jose State College 


Forty-eight junior high school students were tested to determine whether or 
not there was any difference between achievers and underachievers in the 
learning of irrelevant material. All students were administered a test of social 
responsibility in a typical classroom situation. Subsequently, all were given a 
posttest on material printed on the responsibility test but irrelevant to it. 
Highly significant results on a ¢ test between mean scores on the posttest of 
the achievers and underachievers demonstrated that underachievers learned 
significantly more of the incidental material. The problem for underachieve- 
ment does not appear to be an incapacity to learn but a difficulty in focusing 


attention on the central material. 


The concept of underachievement has been 
one that has gained a great deal of popular- 
ity in educational research. Studies have at- 
tempted to explore this multifaceted concept 
from many viewpoints. Numerous aspects of 
underachievement have been investigated in- 
cluding exploration. of the underachiever who 
has superior ability (Miller, 1961), determin- 
ing childhood factors (Winterbottom, 1953), 
describing and understanding the dynamics 
(Gowan, 1957), exploring frustration and 
hostility and its relationship to underachieve- 
ment (Shaw & Black, 1960; Shaw & Grubb, 
1958), and counseling and treatment of the 
underachiever (Miller, 1961). 

Throughout these studies little attention 
has been given to the actual process of learn- 
ing itself. Moreover, there has been no inves- 
tigation of actual behaviors that an under- 
achiever goes through during the exact time 
that learning should be taking place. Such a 
study should yield valuable insights not only 
in the understanding of the process by which 
underachievement takes place but also the 
means by which the problem may be at least 
partially alleviated. This study is an attempt 
to approach the problem from this very angle, 
that is, what behaviors are manifested by the 
underachiever during the learning period? 

It is generally assumed that during the time 
in which learning is supposed to take place, 
the underachiever is learning nothing or at 

1 Requests for reprints should be sent to Thomas 


A. Tutko, Department of Psychology, San Jose State 
College, San Jose, California 95114. 


least very little. One thing at least is readily 
apparent—he does not learn that which is 
required of him. But in fact it is hard to con- 
ceive of an individual learning nothing in a 
classroom situation. Although the under- 
achiever may not be absorbing what the in- 
structor intends for him, some learning ought 
to be taking place. He must therefore be 
“tuned in” to material which is irrelevant or 
incidental to the major learning material. 

In an attempt to determine if this assump- 
tion is correct, the authors drew upon some 
literature from the area of experimental psy- 
chology called incidental learning which has 
been defined by Brown (1954) as “learning 
which occurs in the absence of (a) experi- 
mentally administered instructions to learn 
and (b) introspectively reported intention to 
learn [p. 163].” In essence, the major work 
done in this area indicates that individuals 
have a tendency to observe and learn ma- 
terials which are incidental to a main body of 
knowledge. Usually these studies have em- 
ployed research in which a main body of 
stimuli is presented to § and he must mem- 
orize it. In addition, other stimuli appear si- 
multaneously but are considered irrelevant to 
the main learning task. If S remembers any 
amount of this “irrelevant” material, inci- 
dental learning is said to have taken place. 

The major aim of this study was to de- 
termine if underachievers are learning the in- 
cidental information rather than the central 
information. 
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ACADEMIC UNDERACHIEVEMENT AND INCIDENTAL LEARNING 


METHOD 
Subjects 


The 48 Ss (24 female and 24 male) employed in 
this study were all ninth-grade junior high school 
students.2 All of the Ss were considered above 
average in ability and were placed in the most 
advanced two classes in the school. The school used 
a system of dividing the students into three main 
groups—X, Y, and Z. The X group was considered 
the high potential and performance group followed in 
rank order by the Y and Z groups. The students 
involved in this study were of the X and Y+ groups. 
It should be noted that while division of the stu- 
dents into groups is based on potential, performance 
and improvement are contributing factors as to 
whether or not the students remain in the original 
grouping. Hence, the Ss tested were those with the 
greatest potential and those whose performance 
proved adequate to remain in the more demanding 
class, 

Although the students were performing sufficiently 
well to remain among the top-ranked groups, there 
were quite a few individuals labeled “underachievers.” 
Thus, the students were divided into two groups— 
those who were performing below their predicted 
level and were subsequently considered underachiev- 
ers and those who were performing at least at the 
predicted level. The evaluation and division of Ss 
into the two groups was made by teacher judgment. 
The teacher had known the students for almost a 
full academic year and divided them into either 
achievers or underachievers with consideration given 
to SCAT scores, past performance, GPA, and the 
general degree of effort demonstrated by the stu- 
dent. The use of teacher evaluation is a valid one, 
and, indeed, oftentimes the most reliable means of 
detecting such cases as latent underachievement (Mil- 
ler, 1961). The Ss, then, were divided into two 
groups with 17 considered achieving below their 
ability (labeled underachievers) and 31 considered 
achieving at their predicted ability (labeled average 
achievers), 


Procedure 


The Ss were tested in a single group and every 
effort was made to maintain an atmosphere close to 
the average classroom situation. The Ss were told 
pes SY etage 


_* The authors gratefully acknowledge the coopera- 
tion and assistance of Peter Burnett Junior High 
School of the San Jose Unified School District. 

3 Cooperative School and College Ability Test 
(SCAT). This test measures the capacity of students 
™ grades 4-14 to undertake additional schooling. 
Four operational skills are measured: (a) getting 

e meaning of isolated sentences; (b) performing 
numerical computations rapidly; (c) associating 
Meanings of isolated words; (d) solving arithmetic 
Problems, The SCAT test is usually given in connec- 
tion with the Sequential Tests of Educational Prog- 
Tess (STEP) which indicates the student’s achieve- 
ment while the SCAT indicates his ability. 
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they were to be part of a testing program and that 
all other information concerning the test and the 
Purpose of it would be given at the conclusion of 
the session. They were given precise instructions on 
the use of IBM cards so that a normal emphasis 
was given to testing procedure as would be experi- 
enced with a task given by their usual instructor. 

The Ss were then administered the Personality 
Scale for Social Responsibility in which there were 
56 questions to be marked either true or false ac- 
cording to their personal attitudes (Gough, Mc- 
Closky, & Meehl, 1952). Emphasis was placed in 
the instructions on the fact that there were no 
“right” or “wrong” answers and that honesty in 
answering was most important. In this way, each S 
set his own level of motivation since he was aware 
that in no way would he be graded. 

This social responsibility test was only important 
insofar as it served as an orienting task; that is, it 
served to expose the Ss to the incidental stimuli. The 
incidental stimuli were as follows: Page 1—the date 
in the right-hand corner was May 1975; in the left- 
hand corner was typed D.C. 7; in the instructions 
the Ss were told to indicate a true response with a 
+; a line composed of question marks (????) sepa- 
rated the instructions from the body of the test; a 
star or asterisk (*) was placed beside Question 8. 
Page 2—the entire page was printed in green while 
the other pages were in purple ditto; there was a 
flower doodled in pencil as if it had been placed 
there by someone who had taken the test previ- 
ously; Question 25 had another asterisk beside it. 
Page 3—a second doodle of a stick figure was placed 
at the top of the page; a comma was placed at the 
end of the last sentence on the page; the initials 
MS.M. were placed at the bottom of the page. An 
effort was made to incorporate the incidental ma- 
terial in such a way so as to appear inconspicuous. 
For that reason, the doodles were drawn in pencil, 
the pages had been somewhat bent, and in general, 
the tests looked as if they had been previously used. 
Special and constant instructions were given not to 
communicate in any way with other students so as 
to minimize the possibility of any one student call- 
ing attention to the incidental material. The Ss were 
instructed to answer only the social responsibility 
test and to either look over the test or to turn it 
over when they had completed answering the ques- 
tions. Once again, an attempt was made to main- 
tain the typical classroom situation by permitting 
all students the same time with the material. All 
tests were collected at the same time and the Ss were 
again instructed not to talk. 

Immediately following the collection of the re- 
sponsibility test, a postexperimental test for the 
incidental material was administered. This postex- 
perimental test consisted of 15 questions concerning 
the incidental stimuli. The questions were of both 
the recognition and recall types so that students 
were not handicapped or aided by their particular 
ability in test taking. A typical recall-type question 
was “There were two drawings on your test sheet 
(done in pencil). What were they?” An example of 
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a recognition-type question is “There was a line 
drawn between the instructions and the first ques- 
tion. Was the line made of: stars *, crosses +, 
question marks ?, dots ., or dashes —?” Again, Ss 
worked independently and were told there was no 
penalty for not remembering the material. The 
number of correct items on the incidental material 
formed the basis of comparison between the two 
groups. 


RESULTS AND Discussion 


To determine whether there was a signifi- 
cant difference in the amount of incidental ma- 
terial retained by the achievers and the under- 
achievers, a £ test was run between the mean 
number of correct responses on the postex- 
perimental test. The mean number correct for 
the underachievers was 9.29, for the achievers 
5.77, The difference between the means 
was highly significant (¢ = 3.16, df= 46, p 
< .01). It is readily apparent that the under- 
achievers retained a greater amount of inci- 
dental material than did those labeled achiev- 
ers, 

Tt would appear that there is a tendency of 
the underachiever to concentrate on those 
materials irrelevant to the learning task while 
the achiever concentrates on the material cen- 
tral to the learning task. In a sense the under- 
achiever does well but only on the informa- 
tion which has little to do with the course con- 
tent. 

These results would be in line with the 
work being done with underachievers by Hew- 
ett (1965). In his hierarchy of tasks positively 
reinforcing the student for attention on the 
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course content is the basic step in having the 
student work toward greater achievement. Al- 
though the present study did not have actual 
classroom material as a learning task, it may 
be assumed to some degree that the findings in 
this study may be generalized to specific 
classroom learning material. 
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SELF-CARE UNIT: 
SOME LESSONS IN INSTITUTIONAL POWER 


ALAN P. TOWBIN? 
West Haven Veterans Administration Hospital, Connecticut 


The hypothesis that the chronic psychiatric patient purposefully pursues a goal 
(continued hospitalization) that has harmful effects on him as a potential par- 
ticipant in the extrahospital community was the initial basis of a rehabilitation 
program. Initial failure led to a revision, which resulted in the return to the 
community of all 15 members, A framework for explaining the failure of the 
first and success of the second phase is presented: the staff’s use of their power 
in defining the hospitalization situation. Exercise of power to redefine the 
hospitalization situation is probably a prerequisite to high discharge rates for 
chronic patients; redefinition of the conditions of admission is suggested as a 
basis for precluding the accretion of a chronic population. 


In the recent past, traditional concepts of 
the chronic mental hospital inmate have 
undergone rapid change. The man once seen 
as the mentally debilitated victim of an ir- 
reversible disease process came to be regarded 
as the unwitting victim of institutional de- 
humanization (Goffman, 1961). More re- 
cently, the assumption that he is a purposeful 
participant in a social system, with goals 
that are deviant in relation to the system 
(Braginsky & Braginsky, 1967; Rakusin & 
Fierman, 1963; Simon, 1965; Towbin, 1966) 
has proven useful in the treatment of the 
chronic psychotic. These explanations may be 
integrated in the hypothesis that by his pur- 
poseful participation in the institutional sys- 
tem, the inmate may be harmed in the sense 
that dependency is fostered, ties to potential 
sources of support in the community are 
weakened, skills become obsolete, and he 
acquires a stigma (Scheff, 1966). This hy- 
pothesis informed the rehabilitation program 
reported here. In its initial form the program 
failed. A revision was instituted which proved 
very successful. In retrospect, the success of 
the revision indicates that modification of 
traditional assumptions about patients is not, 
in itself, an adequate basis for such a pro- 
gtam. Traditional assumptions about the role 
of the staff, the conditions they create for 
the admission and discharge of inmates, and 


1 Also affiliated with Vale University. Requests for 
Teprints should be sent to Alan P. Towbin, who 
is now at the Learning Center, Farmington, Con- 
necticut 06032. 


especially their use of their power need to be 
scrutinized and modified. The two programs 
are described, and a theoretical analysis in 
terms of the staff’s role in operating the sys- 
tem is used to explain the failure and success 
of the programs, 


INITIAL PHASE 


The staff of the chronic psychiatric wards 
of the hospital? had long felt that some 
modified living situation was needed to help 
patients make the move back into the 
community. 

It was decided to set up a unit, explicitly 
identified as an experiment, to be evaluated 
at the end of 1 year. Laffal* had established 
a self-care unit at this setting, but extraneous 
factors forced its closing before it could be 
assessed, The plan for the Phase 1 program 
was based on his prospectus. The rationale 
for the unit as set forth at that time was that 


2West Haven Veterans Administration Hospital, 
West Haven, Connecticut. The author is grateful 
to L. B. Fierman, Chief of Psychiatry, and J. 
Levine, Chief of Psychology, for their support of this 
project. Many psychology interns contributed to the 
project as members of the staff group. They include: 
M. Hersen, D. Travland, R. Hartsough, Suzanne 
Woolsey, Michaelene Plotnik, Arlene Brandon 
Marcus, K. Toney, D. Seelig, H. Kahn, and Fredda 
Kelly. Nursing staff members Gladys Punch, Edwina 
Diaz, and Sara Graham were involved throughout the 
two phases of the project. Grateful acknowledgment 
is hereby made to all of them. The original impetus 
for this project came from R. Almond. 

3J. Laffal. Prospectus for a self-care unit. Unpub- 
lished manuscript, 1964. 
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for certain patients, not necessarily all psychi- 
atric patients, an interim setting of maximal 
independence and minimal ties with the hos- 
pital would be of therapeutic value in assisting 
the patient to adjust ultimately to complete 
independence from the hospital. These are 
patients who function adaptively in the insti- 
tution, and do not seem to require complete 
hospital care but seem to be afraid to leave. 
Such patients seemed to have developed an 
adjustment to institutional living which is a 
handicap to independent living. We at- 
attempted to provide them with a setting in 
which they would experience intense social 
support through a high degree of peer-group 
interaction and as much autonomy as can 
be provided in a hospital. The plan for the 
unit was accepted by the chiefs of the ser- 
vices involved and the hospital administration. 
A physically distinct section of a chronic 
ward, a corridor of single and double rooms 
of 16 beds, was set aside for the unit. 

In a prospectus shown to all applicants, a 
time limit of 6 months to 1 year was set 
forth; during the second 6 months, the 
progress of each member was to be reviewed 
the prospectus said, and all would be expected 
to leave the unit by the end of the year. Those 
unwilling or unable to leave the hospital by 
that time, could seek a transfer to another 
ward or institution, or reapply to the unit. 
Group therapy was offered to the unit (and 
never accepted) and individual therapy was 
made available where staff time permitted. 

The following criteria were used for accept- 
ance of members: that they be engaged in 
some regular activity in or out of the hos- 
pital—educational, prevocational, or voca- 
tional. Applicants who were not on intermin- 
able drug therapies were preferred. All appli- 
cants being treated with drugs picked up their 
drugs weekly at the pharmacy and took the 
medication as prescribed, as they would if 

, living in the community. Applicants who were 
legally or Veterans Administration incompe- 
tent were rejected. They were advised on 
means of eliminating this bar to their accept- 
ance. It was recognized that many applicants 
were attracted to the unit by the privileges 
of single or double rooms, freedom from staff 
intrusion, and extended sign-out, and could 
not muster a convincing statement that they 
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wanted to or hoped to leave the hospital. 
However, patients who gave this impression 
were not rejected if they were acceptable 
according to the explicit standards. 

Members were told to organize themselves 
into a self-governing body with elected of- 
ficers. This body was held responsible for all 
administrative functioning of the unit. After 
the first 10 or 12 members were selected by 
the staff, the staff passed on the eligibility 
of an applicant but the unit voted on whether 
or not to accept him. The unit was respon- 
sible for all decisions concerning discharge 
from the unit, disciplinary procedures for 
members, room assignments, reporting head 
count daily, as well as decisions concerning 
passes and leaves of absence. All members had 
the privilege of signing out ad lib between 
eight o’clock in the morning and midnight 
during the week, and for the entire weekend. 
Thus the unit had its own “turf” which re- 
quired no contact with or supervision by 
nursing staff, although members could and 
did seek such contact. 

After about 2 months, things seemed to be 
going well. There had been 17 patients on this 
unit. Three patients had been discharged from 
the unit since it opened—two to take jobs 
in distant communities, one to continue 
psychotherapy as an outpatient, Eight out 
of the 14 patients currently on the unit were 
working, in or out of the hospital. A group 
cohesiveness developed: patients wished one 
of their number to be called if a problem 
arose between nursing staff and a self-care 
patient. Since they took that stand, there had 
been no management problems with self-care 
patients. 


SECOND PHASE 


By the end of the year, however, a gloomier 
picture emerged. The laissez-faire policy had 
produced little result in terms of the hypothe- 
sis. The ward staff (nurses, nursing assistants, 
psychology trainees, and ward chief) spent 
several months reviewing the results and de- 
veloping a plan to remedy the faults identified 
in the old one. It was felt that the patients 
had used the situation to continue their insti- 
tutionally based lives with no action directed 
toward leaving. On the 4th of May, 1967, 
the second phase of the self-care unit (SCU) 


Lessons IN INSTITUTIONAL POWER 


was initiated. The following memorandum was 
sent to the chiefs of the psychiatry and the 
psychology services. Copies were presented 
to the members of the SCU as well. 


The self-care unit was officially opened on Febru- 
ary 11, 1966. A total of 22 patients had been on 
the unit in the following year. The outcome of this 
year is as follows: six were transferred to other 
wards, for the most part, back to G6W. Five of these 
22 are on the unit now (and have been for six 
months or more) and are still employed. Five others 
are employed or in a full-time rehabilitation pro- 
gram. The remaining patients were discharged MHB; 
of these six, one is deceased, one has been readmitted 
following a period of employment: the remaining 
four are out of the hospital and employed full time. 
The “success” rate for this year is less than 20%. The 
policy of “laissez-faire” of setting no specific goals 
or standards, requiring no planning by SCU members 
for leaving the hospital, has not produced much in 
results. There has been some progress, especially in 
the increased responsibility of the members for each 
other, and the development of a more effective 
group spirit. 

The members of the SCU have recognized that 
many of the patients on the unit have made no plans 
and are, in effect, idle. Their efforts to set norms 
for their associates has not been effective—perhaps 
because the staff has not participated enough. The 
staff of the ward has, therefore, decided to change 
the policy of the SCU as follows: 


1. It is the aim of the SCU program to return 
patients to community living as quickly as pos- 
sible. It is clear that a condition for com- 
munity living for SCU members is a full-time 
job. We want to implement this policy as 
follows: 

2. The SCU group is assigned the task of helping 
its members develop a plan for leaving the 
hospital. 

3. This plan must have as its aim, obtaining a 
full-time job outside the hospital. 

4. By two weeks from Monday each member 
will be expected to have come up with a 
detailed plan for discharge that is acceptable 
to the staff. No plan will be presented to the 
staff that has not been presented to the SCU 
group, and approved by the group. Any plan 
must include activity logically related to a dis- 
charge plan, and must be detailed with respect 
to the time the patient expects it to take. 

5. The SCU is expected to make a weekly report 
on each member’s progress—the development 
of his plans, and the steps he is taking toward 
realizing the plan. 

6. Each member will have not more than 6 
months to realize his plan, and leave the 
hospital. 

7. Decisions about passes, leaves, etc, will be 
made by the SCU group, which should consider 
how these passes, leaves, etc, will contribute 
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to the realization of the member’s discharge 
plan. 

8. It is up to the group to make the necessary 
arrangements for implementing this policy— 
including increasing the frequency of meetings, 
if necessary. 

9. The SCU as a group will be held responsible 
for carrying out this policy. Privileges of the 
group may be withheld if the group fails to 
carry out its responsibility to help each other 
prepare for a return to the community. 

10. The ward staff will not meet with the SCU 
group, but invites the SCU to send a repre- 
sentative to a staff meeting, at which the 
progress report of the SCU will be reviewed. 
The time of this meeting will be announced. 
The ward staff will provide in writing an 
evaluation of the work of the SCU on its task 
of helping its members arrange to leave the 
hospital. 

11. The SCU may, as a group, make recommenda- 
tions to the staff, with respect to transfer, 
discharge, or other changes in the status of the 
members of the unit. 


Many of the organizational features of this 
plan were taken from Fairweather’s (1964) 
description of the “step-program” carried out 
under his direction. In the weeks that fol- 
lowed, there was a high rate of interaction 
between the unit and the staff. The require- 
ments of the task were spelled out in more - 
detail, and the unit was confronted with the 
assumptions which seemed implicit in the way 
they did not carry out the task. For example, 
the unit would present a member’s medical 
complaints as grounds for that member’s 
inactivity. Staff pointed out that experts 
(physicians) were required for determination 
of physical disability; if the unit needed 
expert opinions, staff would help to obtain 
them. This approaach proved particularly ef- 
fective in reducing the frequency of claims of 
physical disability. Similarly, it seemed im- 
plicit in the unit’s messages to staff that mem- 
bers who claimed they had permanent and 
total disability for which hospital treatment 
could do nothing, expected to be boarded at 
this institution indefinitely. The unit was in- 
formed that the staff would help anyone who 
claimed that position find another place to 
live. 

After about 1 month it seemed to the staff 
that members had united in opposition to the 
requirements of the program, and were not 
helping each other make plans to leave. They 
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were working together to help each other stay. 
The staff decided to lift the unique privilege 
of this group—that of staying out until mid- 
night on their own cognizance. This was a 
blow to the prestige of the unit. It took about 
2 months longer for the group to reach a 
degree of effectiveness in carrying out the task 
sufficient to move the staff to restore this 
privilege. The staff’s weekly evaluation of the 
unit’s discharge plans and progress reports 
maintained pressure on the unit to take action. 

The unit seemed to respond by taking up 
each nonconforming member in turn and ap- 
plying pressure, as a group, to get him going 
on a plan, Each member showed transitory 
“disturbance” but eventually complied. Fif- 
teen patients were members of the unit at the 
end of June 1967. By November 3, 1967, 6 
months from the inception of the second 
phase, 13 had been discharged, maximum hos- 
pital benefit. A month later, the fourteenth 
man left (23 days over his 6-month limit) 
and 2 months later, the fifteenth was dis- 
charged (within the 6-month limit for him). 

Three of those who departed were “drop- 
outs” who left the hospital with little or no 
planning. The 12 graduates of the program 
include 8 with full-time jobs and a place to 
live. A ninth member, having reached the 
appropriate age, elected to retire. In keeping 
with his approved plan, he went from the 
hospital to the railroad station, where he 
boarded a train for Miami, Member 10 de- 
cided to live with his wife, remain a psycho- 
logical invalid, and do the kind of volunteer 
work in the community that he had done in 
the hospital for many years. The last two men 
to leave had a place to live and were in full- 
time vocational rehabilitation programs, 

A brief characterization of the SCU group 
follows. Of the 15, 10 had been in the unsuc- 
cessful Phase 1 program as well. Ages ranged 
from 28 to 60, with a mean 45 years, They 
had been hospitalized for between 3 and 77 
months, with an average stay of 39 months. 
The range of Phase 2 stay was 63 to 206 
days; the average, 121 days, Diagnoses in- 
clude seven schizophrenic or other psychotic 
reactions, four depressive reactions, and four 
personality or psychophysiological disorders; 
four of the members had been diagnosed as 
having chronic brain syndromes as well. The 
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group was occupationally diverse: five un- 
skilled laborers, one salesman, seven semi- 
skilled or skilled workers, one musician, and 
a nurse. Similarly their years of education 
ranged from 4 to 14, with an average of 10 
years. Marriages, prior or current, are more 
frequent than is found in unselected popula- 
tions of chronic psychiatric patients; seven 
were married, five separated or divorced, and 
three never married. While all 15 were dis- 
charged, three left without having an ap- 
proved plan for their financial and living ar- 
rangements, Of the 12 graduates, seven sought 
outpatient follow-up treatment, and of these, 
five were discharged on psychotropic medica- 
tion. 

At this writing (6 months’ follow-up)* only 
one of the members is known to have applied 
to this facility for readmission. He was ad- 
mitted, but returned to full-time employment 
after 1 month. But most striking to the staff 
was the fact that although the members had 
the option of making other arrangements for 
institutional living, they set a date for leav- 
ing in a planned way, and left. To members 
of the hospital staff who knew them, these 
patients appeared entrenched and intractable 
in their attachment to institutional living. 
And all had practical or realistic reasons for 
this attachment, in the sense that Simon 
(1965) has brought out; maybe they are 
crazy but they are not dumb. 

Mr. A. was such a patient. He bitterly re- 
sented his wife, a well-educated woman, who 
seemed to delight in having him jailed for 
fracases that many lower-class families seem 
to tolerate without calling the police, He is 
almost illiterate, and claimed impotence in the 
face of all the forces arrayed against him by 
people who wanted to take advantage of him 
or who regarded him as a discredited person. 
He claimed he could not arrange a divorce; if 
he left the hospital and worked, his wife’s 


*Using the procedure described by Fairweather, 
Simon, Gebard, Weingarten, Holland, Sanders, Stone, 
and Reahl (1960), 9 of the 12 “graduates” have 
been contacted for a 6-month follow-up. One of these 
had changed from vocational training to full-time 
employment status. Two, including the one member 
rehospitalized for a month, are described as getting 
along poorly, the rest satisfactorily. In other respects, 
the condition of the members appears about the same 
as at discharge, 
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creditors would gobble up his earnings. He 
had presented for admission with “paranoid” 
complaints about his fellow workers. Never- 
theless he got his old job back, and left the 
hospital. He has since been divorced. 

Mrs. B. is typical of the patients who use 
physical symptoms to disqualify themselves 
for independent living. She had been hospital- 
ized for 5 years prior to the inception of the 
second phase of the SCU. Her prior psychi- 
atric hospitalizations total an additional 8 
months. While a patient on the locked service, 
she was grossly delusional and intermittently 
assaultive. Her entire hospital course was 
characterized by severe and persistent hypo- 
chondriasis, whiney complaining, tearfulness 
and dark hints that she knew the staff was 
keeping a diagnosis of cancer from her. She 
claimed that she could never work as a nurse 
again since her psychiatric hospitalization dis- 
qualified her, and her physical symptoms 
made regular work impossible. 

Her position is of particular interest since 
she seems, in retrospect, to have maintained 
it by playing one staff member off against 
another, over the years. As a trained nurse, 
she elicited reports from the medical and 
surgical specialists summoned by her psychi- 
atric resident for consultation on her recur- 
rent complaints of back pain and constipation. 
She would then use the information obtained 
to try to discredit the ward nurse to the resi- 
dent, and play on his fears that he might be 
overlooking something or neglecting her. The 
outcome of her interaction with the resident 
was almost invariably another round of con- 
sultation requests and revision of her medical 
tegimen. When she attempted to go this route 
with the unit, the staff worked out a plan 
which was designed to prevent their being 
“split” by the patient (Stanton & Schwartz, 
1954). Consultants were asked not to discuss 
their findings with her. After the neurosurgi- 
cal consultant examined her back, the psychi- 
atric resident told her authoritatively that 
there were no positive findings and that there 
was no objective basis for regarding her as 
physically disabled. This had the remarkable 
effect of appearing to relieve the patient. She 
dropped her physical complaints, perked up, 
found an apartment and a job as a nurse 
within a few weeks, and cheerfully departed. 
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She refused further outpatient psychotherapy 
on the grounds that she was feeling on top 
of things, though she continues to take a 
variety of medications. 


Discussion 


The patient group was heterogeneous with 
respect to factors known to be related to 
length of hospitalization, successful rehabili- 
tation, and low readmission rate. Yet, in terms 
of voluntary departure to a supportive social 
setting under tenable economic conditions all 
of the graduates must be considered successes. 
Long-term follow-up may reveal differences 
in adjustment which can be correlated with 
these prognostic indicators, but at this point 
the correlation is zero because the outcome 
has such low variability. The same can be 
said for “treatment” effects. Seven of the 12 
graduates were treated with individual or 
family therapy during hospitalization, though 
two of these dropped out of treatment within 
2 months of discharge. This highly regarded 
treatment is apparently unrelated to outcome 
in this sample, It seems that there must have 
been some factor (or factors) operating in this 
situation so powerful in its effect as to swamp 
the variability in the sample and produce a 
highly uniform outcome. The retrospective 
character of this analysis places severe limi- 
tations on the conclusions which can be 
drawn from it about the nature of the causal 
factors. With the understanding that the con- 
clusions about what factors contributed to 
the success of the second phase are tentative 
and require more controlled study, they shall 
be set forth here. 

The failure of the first and the success of 
the second phase of the SCU can be explained 
by the same set of factors if psychiatric hos- 
pitalization can be considered as a system 
with input, maintenance, and output phases. 
This system bears a relation to the rest of 
society comparable to that which the penalty 
box bears to the game of ice hockey. A player 
is removed from the game if he violates cer- 
tain rules; he goes to the penalty box for a 
fixed period, leaving his team short one man. 
The referee makes an authoritative decision, 
based on his position and expertise, that a 
player’s actions disqualify him. The penalty is 
of limited duration, but while he is out of the 
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game the player cannot influence it or the 
duration of his exclusion. He returns to the 
game automatically at the end of the penalty; 
his return to participant status calls for no 
authoritative intervention. Now, the citizen is 
transformed into an inmate by the authorita- 
tive opinion of an expert (psychiatrist), who 
decides that the citizen has a condition that 
indicates a need for hospitalization. The de- 
cision is based on expert assessment of the 
citizen’s mental state, not on observable ac- 
tions alone. Once the citizen is declared in 
need of hospitalization, he is assigned the 
status of “inpatient” and disqualified from 
participating in the game, so to speak. That is, 
he is disqualified from managing his own af- 
fairs. He is unable to influence the duration of 
his stay directly; efforts to shorten or prolong 
his stay are taken as mere manifestations of 
his condition, not requests to be granted or 
denied. Thus the patient is not only disquali- 
fied, he is discredited. Unlike the penalized 
skater, the former citizen can leave the pa- 
tient status only if there is a decision comple- 
mentary to the one that placed him in it. The 
expert, now responsible for the treatment of 
the patient’s condition, must opine that the 
condition is sufficiently ameliorated that hos- 
pitalization is no longer needed. 

This analysis leaves the status of the SCU 
member clouded. As was written in the pros- 
pectus, these were patients who no longer 
needed complete hospital care, but who seemed 
afraid to leave. What was not written, but 
believed, was that if we insisted on discharg- 
ing them, they would do as they had done in 
the past: behave in ways that would make us 
regret our decision. The members of the unit 
were told they no longer needed hospitaliza- 
tion, The staff would appear to have effected 
the transformation of inmate back to citizen. 
But what we took away with the right hand, 
we gave back with the left. We ordered no 
discharge, urged no one to leave, kept no 
check of members’ plans, and capped it all by 
the paternal gesture of providing a staff ad- 
visor to the unit. The staff had created an 
“experimental” situation, and we waited to 
see what effect it would have on what was 
defined as the crucial factor—the members’ 
motivation to leave. As Towbin (1966) has 
noted, the clinical investigator is attracted to 
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the idea that he is a mere observer. But the 
members knew better. They knew that the 
staff has the power to order discharge for 
those who no longer need hospitalization, and 
the responsibility to use that power. Since the 
staff ordered no discharges, and was not trying 
to get anyone to leave, they were tacitly en- 
dorsing the members’ stay. Motivation to re- 
main in the hospital had been added to the 
staff’s list of indications of continued need 
for hospitalization. We tacitly affirmed the 
existence of something we had explicitly de- 
nied—the members’ need to be inpatients. 

When we reviewed the failure of Phase 1, 
it was with some annoyance at the members 
for having taken advantage of us, and “holed- 
up” on the unit. Here we reacted much like 
the parents in Johnson’s (1949) study, tacitly 
encouraging behavior about which we ex- 
pressed dismay when the behavior could no 
longer be overlooked. Similar phenomena are 
described by Easson and Steinhilber (1961) 
and by Sargent (1962). 

Of course, the staff did not carry out its 
work in this duplicitous way self-consciously 
and reflectively. At the time we planned the 
second phase we did have some sense that we 
had “let the patients get away with some- 
thing” and decided to force the members to 
take a stand: return to the community or in- 
stitutional living elsewhere. I believe that 
staff anxiety lest members act “crazy” or 
antisocially if discharged unwillingly, was 
probably ameliorated by the provision of this 
alternative. The experimental attitude was 
abandoned; their motivation to stay was no 
longer relevant. Departure of members was 
our goal. If anyone held out for institutional 
living, he could get it somewhere else, and for 
no better reason than that he preferred it. 
Authoritative support for the members’ pres- 
ence in the hospital, in the form of an identi- 
fied indication of need for hospitalization, 
was withdrawn. There were no “escape 
clauses” in the second phase comparable to 
the vague “staff review” and provision that 
members may reapply if they haven’t left in 
a year, provisions with which the Phase 1 
prospectus abounded. 

In the context of the traditional role of the 
expert, it might be said that we were merely 
Providing an exit from inpatient status, by 
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declaring that members were not continuing 
to live in the hospital because they needed it. 
That is true, as far as it goes, but doesn’t fully 
characterize the action. Up to 6 months more 
of living in the hospital was permitted, but 
members could choose to transfer to another 
institution without having to elicit an expert 
opinion that they needed it! Their preference 
would not be an indication, but a request to 
be granted if possible. No unit member took 
this alternative. Nor was the 6 months of per- 
mitted stay put forth as an interval during 
which the staff would “treat” members for 
their condition, so that they could qualify to 
leave. It was, on the contrary, a period during 
which they were required to carry out the 
task of making the transition from institution 
to community dweller. 

The staff no longer related to the unit mem- 
bers in accord with traditional mental hospital 
practice. A voting membership of the staff 
group was established. This staff group ad- 
ministered the remainder of the ward by deal- 
ing with patients only via the task groups 
into which they had been organized. This 
format was then applied to the SCU. The unit 
members met with the staff as a group, and 
when the staff reviewed the unit’s work and 
Prepared an evaluation, a unit member was 
Present as a representative." 

A consultant who was not part of the ward 
staff was offered to the unit to help them with 
their organizational problems, and was ac- 
cepted by them. He never discussed his work 
with ward staff. The rate of interaction be- 
tween unit and staff increased markedly. Unit 
Members made more active use of the voca- 
tional rehabilitation services hitherto avail- 
able but largely ignored. The idea that mem- 
bers were “sick” was actively discouraged in 
unit-staff communication, by the staff’s substi- 
tution of “member” for “patient” in the mem- 
oranda. The members opposed this for a time, 


s This reorganization of the staff seemed to con- 
tribute to the stafi’s effectiveness. It provided op- 
portunity for discussion of the strains created by 
the abandonment of traditional modes of relating to 
inmates, as well as providing new roles as partici- 
pants in decision making about ward policy and 
action. This group activity seemed to enhance the 
Staff's cohesiveness and morale, which were threat- 
ened by the massive opposition of the SCU members. 
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always referring to themselves as “patient so- 
and-so” rather than mister or misses. 

The staff’s premise was that the secondary 
effects of hospitalization take time to over- 
come—loss of social contacts, drop in em- 
ployability, etc., and that, therefore, we were 
not ready to discharge members immediately. 
This was the rationale for the task given the 
group, and the time to carry it out. In keep- 
ing with this task orientation, we frequently 
pointed out indolence and opposition on the 
part of the group, but never challenged their 
competence to carry out the task of planning 
their own lives. The goals of the staff, their 
standards and criteria, and the sanctions 
available to them were out in the open. It was 
not necessary to appeal to the staff’s profes- 
sional judgment or clinical experience or even 
the ultimate beneficence of our motives, let 
alone our superior judgment or wisdom in 
justifying our use of sanctions. We did not 
attempt to resolve conflicts between staff and 
unit by discrediting the members, and the 
complementary self-discrediting mentally sick 
role behavior of the members was not elicited. 

The staff had shifted their role radically 
in the second phase of the SCU. Since this 
change is a departure from the traditional 
transformation and maintenance operations 
used as a framework for this analysis, it is 
easier to say what Phase 2 was not than to 
say what it was. Continued hospitalization 
had come to mean continued living in the 
hospital in Phase 2. It no longer expressed an 
expert opinion about the condition of a man’s 
mind, nor were members regarded or related 
to as if they were disqualified or discredited. 
If anything, the 6-month limit expressed an 
opinion about the inmates’ relation to the 
community, and an uneducated guess at the 
time necessary to carry out the task of chang- 
ing that relationship. The staff were not 
treaters, the members not patients. The staff’s 
definition of its responsibility changed from 
assessment of treatment needs and provision 
of a therapeutic milieu, to the assignment of 
a task and supervision of task performance. 
This included providing incentives for task- 
relevant behavior and deterrents to defiance. 
The status of SCU members was that of peo- 
ple who were temporarily in a living arrange- 
ment located in a hospital, while they did 
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some work under expert (?) supervision. Exit 
from this status was defined by a time limit 
set by staff and the effectiveness of the mem- 
bers’ performance. The benefits of hospital 
living were no longer available indefinitely, 
and had become untied from the type of self- 
discrediting behavior with which these people 
had hitherto affirmed their “need” for hos- 
pitalization. Indeed, those benefits under staff 
control were contingent on competence and 
effectiveness. The only basis on which the 
members could remain in the hospital was out- 
right defiance of the staff; for example, a kind 
of sit-down strike might have succeeded in 
blocking the staff. The staff, it seems fair to 
say, had changed the definition of the situa- 
tion: “hospitalization.” 

The highly selected nature of this group of 
chronic patients raises the question of the 
generality of these results. The writer has 
found that applying the same format to an 
unselected group of long-term psychiatric in- 
patients produced similar immediate results. 
That is, of 15 such patients 10 chose to leave 
the hospital from a program for which they 
had not volunteered, and did so in a planful, 
orderly way within a 6-month period. The 
other five chose transfer to other institutions. 
Unfortunately, no follow-up data has been 
obtained on these 10. Rothstein (1966) un- 
dertook a very similar program with larger 
numbers of chronic psychiatric inpatients, and 
reports on a 4-year follow-up. The program 
he describes also involves a staff-induced 
change in the definition of the hospitalization 
situation. Administration was individual, 
rather than via a patient-group as in the SCU. 
The conformity pressure exerted by the SCU 
group on its members, springing in part from 
the staff’s power to lift the privileges of the 
entire group, seems to have had its counter- 
part in the staff’s power to effect a “quick 
discharge” of any patient who refused to con- 
form to the program or who openly defied it. 
The threat of prompt discharge induced most 
of the patients to expose themselves to a pro- 
gram of progressive responsibility and reward, 
leading to a self-chosen form of extrahospital 
living. Emphasis was placed on the exit phase, 
but the rules around admission were also 
changed so that if patients sought readmis- 
sion, they would be required to reenter the 


same discharge-oriented program. He reports: 


Four years subsequent to discharge, 87 per cent of 
the patients were out of the hospital; the follow-up 
population as a whole averaged 83 per cent of time 
out of the hospital during the four-year post-dis- 
charge period; 50 per cent of these patients had spent 
no time in the hospital during this four-year period 
[p. 367]. 


In the framework of this analysis it seems 
that in both the Rothstein (1966) report and 
the program described here, the staff-produced 
redefinition of the situation of being hospital- 
ized was the overriding factor leading to the 
choice of departure from the hospital. This re- 
definition was manifested in changes in the 
relations between staff and inmates in the 
implications of being in the hospital for one’s 
qualifications and credibility, in the means 
available to influence the staff, as well as in 
the staff’s use of its expertise. The conditions 
for maintenance of—and exit from—the pa- 
tient role had been transformed. From this 
perspective, the chronic psychiatric patient 
may be defined as a person who, in pursuit of 
his own interests, has adapted to a situation 
as defined by the staff of the institution, The 
staff may be characterized as a group which 
usually behaves as if it were innocent of its 
power to define and redefine the situation. 

The staff responsible for the programs re- 

ported here was grossly innocent of the radi- 
cal changes it was imposing on a traditional 
mental hospital setting, and even more inno- 
cent of the power it wielded in doing so. One 
simply does not feel like a representative of 
society or the institution; one feels like one- 
self. We ignored the now obvious fact that 
power suffuses and colors all our actions in 
the mental hospital. An oriental sage said, 
“The fish does not know it lives in water, and 
does not care to have this pointed out.” The 
mental health worker seems not to know he 
works in a power field. Many years ago Stan- 
ton and Schwartz (1954) called attention to 
this phenomenon. 
And in spite of the political problems implicit in 
setting up a highly specialized type of government, 
the hospital attacked these problems handicapped by 
a tacit prohibition upon discussion of government, 
politics, power, influence, and sanctions [p. 244]. 


This innocence of power and the unwitting 
establishment of contingencies for access to 
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benefits is no less relevant to the input side of 
the hospital system than to the maintenance 
and output phases. The expert who makes the 
admissions decision is serving as gatekeeper to 
a very dense accumulation of societal re- 
sources. As Simon (1965) has noted, for those 
who are reluctant to leave the public mental 
hospital, these resources are nowhere else in 
as rich supply. Seen in terms of their use of 
power, the admitting staff sets standards of 
behavior (taken from psychiatric theory) as 
the condition of access to the resources ac- 
cumulated in psychiatric hospitals. That is, 
the condition of access is described by the 
staff’s concept of mental illness. At least for 
the applicants who become chronic inmates 
of mental hospitals, the power of the admission 
staff operates to elicit self-discrediting and 
disqualifying behavior (Braginsky & Brag- 
insky, 1967). From a somewhat different per- 
spective, Wood, Rakusin, and Morse (1965) 
make reference to this problem in their de- 
scription of the conflict of the admitting resi- 
dent. 


The residents were alert to the fact that patients 
learn quickly that they would be admitted if they 
threaten violence. The possibility that they might be 
outwitted by a patient constantly plagued the admit- 
ting residents [p. 58]. 


As long as we operate in an atmosphere of 
innocence of our power, we can be very effec- 
tive in innocently eliciting and fostering the 
behavior which confirms our theories and as- 
Sumptions, and our professional role as well. 
One alternative is a radical transformation of 
the admissions situation, a redefinition com- 
Parable to that described here for the mainte- 
nance and exit phases of the mental patient’s 
tole. For example, applicants for admission 
would immediately be offered membership in 
self-care units, or some comparable organiza- 
tion, with the criterion of “admission” being 
their preference for an extracommunity living 
arrangement. Obviously, such a change in ad- 
missions practice implies far-reaching changes 
in the relation between the psychiatric in- 
Patient facility and the community it serves, 
and does not lie within the power of the 
Mental health professionals to bring about 
unilaterally, 
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CONCLUSIONS 


The results of the programs reported here 
and the analytic framework used to explain 
their failure and success seem to be in keep- 
ing with a growing body of evidence and 
critical thought that the mental hospital as a 
system for the use of social power and the 
mental illness concept as the rationale for 
decision making in that system have serious 
harmful effects on the population they are 
meant to benefit (Ellsworth, Foster, Child- 
ers, Arthur, & Kroeker, 1968; Goffman, 1961; 
Kantor & Gelineau, 1965; Mendel, 1966). 
Many mental health professionals hope that 
the changes represented by the community 
mental health center approach will preclude 
or minimize the accretion of chronic inmates. 
This appears to be a forlorn hope as Kraft, 
Binner, and Dickey (1967) have shown in a 
5-year study of turnover in such a center. 
Experience indicates that changing the physi- 
cal setting, providing more “attention,” and 
even abandoning traditional assumptions about 
the chronic mental hospital inmate, is not 
enough to increase discharge rates signifi- 
cantly. The traditional definition of the hos- 
pitalization situation, as established and main- 
tained by the power of the staff, must be 
modified if the stasis of chronic populations is 
to change. The mental hospital is concep- 
tualized as an organization for the distribu- 
tion of societal resources; to the population 
served, psychiatric therapies are probably the 
least important benefits. Traditional admis- 
sion practice makes self-discrediting behavior 
the ticket for access to these resources. A 
basic redefinition of the hospitalization situa- 
tion would involve abandonment of self-dis- 
creditation as an access condition. Even if 
mental health professionals endorsed this 
change, it would undoubtedly be appreciated 
by the public as a fundamental change in the 
institution-community relationship, and is not 
likely to be accomplished without controversy. 
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SENSATION SEEKING, IMPULSIVITY, AND 
PSYCHOPATHIC PERSONALITY 


R. BLACKBURN ! 
i Broadmoor Hospital, Crowthorne, Berkshire, England 


The hypothesis that sensation seeking is associated with impulsivity and 
psychopathic personality traits was investigated in a sample of psychiatric 
offenders. The Sensation Seeking Scale (SSS) was found to correlate positively 
and significantly with MMPI measures of impulsivity, extrapunitiveness, and 
psychopathy, and with scales of overt and covert hostility. However, diagnosed 
psychopaths did not score significantly higher on SSS than nonpsychopathic 
offenders. Evidence is discussed which suggests that while sensation seeking is 


related to psychopathic personality characteristics, it may also be common 


among certain psychotics. With this qualification, the results support the 


hypothesis. 


Studies of exploratory behavior and the 
consequences of sensory deprivation in both 
animal and human Ss have indicated that 
_ living organisms are motivated by the ab- 
sence or reduction of certain kinds of stimulus 
"variability (Berlyne, 1960; Fiske & Maddi, 
1961; Jones, 1966). An objective measure of 
_ the need for stimulus variability, the Sensa- 
= tion Seeking Scale (SSS) was developed by 
Zuckerman, Kolin, Price, and Doob (1964), 
and has been shown to predict premature 
quitting from and restless bodily movements 
in sensory deprivation and confinement ex- 
periments (Zuckerman, Persky, Hopkins, 
Murtaugh, Basu, & Schilling, 1966), and 
volunteering for unusual psychological studies 
(Zuckerman, Schultz, & Hopkins, 1967). Also, 
Farley and Farley (1967) found a significant 
Positive correlation between SSS and the 
_ Extraversion scale of the Eysenck Personality 
Inventory, as predicted from the hypothesized 
telationship between extraversion and stimu- 
lus seeking (Eysenck, 1964). These studies 
80 some way toward establishing the con- 
Struct validity of the scale. 

Evidence of a positive relationship between 
SSS and field independence as measured by the 
Embedded Figures Test led the authors to 
Teject the possibility that the scale is a mea- 
sure of impulsive or psychopathic character- 
istics (Zuckerman et al., 1964). However, 
Subsequent findings cast doubt on this con- 
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clusion. Zuckerman et al. (1967) report a 
negative correlation between SSS and Gough’s 
Socialization scale, and a small but significant 
positive correlation with the MMPI Ma 
(Hypomania) scale, which has been suggested 
to be a measure of impulsivity (Kassebaum, 
Couch, & Slater, 1959). Moreover, the rela- 
tionship between extraversion and SSS would 
suggest that the latter is related to impulsiv- 
ity since this is a major primary trait con- 
tributing to the introversion-extraversion 
dimension (Eysenck & Eysenck, 1963). 

A need for excitement has long been con- 
sidered to be a major characteristic of 
psychopathic personalities (e.g., McCord & 
McCord, 1964), and Quay (1965) has em- 
ployed the concept of stimulus seeking to 
account for impulsive thrill seeking in such 
individuals. He proposes that much psycho- 
pathic behavior may represent a pathological 
need for sensory input, consequent on low 
physiological reactivity or rapid adaptation to 
stimulation, a proposition which has much 
in common with Eysenck’s theory of the rela- 
tionship between extraversion and antisocial 
behavior (Eysenck, 1964). Quay’s hypothesis 
implies a positive relationship between stimu- 
lus seeking and impulsivity, and also that 
psychopathic individuals should show higher 
levels of stimulus seeking than nonpsycho- 
paths. The present report concerns a test of 
these predictions. It was hypothesized that 
contrary to the earlier assertions of Zucker- 
man et al. (1964), SSS would correlate posi- 
tively with questionnaire measures of impul- 
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sivity and psychopathic attributes, and that 
psychopaths would score more highly than 
nonpsychopaths on SSS. 


METHOD 


The Ss were male patients admitted to a maximum 
security hospital for psychiatric offenders over a 
12-month period, were literate, able, and willing 
to complete psychological tests, and whose “cannot 
say” score on the MMPI did not exceed 50. The 
resulting sample consisted of 83 patients, about 70% 
of the total intake. Their mean age was 32.59 
(SD = 10.58). Mean WAIS Full Scale IQ of 60 Ss 
who completed this test on admission was 99.53 
(SD = 11.99). In terms of the 1959 Mental Health 
Act for England and Wales, 54 of the patients were 
classified as mentally ill (mostly schizophrenic) and 
29 as psychopathic disorder. In clinical terms, how- 
ever, only 21 of the latter group were subsequently 
diagnosed as psychopathic disorder or psychopathic 
personality. 

The following tests were completed by patients 
within a week of admission: (a) the male form of 
the Sensation Seeking Scale (Zuckerman et al., 
1964); (b) the Bendig Covert and Overt Hostility 
Scales (CH and OH) (Bendig, 1962); and (c) the 
MMPI, from which, as well as the regular scales, 
the following additional measures were scored: 
Welsh’s Anxiety (A) and Repression (R) (Welsh, 
1956); Gough’s Impulsivity (Jm) and Social Par- 
ticipation (Sp) (Dahlstrom & Welsh, 1960); Extra- 
version (Ex) (Giedt & Downing, 1961) ; the Hostility 
Battery (Foulds, Caine, & Creasy, 1960), which 
yields scores of General Hostility (GH) and Direc- 
tion of Hostility (DH), positive scores on the latter 
indicating intropunitiveness. The various hostility 
scales were included in the investigation to measure 
antisocial tendencies. With the exception of Welsh’s 
A, the remaining additional scales have all been 
found to have substantial loadings on the MMPI 
second factor, which has been identified as intro- 
version-extraversion (Corah, 1964; Kassebaum et al., 
1959). 


RESULTS 


The mean score of the sample on SSS was 
9.94 (SD = 4.44), which is considerably lower 
than that of American students (Zuckerman 
et al., 1964) or British apprentices (Farley 
& Farley, 1967). However, this can probably 
be accounted for by the higher mean age of 
the present sample, since SSS correlated 
significantly with this variable (7 = —.333, 
2 < .01). The correlation of SSS with IQ was 
positive, but insignificant (r = +.186). 

The relationship of SSS to personality and 
diagnostic measures is shown in Table 1. Only 
residualized coefficients are reported, age being 
partialed-out. 
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TABLE 1 


CORRELATIONS OF THE SENSATION SEEKING SCALE WITH 
MMPI anp HOSTILITY SCALES, 
AGE PARTIALED-OUT 


MMPI Partial r 
L (Lie) —.256* 
F (Validity) 302" 
K  (Defensiveness) —.238* 
Hs (Hypochondriasis) 051 
D (Depression) —.182 
Hy (Hysteria) —.041 
Pd (Psychopathic Deviate) .249* 
Mf (Masculinity-Femininity) 103 
Pa (Paranoia) .265* 
Pt (Psychasthenia) 135 
Se (Schizophrenia) .222* 
Ma (Hypomania) A67*** 
Si (Social Introversion) —.013 
A (Anxiety) 089 
R (Repression) —.359*** 
Ex (Extraversion .213 
Im (Impulsivity) .393*** 
Sp (Social Participation) —.134 
GH (General Hostility) .258* 
DH (Direction of Hostility) —,389*** 
CH (Covert Hostility) +291" 
OH (Overt Hostility) .283** 

Note, —N = 83, 
* p <05. 
"tp < 01 
ed < 001 


The highest correlations are those with 
scales relating to impulsivity, namely Ma, Im, 
and R (Blackburn, 1968a; Kassebaum et al., 
1959). Correlations with other scales relating 
to introversion-extraversion (Ex, Sp, Si) are 
in all cases not significant, suggesting that the 
correlation of SSS with extraversion (Farley 
& Farley, 1967) is carried largely by the vari- 
ance attributable to the impulsivity rather 
than the sociability component of the intro- 
version-extraversion dimension. Significant 
correlations are found with all the hostility 
measures, but more particularly with extra- 
Punitiveness (DH) and with the Pd scale. 
All these correlations are consistent with the 
hypothesis that SSS is related to both im- 
pulsivity and to psychopathic characteristics. 

Not predicted, but of some interest, are 
the correlations of SSS with Pa and Sc, and 
with the three validity scales (L, B, K): 
Together, these suggest a possible relationship 
between SSS and unconventional attitudes, 
and, perhaps, psychotic symptomatology. 


SENSATION SEEKING 


The hypothesis predicts that patients diag- 
nosed as psychopathic personalities will score 
more highly on SSS than nonpsychopaths. To 
test this, those patients diagnosed both legally 
and clinically as psychopaths were compared 
with 21 Ss drawn from the remainder of the 
sample, matched as closely as possible for age. 
(This matching was carried out “blind” by 
a colleague of the author, who did not know 
the identity or test scores of the Ss.) The dif- 
ference between the psychopathic and non- 
psychopathic offenders is in the predicted 
direction (Table 2) but does not reach an 
acceptable level of significance. 


DISCUSSION 


The correlations of SSS with the personality 
scales clearly indicate a fairly close associa- 
tion between sensation seeking and impulsiv- 
ity, as predicted from Quay’s hypothesis. The 
assertion of Zuckerman et al. (1964) that SSS 
does not measure impulsivity must therefore 
be rejected. Further, the significant correla- 
tions of SSS with Pd, extrapunitiyeness 
(DH), and the scales of covert and overt 
hostility are consistent with the view that 
those who reveal psychopathic characteristics 
in self-report inventories will show relatively 
higher levels of stimulus seeking. On the other 
hand, the lack of a significant difference be- 
tween diagnosed psychopaths and nonpsycho- 
paths on SSS does not support the hy- 
pothesis. This result suggests two possible 
qualifications to Quay’s hypothesis: either not 
all psychopaths show high levels of stimula- 
tion seeking and/or others with alternative 
diagnoses may also show such characteristics. 
There is evidence in the results relevant to 
each of these possibilities. 

_ First, the criterion of psychopathy adopted 
in the present study was essentially the psy- 
chiatric diagnosis, and, as generally used by 
Clinicians, the label refers to a somewhat 
heterogeneous group. Indeed, many of those 
described as psychopaths in the present 
sample were also labeled schizoid, hysterical, 
or inadequate, and it seems likely that by 
no means all of them would fit a more strin- 
gent definition of psychopathy such as that 
Proposed by McCord and McCord (1964), that 
ls, “an asocial, aggressive, highly impulsive 
Person, who feels little or no guilt, and is 
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TABLE 2 


COMPARISON oF PsycHOPATHS AND 
NonpsycHoPATss on SSS 


Age SSS 
Group ON E FEAR Ae EE 
M SD M SD 
Psychopaths 21 | 28.81 | 8.18 | 11.62 | 4.77 
Nonpsychopaths | 21 | 29.00| 8,37 | 10,29 | 4.24 
t At JB. 


* p <.1, one-tailed test. 


unable to form lasting bonds of affection with 
other human beings [p. 3].” It is, how- 
ever, these very characteristics of asocial 
tendency, aggressiveness, impulsivity, and 
lack of guilt, as measured by personality 
scales, with which SSS is most significantly 
correlated in the present sample. An objective 
criterion of psychopathy based on a combina- 
tion of these scales would clearly yield results 
consistent with the generalization that a high 
level of stimulation seeking is characteristic 
of the psychopath, but not perhaps character- 
istic of the more neurotic, guilt-ridden patient 
often labeled as an inadequate psychopath or 
character disorder (cf. Foulds, 1967). 
However, the correlations of SSS with the 
Pa, Sc, and F scales of the MMPI suggest 
that sensation seeking may also be a feature 
of the behavior of some psychotic patients. 
There is evidence that among psychotic of- 
fenders, paranoid schizophrenics are both 
more extraverted and aggressive than non- 
paranoid schizophrenics (Blackburn, 1968b), 
and it is suggestive that of the three patients 
achieving the highest score on SSS in the 
present sample (a score of 20), two were 
diagnosed as psychopathic personalities, the 
third as a paranoid schizophrenic. In other 
words, it may well be that many of the at- 
tributes said to characterize the psychopath 
may also be common among paranoid schizo- 
phrenics. This being so, investigations of the 
possible causal antecedents of stimulus seek- 
ing proposed by Quay (1965) would best be 
approached by selecting Ss on the basis of 
personality characteristics of impulsiveness 
or aggressiveness, rather than in terms of 


diagnostic type. 
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PERSONALITY CHARACTERISTICS OF PARENTS OF 
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MMPI’ of parents of neurotic, aggressive, and normal 9- to 11-year-old boys 
were compared, using two of Achenbach’s factors for classification of child 
psychiatric patients. Twenty-three couples, parents of aggressive boys, were 
more deviant than 29 couples, parents of neurotic boys. Experimental parents 
were more deviant than 50 control couples, parents of normal boys. Fathers 
of aggressive boys were higher on Pd than either fathers of neurotic or control 
boys. Fathers of neurotic boys were higher on K. Mothers of aggressive boys 
had more low point Mf. More neurotic boys were oldest and more aggressive 


boys middle children. 


Interest in parent personality patterns asso- 
ciated with behavior problems in children has 
increased as diagnosticians and therapists 
have tried to understand and treat other 
family members who may contribute to emo- 
tional disorders in children. Although there 
is considerable research involving parents and 
children, Butcher (1969) points out that 
most studies dealing with whole families have 
not been well controlled for important S vari- 
ables. In addition, not many studies are com- 
parable because of the wide variety of re- 
search methods used. Some conclusions can 
be drawn, however. Parents of disturbed 
children are more disturbed than nonclinic 
parents, but less deviant than psychiatrically 
disturbed adults. One of the most specific 
findings is that there appears to be a strong 
relationship between aggressive, openly puni- 
tive parental behavior and childhood aggres- 
sion, particularly in father-son relationships. 
For example, Butcher (1966) found fathers 
of adolescent aggressive boys had significantly 
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higher elevations on the Pd scale of the 
MMPI. 

A major problem in “whole family” re- 
search is that of classifying Ss in some mean- 
ingful way. One method of study is to cate- 
gorize the children according to symptoms, 
traditional psychiatric diagnoses, or some 
other criteria, and then examine the parents 
to see if there are meaningful personality pat- 
terns associated with a given category in the 
child. The traditional psychiatric diagnoses 
do not satisfy the research requirements of 
objectivity and reproducibility, however. 
Therefore a different system of classification 
of child psychiatric patients is needed. 

The externalizer-internalizer dimension for 
describing adult psychiatric patients pro- 
posed by Philips and Zigler (1961) was 
found reliable in classification of patients 
into “externalizers’ who “act out” their 
problems, often in antisocial behavior, and 
“internalizers,” who resort to fantasy. Achen- 
bach (1966), working with childhood behavior 
problems, found an externalizer-internalizer 
dichotomy similar to that proposed by Philips 
and Zigler. Using clusters of symptoms, he 
was able to classify some boys, for example, 
as “aggressive” or “externalizers,” who were 
characterized by antisocial, acting-out behav- 
ior; other boys could be classified as “obses- 
sive-compulsive-phobic” or “internalizers.” 

This classification into externalizers and 
internalizers appears applicable to both the 
children and adults in families, and much 
existing research into the interaction of 


575 


576 


parent and child personality could be viewed 
from the standpoint of these two categories. 
For example, the typical parent of a delin- 
quent child might be expected to be an 
“externalizer” since he tends to act out by 
using physical punishment and behaving 
in a generally openly aggressive fashion 
(Bandura & Walters, 1959; Glueck & Glueck, 
1950). 

The purpose of this study was to examine 
relationships between parents’ personality and 
specific behavioral problems in their children. 
The primary goal was to study personality 
characteristics of parents of two well-defined 
and homogeneous clinical populations, “ag- 
gressive” (A) and “neurotic” (N) boys, fol- 
lowing Achenbach’s factors (1966). Back- 
ground data selected for study were age of 
parents, occupation and education of fathers, 
size of family, birth order of boy, and IQ 
of clinic boys. 


METHOD 
Subjects 


Experimental Ss were parents of 9- to 11-year-old 
Caucasian boys who had been patients at four psy- 
chiatric facilities. Only those parents were included 
who were married, living together, and were natural 
parents of the boys. The MMPI had been adminis- 
tered to all of these parents as part of initial 
diagnostic procedures. 

In addition, the boys whose parents were included 
in the study had to meet several other criteria. 
Their age was between 9 and 11 years inclusive at 
time of initial contact with the family. IQ had to 
be 90 or above, or in a few cases where no IQ 
was available, there was evidence of at least average 
grades in school. There was no evidence of brain 
damage or severe physical disabilities, Initial survey 
of the files resulted in a sample of 395 sets of parents 
whose sons met the above criteria. Of these, 29 pairs 
fit the experimental A classification, and 23 pairs 
were experimental N. 

Control Ss were 50 pairs of parents of Caucasian 
boys in regular (not “special”) fourth, fifth, and 
sixth grades.* All these parents were married, living 
together, the natural parents of the boys, and pre- 
sumably psychologically healthy, 


Measures 


Measures used in the study were the MMPI, a 
background data sheet for clinic families, a less de- 


8 University of Minnesota Hospital Psychiatric 
Department, Institute of Child Development Clinic, 
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tailed background data sheet for control families, and 
a 91-symptom check list developed by Achenbach 
(1966). In addition, Achenbach’s symptom lists for 
his “aggressive behavior” and “obsessive-compulsive- 
phobic” rotated factors were used to determine the 
classification of Ss—labeled here experimental A or 
aggressive and experimental N or neurotic, respec- 
tively. These categories were chosen rather than 
externalizers or internalizers because they appeared 
to represent the extreme end points of Achenbach’s 
internalizer-externalizer first principal factor. It was 
hoped this restriction would minimize the chance of 
overlapping symptoms in the children, and thereby 
insure more distinct, homogeneous parent groups. 
(It should be noted all the aggressive boys in this 
study would also be classified as externalizers, and 
the neurotics as internalizers.) 

MMPI scores and background data for experi- 
mental Ss were obtained from clinic files in which 
initial diagnostic work had been completed. The 
child’s presenting complaints were checked off on 
the 91-symptom check list according to the pro- 
cedure devised by Achenbach. Psychologists’ and 
psychiatrists’ reports were referred to only for intel- 
lectual evaluations and tests for organicity. If 60% 
of the symptoms coincided with experimental List A 
or List N, they were labeled accordingly. (‘The 
researcher was the sole judge, since previous studies 
showed an average interjudge reliability of 80%.) 
The control group of parents were all volunteers 
who completed the MMPI and background data 
sheet in supervised group testing, Each couple was 
also contacted individually in an effort to elicit any 
possible abnormalities in parent or child which might 
not meet conditions of selection. 


RESULTS 


Background data analysis (using a chi- 
square test of proportions) showed no signifi- 
cant differences except in birth order of the 
clinic child, with neurotic boys tending to be 
firstborn and aggressive boys middle children 
($ < 01). 

The parent MMPI’s were compared by a 
multivariate analysis of variance, an analy- 
sis of frequency of high and low points in 
individual K-corrected profiles, inspection of 
group mean profiles, and an analysis of con- 
figural differences between Hy-Pd and Pd-Mf 
scale scores. Individual MMPI’s were ex- 
cluded where scales ? or L were greater or 
equal to a T score of 70, or where the F or K 
scales were greater or equal to a T score of 75. 
Mean T scores for the six groups are given 
in Table 1 and group mean profiles are 
shown in Figure 1. 

A multivariate analysis of variance was 
performed using raw scores. (This method of 
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TABLE 1 
Mean MMPI T SCORES FOR EXPERIMENTAL A, EXPERIMENTAL N, AND CONTROL MOTHERS AND FATHERS 
Experimental A Experimental N oh Control 
Scale 
Mothers Fathers Mothers Fathers Mothers Fathers 
L 50 46 50 50 50 46 
F 53 53 50 50 50 50 
K 55 53 59 61 59 55 
Hs 56 54 52 52 52 50 
D 55 56 57 51 51 53 
Hy 61 56 59 60 56 56 
Pa 57 64 56 57 55 50 
Mf 43 54 47 57 47 57 
Pa 59 53 56 53 56 53 
Pt 56 54 55 52 56 54 
Sc 57 55 54 51 54 51 
Ma 53 55 45 53 50 55 
Si 54 52 51 48 53 52 


analysis did not permit corrections for K and mean MMPI scale scores nested in 13 X 13 
sex differences included in the usual T-score matrices within each cell, the final Fs based 
profiles.) There were six cells with the 13 on determinant ratios. 


O 0 
O Ww 


K Corrected Score 
a 
a 


LF K Hs D Hy Pd Mf Pa Pt Sc Ma SI 
Scales 


Mean K- Corrected profiles +—--x Exp.A Fathers 
o—=——o Exp.N Fathers — Control Fathers 


K Corrected Score 


LF K Hs D Hy Pd Mf Pa Pt Sc Ma Si 
Scales 


Mean K - Corrected profiles *---* Exp. A Mothers 
o———oExp.N Mothers —— Control Mothers 


Fic. 1. Profiles of fathers and mothers. 
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TABLE 2 


RESULTS OF MULTIVARIATE ANALYSIS OF VARIANCE OF 
Mean MMPI Raw Scores, EXPERIMENTAL A, 
EXPERIMENTAL N, AND CONTROL 
MOTHERS AND FATHERS 


Source df |F value| value 
Grand mean 13/186 | 2299.12 | <.000000001 
Sex 13/186 | 39.09 | <.000000001 
Treatment® 
Controls vs. 13/186 3.07 | <.001 
Experimental 
Experimental A ys. 
Experimental N | 13/186 2.75 | <.01 
Sex X Treatment 26/372 1.05} .40 (ns) 


"Overall treatment effect is average of two treatment 
analyses given, 


In addition to the grand mean, sources of 
varation tested were sex, treatment, and the 
Sex X Treatment interaction. Treatment ef- 
fects were broken into two parts; controls 
versus experimental N and A combined 
(ie., controls versus experimental) and ex- 
perimental N versus experimental A. 

The results of the multivariate analysis are 
given in Table 2. Only the interaction effect, 
Sex X Treatment, was not significant, 


Univariate ANOVAs 


Univariate tests within treatment, sex, and 
Sex X Treatment main effects were also per- 
formed, as shown in Table 3. None of the 
interaction effects was significant. 

Seven of the univariate tests for sex reached 
significance. Most of the variation resulted 
from using raw scores in the analysis, since 
no allowance had been made for the known 
differences in male and female T scores. On 
scales Pa and Ma, however, a significantly 
different response between males and females 
was found, where such differences are not 
normally expected. In the absence of signifi- 
cant interaction effects, it appeared this dif- 
ference extended to all three groups, experi- 
mental and control both. Females were uni- 
formly higher than males on the Pa scale 
and males were higher than females on the 
Ma scale. ; 

Significant differences between the experi- 
mental groups combined versus controls were 
found on scales Hs, D, Hy, and Pd, with the 
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experimental group more deviant on each of 
these scales. 

Significant treatment effects for experimen- 
tal A versus experimental N parents were 
found on scales K, Hs, Pd, Pt, Sc, and Ma. 
On K, experimental N parents scored higher, 
but on all the rest, experimental A parents 
scored higher, in the more deviant direction, 


Frequencies of MMPI Scales above T = 70 


The next analysis involved computing fre- 
quencies of individual MMPI scores equal to 
or greater than a T score of 70, in order to 
study relative group differences in elevation of 
each scale by means of the chi-square test of 
proportions. This was done by counting the 
number of high points. A high point is here 
defined as any scale elevation greater or equal 
to T score = 70, For example, one profile (an 
experimental A mother) was coded 49 78' 
10-3 26/5. This would be counted as four 
high points. Mothers and fathers were ana- 
lyzed separately. None of the analyses for the 
mothers was significant. The proportions of 
high point codes K and Pd were significantly 
different for fathers, however. Experimental 
N fathers had more high point K (p < .01) 


TABLE 3 


Resvits or MMPI Inpivipvat Scare ANOVAs, 
EXPERIMENTAL A, EXPERIMENTAL N, AND 
CONTROL MOTHERS AND FATHERS 


Treatment (Rows) Sex (Columns) 


Avs. N Cvs.A+N 


(af = 13/186) (af = 13/186)| @/ = 13/186) 


Scale 

L ns ns ns 

F ns ns ns 

K |F =1247 ns ns 

Hs | F = 1214*** | F = 5.25* F = 5,84* 

D ns F = 10.39** | F = 24,247* 
Hy ns F = 11.55** | F = 23.38**** 
Pd |F = 8.59 F = 16.77*** ns 

Mf ns ns F = 472,00" 
Pa ns ns F = 7.22** 
Pi |F = 7.36" ns F=7.71* 
Sc | F = 13,97#* ns ns 

Ma |F =9.67** ns F = 6,73* 

Si ns ns ns 


Note.—None of the interaction effects were significant. 
*p <.05. 
FD <.01, 
> <.001. 
+ b < .00001. 
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and Experimental A fathers had more high 
point Pd (p < .001). 


Frequencies of “Low Mf” in Mothers 


“Low Mf’ has come to have a special 
meaning clinically for females, particularly in 
combination with high elevations in other 
scales. (It refers to a High score in the direc- 
tion of femininity.) For this analysis, a 
female profile so classified was one in which 
Mf was the lowest clinical scale, as plotted 
on the standard K-corrected profile. The 
experimental A mothers had significantly more 
low Mf profiles than either the experimental N 
or the control mothers ($ < .05). 


Configural Differences 


Configural differences in individual scores 
on Hy and Pd were analyzed by subtracting 
the K-corrected score on Pd from Hy for each 
parent. The Pd minus Mf difference score 
was obtained likewise. The distribution of 
both these differences appeared markedly 
skewed for fathers in all three groups (in 
the direction of greater differences), so 
median rather than mean values were calcu- 
lated. The fathers and mothers were then 
compared separately using the Extension of 
the Median Test (Siegel, 1956). 

Tests on both Hy-Pd and Pd-Mf differences 
were not significant for mothers. Fathers were 
significantly different on Hy-Pd at the .05 
level, with the experimental A fathers higher 
on Pd than Hy, while both experimental N 
and control fathers were lower on Pd than Hy. 
The Pd-Mf differences were even greater (at 
the .001 level), with the experimental A and 
experimental N fathers having a higher Pd 
than Mf score, while the control fathers 
Were reversed in direction, with Mf higher 
than Pd, 


Summary of Results of MMPI Testing 


To summarize results for fathers, experi- 
Mental A fathers scored higher than either 
experimental N or control fathers on Pd and 
C. Experimental N fathers scored lower than 
either experimental A or control fathers on 
Pt but higher than either experimental A 
or control fathers on K. Experimental A 
fathers were higher on Pd than Hy using 
median difference scores, while experimental 
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N and control fathers were higher on Hy 
than Pd. Both experimental A and experi- 
mental N fathers were higher on Pd than Mf 
while control fathers were higher on Mf than 
Pd. The configural analyses confirmed the 
generally higher scores on Pd in the experi- 
mental A fathers, both absolutely and rela- 
tively to other scales. 

Experimental A mothers scored lower than 
either experimental N or control mothers on 
K. They scored higher on Scales Hs, Pd, Pt, 
Sc, and Ma than either experimental N or 
control mothers. They had more low point 
Mf than either experimental N or control 
mothers. Both experimental A and experi- 
mental N mothers scored higher than controls 
on Scales D and Hy. The mothers in all three 
groups had much more similar group profiles 
than did the fathers. 


Discussion 


Although the experimental A parents had 
the lowest socioeconomic status (SES) rating, 
they were not significantly different from the 
other two groups, as had been expected. This 
was contrary to the findings of the Philips 
and Zigler (1961) and Achenbach (1966) 
studies, The three groups were comparable on 
age of parents, age of children, and number 
of children, as well as SES. 

The only demographic characteristic on 
which the clinic boys differed significantly was 
birth order. Neurotic boys tended to be first- 
borns, and aggressive boys, middle children. 
Sampson’s (1965) review of the research in 
ordinal position presented very little compa- 
rable empirical data. Sampson concluded, 
however, that laboratory studies of aggres- 
sion in younger children, such as those by 
Sears (1951), Haeberle (1958), and Mac- 
Farlane (1954), show that firstborn males 
are somewhat less aggressive than second- 
born males. The findings of the present study 
support these conclusions. 

Previous research (Butcher, 1969) sug- 
gested that experimental parents combined 
would be more deviant than controls and 
that experimental A parents would be more 
deviant than experimental N parents. Both 
these expectations were fulfilled. 

The greater deviance of the experimental A 
fathers, particularly on the Pd scale, is the 
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most striking finding in this study. In each 
statistical analysis, the Pd scale is markedly 
higher in this group, indicating the same kind 
of difficulty with control of overt aggression 
previously noted by Butcher (1966). The 
other notable scale elevation in the experi- 
mental A fathers was on Sc. Together with 
the high Pd, this indicates a group with poor 
impulse control and inability to tolerate 
meaningful close relationships, remaining cold, 
distant, hostile, and unpredictable. 

The present investigation did not attempt 
in any way to directly examine parent and 
child identification; nevertheless, the symp- 
toms in Achenbach’s list of “aggressive” traits 
in boys such as temper tantrums, stealing, 
lying, and inadequate guilt feelings, appear 
very similar to the antisocial traits associated 
with elevation on the Pd scale. 

The most notable finding in the experi- 
mental N fathers was the relatively high 
elevation on the K scale. Since the three 
groups were roughly comparable in back- 
ground, this discrepancy cannot be attributed 
to differences in education or socioeconomic 
level. The experimental N fathers appear rela- 
tively more defensive as a group and less apt 
to admit shortcomings, This group may feel 
particularly unable to admit feelings of anx- 
iety, or other “neurotic” symptoms. In other 
respects, the neurotic fathers appeared very 
similar to the control fathers, 

Both experimental and control mothers fol- 
lowed the characteristic pattern of elevations 
on scales Hy, Pd, and Pa with relatively low 
Mf, previously found (e.g., Hanvik, 1959) to 
be characteristic of the typical mother of a 
child brought for counseling. The three groups 
differed relatively little from each other in 
configuration. The best explanation for this 
is that the mothers were not as important 
in influencing the development of character- 
istic pathology in their sons as were the 
fathers, which would agree with the usual 
conclusions of identity theory. The experi- 
mental A mothers did appear more deviant 
than either experimental N or control 
mothers, however, even though their group 
Profile was similar. They were significantly 
higher on Pd and Sc, as were the experi- 
mental A fathers. It appears they also were 
hostile and had inadequate impulse control, 
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but their very low Mf suggests their hostility 
would be expressed by passive-aggressive 
maneuvers rather than directly. 

The degree of pathology of experimental A 
parents is probably limited in this research 
by the requirement that parents be married 
and living together. As Loeb (1966) points 
out, parental separation and divorce is related 
to greater delinquent tendencies in children. 
(Children of divorced and separated parents 
had many more problems including aggres- 
sion or acting-out behavior (64%) than chil- 
dren of continuously married parents (23%) 
($ < .001).) Although previous studies have 
found relatively greater pathology in mothers 
of “externalizers” or “acting-out” children 
(Cummings, 1967; Liverant, 1959; Rexford, 
1966), this may well have resulted from the 
preponderance of mothers without husbands 
in these studies. 

The most important clinical implication in 
this research is the necessity of including the 
father in assessment and treatment of boys, 
particularly aggressive boys. Also indicated 
is the importance of including fathers in 
further research in the area of parent-child 
personality interactions. 
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LOCUS OF CONTROL IN PSYCHIATRIC PATIENTS 


MARTIN HARROW ! anp ANTHONY FERRANTE 
Yale University 


Acute psychiatric inpatients (V = 128) were administered the I-E scale during 
their first and seventh weeks of hospitalization. The overall sample’s scores at 
Time Period 1 were within the average range found for nonpatient samples, 
Schizophrenics, however, were more external than nonschizophrenics (p < .02). 
Older patients were more internal (p < .05). I-E scores correlated significantly 
with scores on self-confidence and frustration. The results, with an acute 
sample, suggest that patients with greater psychopathology and fewer social 
skills (schizophrenics, younger patients, and to a slight extent, males) are more 
external. At Week 7, after clinical improvement, the overall sample’s I-E scores 
had not changed significantly, Depressives, however, became more internal 
(p < .02). Schizophrenics differed from nonschizophrenics on change scores 


(p < .001). Female patients became more internal (p < .01), 


An important dimension of personality 
which has been subject to increasing atten- 
tion during the last 10 years is the variable 
called locus of control (Lefcourt, 1966; 
Rotter, 1966; Rotter, Seeman, & Liverant, 
1962). Locus of control, a construct generated 
within Rotter’s social learning theory, refers 
to the extent to which an individual perceives 
events in his life as being a consequence of 
his own actions, and thus under his personal 
control. It is assessed in terms of whether 
one believes that events in peoples’ lives 
result from their own efforts, skills, and inter- 
nal dispositions (internal control), or stem 
from external forces such as luck, chance, fate, 
or powerful others (external control). 

This dimension of an individual’s orienta- 
tion toward the world has been investigated 
in many different normal populations and 
found to relate to social and behavioral fac- 
tors (Gore & Rotter, 1963; Strickland, 1965). 
Externality is clearly associated with ethnic 
background and socioeconomic and educa- 
tional levels and related to higher levels of 
conformity and passivity (Battle & Rotter, 
1963; Crowne & Liverant, 1963; Graves, 
1961). Conversely, internality has been 
thought to be related in curvilinear fashion 
to ego control (James, 1957). 

Although there has been extensive interest 
in broadening knowledge about locus of con- 


1 Requests for reprints should be sent to Martin 
Harrow, Department of Psychiatry, School of Medi- 
cine, Yale University, 333 Cedar Street, New Haven, 
Connecticut 06510, 


trol, little research has been conducted with 
psychiatric patients. It would seem important 
to determine the extent to which individuals 
with severe behavior disorders feel that events 
in their lives are a consequence of their 
own behavior. In addition, it would seem 
important to investigate whether various kinds 
of severely disturbed people (e.g., schizo- 
phrenics, depressives, character disorders) 
differ in the degree to which they perceive 
what happens to them as resulting from their 
own actions, and whether their attitudes in 
this area change during the course of treat- 
ment. One study which did use schizophrenic 
Ss found that their attitudes about personal 
control were external (Cromwell, Rosenthal, 
Shakow, & Zahn, 1961). This latter research, 
however, did not explore whether an external 
attitude is unique among schizophrenic pa- 
tients or is a characteristic of all “mental” 
patients regardless of diagnosis. In addition, 
the relationship between locus of control in 
“mental” patients, and their age, sex, and 
level of chronicity were not explored, even 
though these latter factors may be important 
influences. A comprehensive investigation of 
the locus of control construct in psychiatric 
Patients would be of interest to both the 
personality theoretician and the clinician. 
The present research, part of a larger study 
of the personality profiles of psychiatric pa- 
tients, presents a systematic investigation of 
attitudes about locus of control in a large 
sample of hospitalized patients. In this paper 
We are concerned with the following questions: 
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Locus or CONTROL In PSYCHIATRIC PATIENTS 


General Characteristics of Locus of Control 
in Psychiatric Patients 


1. What is the distribution of mental pa- 
tients on the Internal/External continuum, 
and how does it relate to previously collected 
samples of “normals”? 

2. Is there a relationship between these 
scores and diagnoses? 

3. Are these scores related to age or sex? 


Changes of Locus of Control Scores following 
7 Weeks of Hospitalization 


1. Do psychiatric patients become more 
internal as their manifest pathology de- 
creases? 

2. If there are any changes in internality 
or externality, are they related to the type 
of disorder of the patients (their formal 
diagnosis) ? 

3. Are changes in patients’ attitudes about 
locus of control related to age or sex? 


METHOD 
Patient Population 


The Ss were 128 consecutive admissions to a short- 
term, psychiatric inpatient service. The patients rep- 
resented the major types of psychiatric disorders 
necessitating hospitalization (40 schizophrenics, 41 
depressives, 34 character disorders, 5 manics, and 8 
of varied diagnoses). Since the sample was later 
subdivided and examined separately according to 
diagnosis, special attention was devoted to this factor, 
with all patients being diagnosed according to stan- 
dard psychiatric nomenclature. The diagnosis used 
for each patient was arrived at by consensus of two 
experienced clinicians (a board-certified psychiatrist 
and a clinical psychologist), each with over 6 years’ 
experience in the field. 

Demographically the patient sample was typical 
of the populations of many short-term, private psy- 
chiatric hospitals, with the major deviation from 
the usual population norms being a greater propor- 
tion of upper-middle-class and female patients. 
Thus, there were 45 males and 83 females among the 
Patient population. The mean age of the sample was 
31.2 years and the range was 12-72 years. The mean 
educational level of the sample was 14.0 years and 
the range was 7-20 years. 


Se etting 


The research was carried out on the Acute Psychi- 
atric Inpatient Service at Yale-New Haven Hospital. 
ine unit relies heavily on milieu, group, and family 

erapy, with frequent use of ataractic medications 
ae Harrow, Becker, Schwartz, & Miller, 
19 ; Harrow, Astrachan, Becker, Detre, & Schwartz 

67). The staff attempts to foster rapid social 
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telearning, control of symptoms, responsibility for 
behavior, and a more independent life adjustment 
after discharge (Norton, Detre, & Jarecki, 1963). 

The overall orientation of the staff, and thus of 
the service, is an internal one, with emphasis on 
patients accepting responsibility. In this group- 
oriented ward, a high premium is placed on patients 
becoming active and responsible group participants, 
and members of the patient community. Within the 
framework of a therapeutic community, clinical im- 
provement in a patient’s condition is accompanied 
by increasing responsibility for oneself and for help- 
ing other disturbed patients. The primary therapeutic 
goal of the unit is the rapid return of the patient 
to the community with a reduction in symptomatol- 
ogy and the restoration of ability to function in the 
family, with peers, and at work (Detre, Kessler, & 
Jarecki, 1963). 


Data Collection 


The instrument for measuring locus of control was 
the Internal/External (I-E) scale (Rotter, 1966), 
which is a 29-item forced-choice scale with six buffer 
items. The 23 critical test items attempt to measure 
the degree that S perceives events in his life as 
being a consequence of his own actions, or beyond 
his personal control. 

The questionnaire was administered to all patients 
during their first week of hospitalization.2 Of the 
128 patients who took the test during the first time 
period, 88 were still in the hospital 6 weeks later, 
at which point they were administered the test 
again for the second time. 

The I-E test was one of a number of personality, 
attitude, and performance tests routinely adminis- 
tered to all patients at prescribed time periods over 
8 weeks. In addition to the test material obtained 
from the patients, extensive daily and weekly ratings 
of all patients’ clinical conditions and behavior were 
collected routinely from the patients’ therapists and 
from other staff observers. The testing program, and 
the present study in particular, are part of an ongoing 
research project investigating the relationship between 
personality factors and behavior disorders. 


RESULTS AND Discussion 


Initial Scores: Overall Sample 


In regard to the data on the patient sample 
as a whole, the initial scores were expected to 
be external because of the following factors: 
(a) a reaction by the patients to acute psychi- 
atric symptoms which infringe upon their 
sense of mastery, and thus may lead to exter- 

2Several patients were discharged or transferred 
during the first week, before data could be collected 
from them. Several others, who were too dis- 
organized to be tested at Week 1, were tested for 
the first time during the second to fourth week, 
and then for a second time 6 weeks later. Their 
data are included in the present sample. 
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TABLE 1 
Mean Scores on Locus or CONTROL 
AT Time Pertop 1 
t 
Group M N j 
Schiz- IMfanics» 
ophrenics*) 
Total sample 8.70 | 128 — = 
Schizophrenics 10.07 | 40 — 2.70* 
Depressives 8.27 | 41 1.93 2.45* 
Character disorders | 8.06] 34 1,91 1.96 
Manics 4,20 5 2.70* = 
Others 9.50 8 32 2.76" 


*The ¢ scores represent comparisons between the schizo- 
pupae subgroup and each of the other diagnostic groups 
isted, 
b The # scores represent comparisons between the manic 
subgroup eet each of the other diagnostic groups listed. 
p <.05. 


nality; (b) acute mental hospitalization which 
is usually at least partly coercive and thus 
not completely under the patients’ own con- 
trol; and (c) differences between these hos- 
pitalized patients and normal samples on 
long-standing factors such as personality dif- 
ficulties and susceptibility to upsetting events, 
both of which would tend to give the patients 
a diminished sense of mastery and a dimin- 
ished sense of control over their own fates. 

The actual results presented in Table 1 
show an initial mean locus of control score 
of 8.70 for the entire group of patients, This 
score lies within the range of mean scores 
presented by Rotter (1966) for several non- 
psychiatric samples. To check this further, 
the data from the current psychiatric patients 
were compared with those presented by 
Rotter (1966) from his sample of 1,180 Ohio 
State University students (mean I-E score of 
students = 8.29), and there were no signifi- 
cant differences (¢ = 1.10, p > .20). It must, 
of course, be kept in mind that the present 
patient sample is predominantly upper middle 
class with above-average intelligence (their 
mean IQ, based on the WAIS Information 
subtest, falls in the lower part of the 
superior range of intelligence), and samples 
of this type typically have been slightly more 
internal. 

Despite these considerations, the data re- 
ported in Table 1 indicate that this group of 
hospitalized psychiatric patients does not dif- 
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fer significantly from nonpsychiatric samples, 
Thus, the results would seem to suggest that 
the above factors associated with psychiatric 
hospitalization do not influence locus of con- 
trol in an external direction for the patient 
population, considered as a whole. It is still 
possible, however, that the above factors have 
an external influence, but that the effect is not 
uniform for the entire sample, and instead 
only influences subgroups of this population. 

This possibility is investigated in the suc- 
ceeding sections. 


Initial Scores: Categorized According to 
Diagnosis 

The data on locus of control for the various 
diagnostic groups also are presented in Table 
1. The results fit a theoretically consistent 
pattern if one considers the premorbid his- 
tory of many patients with these types of 
disorders and how their history (in terms of 
success or failure in dealing with environ- 
mental events) would be expected to influence 
the extent of externality. 

The schizophrenic subgroup was the most 
external and the mean score of these patients 
(10.07) was significantly more external than 
the mean (8.07) for the total sample of non- 
schizophrenic patients (¢ = 2.51, df = 126, p 
< .05). Similar results (in terms of schizo- 
phrenics being external) were obtained by 
Cromwell et al. (1961), although their 
schizophrenic sample was not compared to a 
sample of nonschizophrenic psychiatric pa- 
tients. Looked at in terms of their history, 
the schizophrenic sample should contain a 
greater percentage of patients who have had 
long-standing difficulties in adjustment and 
have experienced chronic disappointment. 
Furthermore, this disorder includes symptoms 
such as delusions and paranoid ideation which 
are threatening to others, produce greater 
social stigma, and may lead to a sense of 
Personal futility, with some resulting ex- 
ternality. 

The depressives are a group with a presum- 
ably better premorbid adjustment, and a less- 
chronic, long-term disorder than the schizo- 
phrenics. Although their current depressive 
illness (with symptoms such as worthlessness 
and self-depreciation) may have influenced 
the scores in the external direction, their 
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scores tend toward the internal as compared to 
schizophrenics (p < .10, see Table 1). 

The patients with character disorders may 
have some long-standing personality diffi- 
culties, but these are generally ego syntonic, 
and many of these patients have performed 
in a competent manner in the social and oc- 
cupational areas over many years. A large 
number of these patients have had difficulty 
in only one or two select areas, often related 
to their life styles. Despite their admission to 
the hospital, arising from some crisis which 
they were presumably unable to handle, these 
patients still tended to be more internal than 
the schizophrenics (p < .10, Table 1). 

At the height of their disorder (which is 
frequently at the time of hospitalization), 
manic patients display grandiose thoughts 
regarding their own ability to successfully 
deal with life. Their extremely internal ori- 
entation (M = 4.20) is consistent with this 
temporary confidence in their own capacity to 
meet lifes challenges. The manic patients 
were significantly more internal than the 
total sample of nonmanic patients (£ = 2.44, 
df = 126, p < .05). 

The present data were obtained with 
largely acute, middle-class patients. It is quite 
possible that a sample of more chronic, less 
middle-class patients might have produced 
different (perhaps more external) results. 


Initial Scores: Age and Sex 


Table 2 presents the results when the total 
sample was categorized according to age and 
sex. Since the data on age and sex differed in 
several cases according to whether or not the 
Patients were schizophrenic, the results for 
the schizophrenic and nonschizophrenic sam- 
ples are presented separately in the table. 
When the data were analyzed according to age 
and sex, it was found that the younger pa- 
tients were significantly more external ($ 
<.05) and there was a nonsignificant trend 
for males to be more external than females. 
Although these tendencies held for the pa- 
tient population as a whole, there were some 
Complex interactions between age, sex, and 
diagnosis. 

Thus, while older patients were significantly 
More internal, this was largely due to the 
overwhelming trend toward greater internality 


TABLE 2 


Mean INITIAL SCORES on Locus or CONTROL 
ACCORDING TO AGE AND SEX 


Schizophrenics Nonschizophrenics 
Patients 
M N M N 
Younger* 9.75 24 9.30 39 
Older* 10.13 16 7.02 49 
Male 10.77 22 8.17 24 
Female 8.83 18 7.98 64 


Note,—Young patients versus old patients: = 2.24, p <.05. 
Young nonschizophrenics versus old nonschizophrenics: 
t = 2.71, p < .01, Males versus females: ¢ = 1.56. 

a The sample was divided arbitrarily into younger patients 
(24 years and younger) and older patients (25 years and 
older), 
among the older nonschizophrenic population 
as contrasted to the younger nonschizophrenic 
population. In many ways the scores for this 
older nonschizophrenic population of patients 
were most similar (generally internal) to what 
might be expected of a nondisturbed upper- 
middle-class population. This is not surprising 
since the older nonschizophrenic patients 
were generally less “sick” (via independent 
ratings of psychopathology by their psychia- 
trists and by nurses) than the younger or 
schizophrenic patients. In contrast, the 
younger patients, in addition to their greater 
pathology, frequently have fewer social skills, 
Perhaps of key importance, most of the older 
nonschizophrenic patients have a greater 
backlog of successful experiences. Thus, in a 
way, this subportion of the total hospital 
population contains the patients most similar 
to “normals” in terms of previous compe- 
tence and achievement. 

Despite the overall results of significantly 
greater externality among younger patients, 
this age-related trend was not found among 
the schizophrenic population. The older 
schizophrenics, who represented a more 
chronic population, were rather external. Most 
of these older schizophrenics have had long- 
standing difficulties in adjustment. It may be 
that among the older schizophrenics, with in- 
creasing age a backlog of adjustment diffi- 
culties and social disappointments have led to 
a sense of personal futility and have contrib- 
uted to their more external stance. 

It should be noted that the results on age, 
sex, and diagnosis fit a general pattern in 
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which the less competent patients with greater 
psychopathology and fewer social skills 
(schizophrenics, younger patients, and, to a 
slight extent, males) were the most external. 
This would generally fit a formulation in 
which general internal and social competence, 
and a backlog of successful experiences, would 
be one set of factors leading to increased in- 
ternality. Evidence to support this formula- 
tion was found when the present patient sam- 
ples’ initial I-E scores were correlated with 
their scores on a scale for self-confidence used 
previously by Brim, Glass, and Lavin (1962), 
and one related to frustration over previous 
failures used by Srole (1962). The correla- 
tion between the I-E scores and self-confi- 
dence scores was .23 (df= 121, p<.02), 
and that between the I-E scores and frustra- 
tion scores was 30 (df = 121, p < .01). 


Change Scores: Overall Sample 


It was expected that at Time Period 2 the 
patients would have changed in the direction 
of decreased externality. The above formula- 
tion is based on the hypothesis that as symp- 
tomatic improvement occurs, patients will 
become capable of greater mastery and feel 
more capable and adequate (Harrow, Fox, 
Markhus, Stillman, & Hallowell, 1968). This, 
in turn, could lead to increased internality. It 
is also possible that at a university teaching 
hospital, where the patients and staff have 
frequent regular contact, patients will identify 
to some degree with the professional staff’s 
more internal values. Other evidence (As- 
trachan, Harrow, & Flynn, 1968; As- 
trachan, Schwartz, Becker, & Harrow, 1967; 


TABLE 3 
CHANGE Scores on Locus or CONTROL 


Mean 
Group change N t 

Total sample — 44 88 —1.38 
Schizophrenics 12 29 1.65 
Depressives =R 25 —2.71* 
Character disorders | —1.22 23 —1.64 
Manics 3.00 5 1.89 
Others —3.17 6 —4,53** 


* Thet scores represent differences between the obi 
and zero tno ang! e obtained mean 
02, 


*p <.01. 
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Harrow, Astrachan, Becker, Miller, & 
Schwartz, 1967) indicates some acceptance of 
staff attitudes and values by patients in such 
a hospital setting. 

The mean change scores for patients ad- 
ministered the I-E test twice, and the tests 
of significance associated with these changes 
are presented in Table 3. There was a slight, 
nonsignificant, trend for the patient sample 
as a whole to change in the direction of in- 
creased internality during the 6-week period. 
The surprisingly small (nonsignificant) 
change, coupled with the relatively large size 
of the sample, lead to the rejection of the hy- 
pothesis of increased internality for the pa- 
tient sample considered as a whole. These re- 
sults are all the more surprising since recent 
data of Hersch and Scheibe (1967) suggest a 
trend toward increased internality on test- 
retest over time among a “normal” sample. 
This trend (probably related, in part, to the 
test instrument) should be kept in mind in 
considering the subsequent data presented on 
the patients’ change scores. 


Change Scores: Categorized according to 
Diagnosis 

When the patient sample is divided into 
the various diagnostic subgroups and the 
change scores analyzed separately for each 
group (Table 3), marked differences emerge. 

Contrary to the general trend toward in- 
ternality, the schizophrenics (whose scores at 
Time Period 1 were the most external) showed 
a nonsignificant trend toward increased ex- 
ternality. The change scores for the schizo- 
phrenic subgroup differed significantly from 
those of the rest of the patient population 
(t = 4.14, df = 86, p < .001). The absence of 
a change toward internality is not surprising 
when considered in conjunction with several 
features related to the schizophrenics’ clini- 
cal status and the outlook that their pathology 
could engender. In terms of the degree of 
pathology in this group at the second testing 
period, many of these patients would still be 
labeled as very “sick.” Thus, due to their 
clinical condition one might not expect to see 
the sense of increased mastery, potency, and 
subsequent increased internality which oc- 
curred in some other diagnostic groups. 
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In addition, despite some clinical improve- 
ment, the schizophrenics have less to look 
forward to. There is usually more pessimism 
concerning the prognosis of these patients 
than is the case with other diagnostic groups. 
This can be seen in the greater difficulty in 
arranging for the posthospital psychotherapy 
of schizophrenics, due to the greater reluc- 
tance of many outpatient therapists to treat 
them. 

In contrast to the schizophrenics, the de- 
pressive group showed a strong trend toward 
increased internality during the 6-week period 
(p< .05). These results indicate that as 
their depressive symptoms diminish, these 
patients become more internal. It is not sur- 
prising that the return to their premorbid 
levels of self-esteem and control are reflected 
in increased confidence in their ability to 
master the environment, and a resulting 
change toward increased internality. 

The character disorders also tended to be- 
come slightly more internal during the 6-week 
period. It would have been difficult to make 
a precise prediction concerning the change 
Scores for this group since they represent a 
heterogeneous set of character types and a 
mixed group of problems. On a post hoc basis 
it might be noted that while much of their 
character pathology remains after hospital 
treatment, at least the acute reason for their 
hospitalization has been relieved and they are 
in the process of returning to their previous, 
moderately successful, adaptation. 

In general, the nonschizophrenic patients 
(other than manic patients) tended to become 
More internal as they improved during the 
6-week period between testings. These results 
fit the overall picture of increased internality 
with an increasing sense of competence (or 
decreasing incompetence), and this is in ac- 
cord with the earlier hypothesis. 

The tendency for the manic patients to be- 
Come more external (Table 3) is in accord 
with clinicians’ observations of these patients. 

uring hospitalization they recover from their 
original grandiosity concerning their plans, 
Capabilities, and futures, and become more 
Cognizant of their limitations. The external 
shift toward more “normal” locus of control 
is fully in accord with this increased realism. 
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TABLE 4 


CHANGE Scores on Locus or CONTROL 
ACCORDING TO AGE AND SEX 


Schizophrenics Nonschizophrenics 
Patients 
M N th M N te 
Youngers | 83 | 18 | 1.57 | —1.00 | 27 | —1.52 
Olders +36 | 11 46. | —1.00 | 32 | -1,98 
Male 1.29 | 17 | 2.48" | —.06] 17 | —.07 
Female | —.25 | 12 | —.35 | -1.38| 42 | —3.28* 


Note.—Young patients versus old patients: £ = .61, Male 
patients versus female patients: ¢ = 2.80, p <.01, Young 
males versus young females: ¢ = 3.30, p <.01, 

«The sample was divided arbitrarily into younger patients 
(24 years and younger) and older patients (25 years and older). 

b The £ scores represent differences between the obtained 
mean and ae (no change). 

<. 


py 20l 
Change Scores: Age and Sex 


The patient sample also was categorized 
according to age and sex to determine whether 
there were any relationships between these 
factors and the extent of change in locus of 
control during the 6-week period. The results 
again are presented separately for the schizo- 
phrenic and nonschizophrenic patients. There 
were no significant differences between younger 
and older patients on the change scores. 

The results on sex (Table 4) showed a gen- 
eral trend for a change toward internality 
among the females (¢ = 3.09, df = 53, p< 
.01) with a mixed picture among the males, 
who tended to become slightly more external. 
The difference between the male and female 
change scores was significant (t = 2.86, df = 
86, p< .01). The greatest sex difference oc- 
curred when the group of patients under 25 
years of age were compared according to sex 
(= 3.32, df =43, p< .01). The younger 
females showed a strong trend toward inter- 
nality (p< .01), while the younger males 
tended to become somewhat more external (p 
< .10). Despite the young males’ clinical im- 
provement during the 6-week period between 
testings, many were preparing to face again a 
society where much is demanded of them. 
Thus, many upper-middle-class males feel 
strong pressures to achieve (due to the expec- 
tations of friends and relatives, and to their 
own aspirations). These pressures, coupled 
with their own uncertainty over whether they 
are able to meet the pressures successfully, 
could easily arouse potential feelings of frus- 
tration and powerlessness. 
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The younger females also experience similar 
but less intense pressures. Many do not feel 
as much social pressure to take the initiative 
and succeed, Thus, a number may have felt 
more competent than the males to meet the 
social challenges they will face on discharge, 
with resulting increased internality. 


Test-Retest Reliability 


It should be noted that while the data for 
a number of subgroups showed some change 
over time, the patients’ scores at Time Period 
1 (within a week after acute hospitalization) 
were closely related to their scores at Time 
Period 2 (6 weeks later, when the symptoma- 
tology had been drastically reduced). In terms 
of reliability, the test-retest correlation was 
-75 (df = 86) for the current sample of “men- 
tal” patients. This compares favorably with 
results obtained with more “normal” college 
students. Thus the test-retest correlations were 
-55 and .72 for two samples of students pre- 
sented by Rotter (1966), and the test-retest 
coefficients ranged from .47 to .84 for 10 
groups of college students tested by Hersch 
and Scheibe (1967). 

Further subdivision of the sample into dis- 
crete types of patients showed similar results. 
The test-retest correlation for the nonschizo- 
phrenic patients was .67 (df = 58), and that 
for the schizophrenic patients was .87 (df = 
26). Similarly the test-retest correlation for 
the older patients was .75 (df = 41), and for 
the younger patients was .73 (df = 43). Fi- 
nally, the test-retest correlation for male pa- 
tients was .79 (df = 32), and that for female 
patients was .74 (df = 52). 

These high test-retest reliabilities and the 
comparative closeness of the scores at the two 
time periods indicate that while acute psychi- 
atric hospitalization and acute symptoma- 
tology influenced some of the scores in a more 
external or internal direction, other, perhaps 
more basic, characteristics were major de- 
terminants of the scores for most of the pa- 
tients. 
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STUDY OF PSYCHOLOGICAL CONTENT IN THE 
COMMUNICATIONS OF SUICIDAL INDIVIDUALS * 
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A test was made of the general hypothesis that the psychological content of 
communications of individuals who commit suicide differs from that of non- 
suicidal normal individuals. A comparison was also made of suicide-threatening 
outpatients with suicide committers and nonsuicidal normals. The message form 
selected for analysis was the suicide note. The Ss were American-born, white, 
adult males matched for age and occupational level. Overall results were in 
support of the major hypothesis. The content of communications of suicide 
committers reflected heightened dependency needs, difficulty maintaining inter- 
personal relationships, a high degree of activity, and an expressive style of 
veiled aggression which includes hurting the self in order to hurt and manipu- 
late others. Suicide threateners were predictably in some ways like suicide 


committers and in certain other ways more like nonsuicidal normals. 


Every 30 minutes, a person in the United 
States kills himself. The shock wave of these 
events, probably more than with the experi- 
ence of any other human event, is consistently 
followed by the serious question, “Why?” 
The first attempt to deal with this question 
usually involves pointing to a precipitating 
stress just preceding the suicide. But students 
of suicide are becoming increasingly aware 
that the dynamics of suicide cannot be under- 
stood in terms of precipitating stress alone, 
and that there may be certain features of the 
suicide’s personality which characterize his 
self-destructive approach to life and to other 
human beings (Farberow & Shneidman, 1961). 

The primary purpose of the present study 
is to test the general hypothesis that the 
psychological content of communications of 
individuals who commit suicide differs from 
that of nonsuicidal individuals. This hypothe- 
sis is based on the assumption that the psy- 
chological content of communications of an 
individual is relatively stable and reflects his 
life style. This view has been expressed by 
several theorists in recent years, including 

1 This article is based on a dissertation submitted 
in partial fulfillment of the requirements for the 
degree of Doctor of Philosophy under the chairman- 
ship of Georgene Seward in the Department of Psy- 
chology at the University of Southern California. 

2 Also at Reiss-Davis Child Study Center, Los 
Angeles. Requests for reprints should be sent to 
Allen R. Darbonne, Veterans Administration Center, 
Central Research Unit, Building 13, Wilshire and 
Sawtelle Boulevards, Los Angeles, California 90073. 


Ruesch (1961), Bateson, Jackson, Haley, 
and Weakland (1956), and Sullivan (1953). 

The practical implication for suicide re- 
search from a communications point of view is 
that it reduces the distance usually found 
between theory and observations. Since a 
therapist’s primary observations are of verbal 
and nonverbal communications, and his pri- 
mary treatment method involves communica- 
tion, it seems reasonable that theory used by 
therapists should be related to these same 
operations. 

The second goal of the present study is to 
compare the content in communications of 
individuals who have threatened suicide with 
that of individuals who have actually com- 
mitted suicide and with that of nonsuicidal 
individuals, 

Adler’s (1958) theory of the life style of 
suicidal individuals was used as a framework 
from which 19 hypotheses (presented in the 
Results section) concerning the communica- 
tions content and style of suicidal individuals 
were derived. Adler’s theory was selected as 
the framework because of its comprehensive 
quality and its explicit statements regarding 
life style which could be easily rephrased in 
operational terms concerning communications 
style and content. 

These 19 hypotheses concerned predictions 
that the content of communications of suicide 
committers would reflect four general psycho- 
logical characteristics: (a) heightened depen- 
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dency needs, (b) low self-esteem and self- 
centered goals, (c) a high degree of activity, 
and (d) an expressive style of veiled aggres- 
sion which includes hurting the self in order 
to hurt and manipulate others. 

Thus, it was predicted that the messages of 
suicide committers would show more evidence 
of these variables than those of nonsuicidal 
normals. Further, it was expected that per- 
sons who threatened suicide would communi- 
cate in certain specific ways similar to suicide 
committers and in other specific ways similar 
to nonsuicidal normals. In other words the 
prediction here was that the suicide threat 
group would always fall between the commit- 
Suicide and the nonsuicidal groups relative to 
the observed frequency of each communica- 
tions content item. 


s 


MetHopD 


The message form which was selected to be ana- 
lyzed and compared was the suicide note. This was 
done because of the suicide note’s unique quality of 
being a message which is primarily structured by the 
Writer and is available from a large number of per- 
Sons who have taken their lives. Studies analyzing 
the content of suicide notes have demonstrated the 
| validity and reliability of the method. Shneidman 

and Farberow (1957b), who introduced the tech- 
nique of suicide note analysis, reported on a com- 
parison between genuine and simulated suicide notes 
in terms of Mowrer’s concepts of discomfort and 
relief, They found the discomfort-relief quotient 
(DRQ) interesting, but of limited value in differ- 
entiating suicidal and nonsuicidal individuals. This 
Was primarily attributed to the DRQ’s insensitivity 
_ to the quality of discomfort and relief. They also 
= feported that data analysis has shown no significant 
difference in economic, social-cultural, and personal 
factors between suicide committers who leave notes 

Í and those who do not. 
In another study, Shneidman and Farberow 
l (1957c) again compared the content of suicide notes 
and pseudosuicide notes. They concluded that sui- 
Cidal individuals use a type of logic which involves a 
fallacious identification. They term it a “psycho- 
Semantic fallacy.” This fallacious identification be- 
__tween the self as experienced by the self (Is) and 
the self as it feels itself experienced by others (Io) 
Tesults in the suicide’s acceptance of erroneous premi- 
Ses and invalid conclusions. One of these is the as- 
Sumption that the suicide committer will be able to 
experience after death the effect his expiration will 
lave on others. In a study concerning suicide and 
age, Farberow and Shneidman (1957) rated actual 
Suicide notes on the basis of Menninger’s (1938) 
Concept of the (a) wish to kill, (b) wish to be 
Killed, and (c) the wish to die. They found more 
intense interpersonal motives expressed in the notes 
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of younger suicides and more chronic depressive 
feelings expressed by older suicides. In an analysis of 
948 genuine suicide notes in Los Angeles County, 
Shneidman and Farberow (1960) found reasons for 
suicides, and affect expressed in notes differed sig- 
nificantly between nine sociological area types. 

Darbonne (1969) studied the age variable in re- 
lation to content of actual suicide notes and found 
significant differences between age groups. Osgood 
and Walker (1959) studied the effect of motivation 
level on language behavior, assuming actual suicidal 
individuals are under heightened motivation. They 
compared actual suicide notes, simulated suicide 
notes, and ordinary letters to friends and relatives. 
Tuckman, Kleiner, and Lavell (1959) investigated 
the emotional content of actual suicide notes and 
found that 50% of the notes contained positive af- 
fect, 25% expressed negative affect, and 25% were 
neutral in affect. All investigators using suicide notes 
have reported high reliability with this technique. 
In a validational study, Tuckman, Kleiner, and La- 
vell (1960) analyzed 63 suicide notes and found 
reasons for suicide stated in the notes in agreement 
with information of an informant in over 90% of 
the cases. 

The Ss for the study were (a) 40 individuals who 
had actually committed suicide (C/S group), (b) 
40 individuals who indicated they were not then 
and had never been suicidal (N/S group), and (c) 
31 psychotherapy outpatients who were threatening 
to commit suicide (T/S group). All Ss were Ameri- 
can-born, white males. In addition, the C/S group 
and the N/S normal group were matched for age 
(within 5 years) and for occupational level. 

The actual suicide notes were randomly selected 
(from the files of the Los Angeles County coroner) 
and included American-born, white males between 
the ages of 20 and 59 who had committed suicide 
between July 1, 1963, and June 30, 1964. Suicide 
notes were obtained from the nonsuicidal (N/S) and 
the suicide threateners (T/S) by asking them to 
write the suicide note they would write if they were 
going to take their own lives. Extensive control pro- 
cedures were used to avoid the possibility of indi- 
vidual adverse posttask reaction to writing a sui- 
cide note. A standard form was used to analyze the 
suicide notes in terms of the 19 hypotheses. A 
method of analysis was used which controlled for 
length of the suicide note. All analyses were done 
blind, each note having been coded, retyped on simi- 
lar typewriter and paper, and shuffled prior to being 
seen by the scorer. Interrater reliability tested by the 
percentage of agreement method yielded a mean 
agreement on the 19 categories of 87.9%. 


RESULTS 


The overall results of the comparison be- 
tween the suicide-commit and nonsuicidal 
groups are given in Table 1. Ten of the 19 
hypotheses were supported by the data (p < 
.05), a number greater than would be ex- 
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TABLE 1 
RESULTS or COMPARISON oF C/S AnD N/S Grours 


Á, l aa E 


Frequency 

Hypotheses TE a /S ul ne p 
1a. Need others do for him 29 9 18.10 0005 
1b. Needs emotiona] support 18 8 4,62 .025 
ic. Loss of something 12 6 1.79 ns (.10) 
1d. Parent figures important 18 10 2.69 05 
2a. Low self-esteem 12 11 .00 ns 
2b. At mercy of external forces 15 14 -00 ns 
2c. Concern about others 25 11 , | 01> 
2d. Self-referents 18 22 = ns» 
2e. Break in social ties 11 4 2.95 | .05 
3a. Communicate more 23 17 s ns» 
3b. Motor-action oriented 24 12 * 05» 
3c. High activity—suicide method 15 14 00 ns 
3d. Activity concern > contemplation 19 3 14.11 .0005 
4a. Others hurt by past behavior 6 3 50 ns 
4b. Concern regarding blame for suicide 12 2 7.01 005 
4c. Suffering mentioned 16 19 .20 ns 
4d. Targets named 35 18 14.31 .0005 
4e. Express direct anger 2 3 .00 ns 
4f. Veiled hostility 12 4 3.83 05 


a Compared by Mood's (1950) Median Test, 


b Level of significance evaluated by reference to Mainland and Murray (1952) table. 


pected to occur by chance. The data also 
showed a trend in the direction of support for 
Hypothesis 1c (p < .10). Of all 19 hypothe- 
ses, only two cases were found in which the 
observed frequencies were in the opposite 
direction of that which was expected. These 
were Hypotheses 2d and 4c. 

The results of the suicide-commit and sui- 
cide-threat group comparisons and the non- 
suicidal and suicide-threat group comparisons 


in terms of those variables which differenti- 
ated the suicide-commit and nonsuicidal 
groups are given in Table 2. The overall re- 
sults of this comparison support the hypothe- 
sis that suicide threateners, as a group, are in 
some ways like suicide committers, but in 
other ways unlike them, and that these simi- 
larities and differences are detectable in the 
psychological content of their communications, 

Since there were 19 minor hypotheses, in 


TABLE 2 
ResuLTs or Comparison or C/S anp T/S Groups ann N/S ann T/S Groups 
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Comparing C/S with S/T Comparing N/S with S/T 
Hypotheses 

x b X b 
la. Need others do for him 16.14 0005 .12 ns 
1b. Needs emotional support .00 ns 3.15 05 
1c. Loss of something 1.16 (.15) .00 ns 
1d. Parent figures important .00 ns 1.58 ns (.15) 
2c. Concern about others 1.13 ns 3.288 05 
2e. Break in social ties 21 ns 3.45 0S 
3b. Motor-action oriented 4.34 025 428 ns 
3d. Activity concern > contemplation 3.67 05 2.15 ns (.10) 
4b. Concern regarding blame for suicide 19 ns 3.42 0s 
4d. Targets named 8.00 005 34 ns 
4f. Veiled hostility .00 ns 4.15 025 


* Compared by Mood’s (1950) Median Test. 
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consideration of brevity and clarity, rather 
than stating each hypothesis and then coming 
back and separately giving the findings, the 
findings are reported (in parenthesis) after 
each hypothesis is presented. 


1, Pampered Life Style (Dependency) 


la. The first hypothesis of the group re- 
lated to pampered life style was that the mes- 
sages of suicidal individuals were expected to 
reflect the writers’ need to have things done 
for them. Content related to others doing 
something for the communicator or indicating 
the communicator expects or demands that 
others fulfill his wishes will be found more 
frequently in the suicide notes of the C/S 
group than the N/S group. (This hypothesis 
was supported by the data. In terms of this 
item, the T/S group was similar to the N/S 
group and differed statistically from the C/S 
group.) 

1b. The next hypothesis was that the mes- 
sages of suicidal individuals would reflect their 
concern regarding the emotional support of 
others. Content related to whether or not the 
communicator feels others have supported him 
emotionally or have let him down will be 
found more frequently in the suicide notes of 
suicidal individuals than those of N/S indi- 
viduals, (This hypothesis was also supported 
by the data. The T/S group was more like 
the C/S group than the N/S group relative to 
this item.) 

1c, A third hypothesis was that the suicidal 
Person’s difficulty adapting to loss will be evi- 
dent in his messages. Mention of having lost 
something and being unable to do without it 
will be found more frequently in the suicide 
notes of suicidal individuals than N/S indi- 
viduals. (Support was found for this hypothe- 
Sis at the .10 level of significance. The T/S 
group was similar to the N/S group and 
showed a trend toward differing [p= .15] 
ftom the C/S group regarding this item.) 

Id. The fourth hypothesis concerning de- 
Pendency was that the heightened importance 
of parent figures will be notable in communi- 
Cations of suicidal persons. References to par- 
ents and/or authority figures will be found 
More frequently in the suicide notes of sui- 
Cidal individuals than in those of N/S indi- 
Viduals. (Again, this prediction was supported 
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by the data. The T/S group was similar to 
the C/S group and showed a trend toward | 
differing [p= .15] from the N/S group in 
terms of this hypothesis.) 


2. Inferiority Feelings and Self-Centered 
Goals 


2a. It was hypothesized that content indi- 
cating low self-esteem would be found more 
frequently in the messages of suicidal indi- 
viduals than those of nonsuicidal individuals. 
(This hypothesis was not supported by the 
data. Content reflecting low self-esteem was 
very prevalent in the C/S notes, but also in 
the notes of all other groups.) 

2b. It was expected that the notes of the 
suicide committers would, more frequently 
than those of nonsuicidals, reflect a feeling of 
not being in control of their own destiny and 
a feeling of being at the mercy of forces out- 
side of themselves, (This expectation was not 
supported by the data.) 

2c. It was posited that the suicidal indi- 
vidual’s dependence on others for need satis- 
faction would result in his referring to other 
persons in the suicide note more frequently 
than would be found in the messages of non- 
suicidal individuals. (This expectation was 
supported by the data. The T/S group was 
like the C/S group and differed from the N/S 
group in this respect.) 

2d. It was predicted that the suicidal indi- 
vidual’s self-centered thinking would result in 
his using self-referents more frequently than 
would nonsuicidal individuals. (This predic- 
tion was not supported by the data.) 

2e. Supported by the data was the hypothe- 
sis that the suicide’s self-centered goal would 
interfere with the formation of lasting social 
ties. (Notes of the C/S group more frequently 
indicated a loss or absence of social ties or a 
break in interpersonal relations than did the 
N/S notes. Again the T/S group was more 
like the C/S group than the N/S group.) 


3. Degree of Activity 

It was felt this concept was related to the 
“tendency to act-out” notion. This is a crucial 
concept whenever there is an attempt to dif- 
ferentiate persons who talk about committing 
suicide from those who act on the impulse, 
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3a. On the basis of their expected high ten- 
dency to action and activity, the suicide com- 
mitters were expected to communicate more in 
an unstructured communication situation than 
nonsuicidal individuals. Therefore, it was 
predicted that their actual suicide notes 
would be longer than those of nonsuicidal 
persons. (This hypothesis was not supported 
by the data.) 

3b. It was hypothesized that suicidal per- 
sons are highly motor-action oriented. Verbs 
of action, which involve actual physical mo- 
tor behavior rather than reflecting thought or 
feeling, were predicted to be found more often 
in the messages of the C/S group than the 
N/S group. (This prediction was supported 
in the findings. The T/S group differed from 
the C/S group relative to this variable, and 
was like the N/S group.) 

3c. It was expected that the method of sui- 
side attempt would be more active (based on 
ratings of skilled suicide clinicians) for sui- 
cide-committers than that stated by nonsui- 
cidal individuals. (Support was not found for 
this hypothesis.) 

3d. A second hypothesis related to activity 
which was supported in this study was that 
the individual notes of suicidal persons will, 
more frequently than the notes of nonsuicidal 
persons, contain a greater number of verbs 
than adjectives. This notion is based on Bod- 
er’s finding that the adjective-verb quotient is 
indicative of whether an individual is more 
concerned with contemplation or with action. 
(In this index of activity, the C/S group was 
significantly higher than the N/S group. The 
T/S group was very different from the C/S 
group and more like the N/S group.) 


4. Expression of Anger 


4a. Because of their life style of covert ag- 
gression, the notes of suicide-committers were 
expected to contain self-accusations and 
thoughts that others have been hurt by their 
past behavior more frequently than the notes 
of nonsuicidal persons. (This hypothesis was 
not supported by the data.) 

4b. Attack in the form of blame was pre- 
dicted in the notes of actual suicides. (Con- 
cern regarding the blame of others for his 
suicide was found to be communicated more 
frequently in the C/S notes than the N/S 
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notes. The T/S group was more like the C/S 
than the N/S group regarding this hypothesis. 

4c. The use of “suffering messages” to in- 
fluence others was expected to be prominent 
in suicidal communications. Reference to the 
communicator’s suffering was expected to be 
found more frequently in notes of suicide- 
committers than those of nonsuicidal persons, 
(The findings showed no statistical difference 
between the groups in the frequency of this 
type item. All groups included a high number 
of these items.) 

4d. The next prediction was that the sui- 
cidal individual would attempt to make sure 
his veiled aggression hits his targets. (This 
hypothesis was supported.) Messages related 
to the notification of a specific person or 
addressed to a special person were found more 
often in the suicide notes of the C/S group 
than of the N/S group. (The T/S group was 
like the N/S group and differed from the C/S 
group regarding this variable.) 

4e. Content expressing “direct anger” was 
expected to be found less frequently in the 
notes of suicidal individuals than in those of 
nonsuicidal individuals. (This prediction was 
not supported statistically because there were 
too few expressions of direct anger in any 
of the notes to permit a statistical comparison 
of difference.) 

4f. In keeping with Adlerian and psycho- 
analytic theory generally, it was predicted 
that suicidal individuals to a much greater 
degree than nonsuicidal persons would write 
notes whose content reflected a general tenor 
of veiled hostility, aggression, and resentment. 
(The prediction was supported by the data. 
The T/S group was more like the C/S group 
than the NS group in this respect.) 


Discussion 


The findings of this study indicate that the 
T/S group is both similar and dissimilar to 
the C/S group and to the N/S group. A 
question arises whether the T/S group is from 
a single population, relative to the variable 
of suicide, or if it is possibly from two or more 
populations, varying in degrees of suicidality. 
If suicidality is considered as the degree of 
likelihood that an individual or group of indi- 
viduals will successfully act out the impulse 
to commit suicide, it seems reasonable to 
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assume that the more suicidal a person is, 
the more he will have a communication style 
similar to the C/S group. If it were pos- 
sible to split the T/S group into high- 
potential suicide threateners and low-potential 
suicide threateners, would the high-potential 
threateners communicate more like the C/S 
group and the low-potential threateners like 
the N/S group? 

To answer this question, the suicide threat- 
eners were dichotomized into 11 high-potential 
suicide threateners and 11 low-potential sui- 
cide threateners, using the Los Angeles Suicide 
Prevention Center’s Lethality Assessment 
scale (Hattem, 1963; Scheitzach, 1964). 
These two groups were then compared in 
terms of the communication-style items. Un- 
fortunately, the M was so small that none of 
the items was able to differentiate the two 
groups to a statistically significant degree. 
All trends, however, except three which tied 
frequencies, were in the expected direction. 
The fact that not a single trend was counter 
to that which was predicted, suggests that 
larger sample comparisons might yield sta- 
tistically significant differences. 

The findings that the TS group shows a 
heightened importance of parent figures, a 
need for emotional support, less concern than 
the C/S about loss, and less need than the 
C/S to have things done for him, may offer 
some explanation as to what helps a person 
who comes for psychotherapy while threaten- 
ing suicide, to work against his suicidal im- 
pulses. While the therapist becomes an im- 
Portant parent figure to whom the patient 
looks for support, the patient is able at some 
Point to accept the responsibility for therapy 
and to value doing something for himself. 
This dynamic, along with ego strength to 
adapt to losses, helps him acquire a more 
constructive communication and life style. 

Another specific finding of interest was that 
the direct expression of anger was almost 
totally absent from all notes, in all three 
groups. This suggests that in this culture there 
IS great difficulty in consciously perceiving a 
relationship between anger and suicidal im- 
Pulses, not only by the suicide-committer 
himself, but also by suicide-threatening and 
Nonsuicidal individuals. 

While such a step involves much conjecture 
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at this point, it seems worthwhile to posit 
some relationships between the variables 
which were found to be significantly related 
to suicide in this study. Such a consideration 
may offer insight into possible dynamics in- 
volved in the development of a personality 
prone to suicide. The data suggests that indi- 
viduals who commit suicide may have an early 
history of being treated in such a way that 
they develop feelings that they need to be 
pampered by others. The predisposing experi- 
ence of persons who display this strong 
need for pampering may be related to poor 
handling of dependency needs by the parents, 
where either excessive or deficient “feeding” 
(or an inconsistent combination of both) was 
the pattern. As a result of this uncertainty 
about whether his survival needs will be 
met, the individual retains his infantile self- 
centeredness and continuing question concern- 
ing his survival. The feeling grows that others 
do not really want to care for him, so if 
care is to be gotten, others must be manipu- 
lated into giving it. Success at manipulation 
requires the development of keen awareness 
of others in a self-centered way. Any seeds 
of a concept of social interest die out quickly 
as they go unfed. The individual learns to 
sense and use the unconscious guilt of others. 
Since his safety and survival needs are not 
adequately met, his actions often lead to 
hurt and painful experiences. A surface feeling 
of his importance to others results from his 
noting that others give him attention when he 
hurts himself. Since attention is as close to 
caring as the developing suicidal person thinks 
he can get, these experiences become very 
important ones for him, Attention becomes 
a substitute for the caring experience. The 
dependency of the suicidal person differs from 
the dependency found in the constellation of 
dynamics of numerous other emotional dis- 
orders. It is an active dependency, rather than 
a passive dependency. A passive-dependent 
personality sits and waits while others make 
food and other things available to him. The 
suicidal personality is active, but the activity 
is directed at manipulating others to meet his 
needs, rather than accomplishing things to 
satisfy his own needs. This pattern results in 
his being unable to develop any feelings of 
real individuation or self-worth, partly be- 
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cause he never accomplishes anything for 
himself, and partly because he never develops 
an inner feeling of safety which is necessary 
to experience himself as an individual. Unable 
to be comfortably independent, he sees others 
as the only source for getting his needs met. 
He is painfully aware at the same time that 
others are the frustrators of his needs. Hating 
others for this set of events, he is nonetheless 
afraid to express his anger directly for fear 
of losing what he does have. Thus, a repeti- 
tious pattern of hurting himself to use the 
guilt of others to hurt them and to manipu- 
late them becomes subjectively the most 
reasonable style of coping. 

Certain implications for diagnosis and treat- 
ment of suicidal persons may be drawn from 
the conclusions. It may be possible to detect 
individuals with highly self-destructive life 
patterns from an analysis of other message 
forms more readily available in clinical 
settings. Responses to open-ended written 
applications for psychotherapy or tape re- 
cordings of initial phone contacts are two 
obvious forms which could be objectively 
analyzed. Information from anamneses and 
projective tests could also be used to gain 
insight into the patient’s life style and com- 
munication style. Treatment should be di- 
rected at helping the patient become aware 
of his intrapunitive active-dependent coping 
style by making objective reference to his 
parallel style and content of communication 
during the process of psychotherapy. 
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PREDICTING THE SUCCESS OF PEACE CORPS 
COMMUNITY DEVELOPMENT WORKERS * 


CHARLES DICKEN? 
San Diego State College 


Predictor assessments obtained before and during training of 55 Peace Corps 
Volunteers were compared with ratimgs of effectiveness of performance over- 
seas. The main criterion was a composite of judgments by field supervisors; 
there were at least two raters for each S. The most valid predictors were peer 
ratings, life history, certain training grades, and ratings by a selection board 
using all data. Some psychometric measures of personality and cognitive style 
had modest validity, but intellective measures had almost none. Interviews 
and clinicians’ impressionistic interpretations of predictor data showed low 
validity. Females were more predictable than males. Analysis of a subsample of 
“least effective” Ss failed to identify predictors particularly effective in dis- 
criminating between candidates “just acceptable” and “just not acceptable.” 


The predictors were factor-analyzed. 


Mischel (1965) compared criterion success 
ratings of 35 Peace Corps Volunteer teachers 
in Nigeria with psychological assessments ob- 
tained while the Volunteers trained to go 
overseas. Self-report personality measures cor- 
related significantly with the criterion but 
other predictors did not. The predictors em- 
ployed and their correlations with the crite- 
tion were as follows: California F Scale 
(—.45), Barron Ego-Strength scale (.34), 
Taylor Manifest Anxiety scale (—.34), peer 
ratings (.29), faculty ratings (.27), academic 
grades (.24), assessment board (.20), and 
interview (.13). Mischel concluded that his 
results supported the usefulness of simple 
self-report devices and that they pointed, as 
have previous studies, to the limitations of 
“global” assessment procedures. Limitations 
in Mischel’s predictor and criterion measures 
aky avowed by him) will be discussed 
ater, 
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Predictive assessments. 

* Requests for reprints should be sent to Charles 
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Ezekiel (1968) obtained fictional autobio- 
graphical essays describing the personal future 
from 42 Volunteers training to serve as teach- 
ers in West Africa, The essays were scored 
for Differentiation, degree of complexity and 
detail (three subscores); Demand, long-term 
effortful response to challenge; and Agency, 
portrayal of self as agent of major decisions. 
The scores’ correlations with a composite 
first-year criterion success rating ranged from 
.13 to .31 and with a second-year rating by 
the Peace Corps representative from .22 to 
.34, The sum autobiography scores correlated 
with these criteria .29 and .41. There is some 
question of contamination of the success 
ratings by the predictors since the investi- 
gator and the training-site selection officer 
collaborated with the representative and his 
deputy in determining the first-year ratings. 
The autobiography measures had negligible 
validity in predicting peers’ ratings of teach- 
ing success for the total sample, but did pre- 
dict for female and Protestant subgroups 
(validity of autobiography sum .41 and .56). 
Barron’s Ego-Strength, Taylor’s MA scale, 
and an authoritarianism measure were ob- 
tained in training, but no validities were 
reported. 

The present study investigated the validity 
of a comprehensive assessment program in 
predicting the success of 55 Volunteers who 
served as community development workers 
in Peru. 
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METHOD 
Assessment Program and Predictor Variables * 


At the time the Ss of the study entered Peace 
Corps training, applicants for the Peace Corps were 
initially screened by college-educated technicians 
under supervision of a clinical psychologist. In addi- 
tion to skill classification, the technician made a 
pretraining assessment of overall suitability for the 
project to which the candidate was assigned. This 
was based on an application blank, educational 
transcripts and employment records, letters of refer- 
ence, and scores on the General Ability Test and 
Modern Language Aptitude Test. The pretraining 
assessment was summarized in a short sketch for use 
at the training site. The technician also rated the 
candidate on the scale: (1) Unsatisfactory, (2) 
Marginal, (3) Good, (4) Very good, (5) Excellent. 
Plus (+) and minus (—) were used to qualify the 
numerical ratings in many instances. Candidates in- 
vited to train were all rated 3 or higher. During 
the study period approximately half of all applicants 
to the Peace Corps were invited to train and about 
half of all invitees accepted. 

At the training site two clinical psychologists were 
employed as field assessment officers (FAO) respon- 
sible for obtaining and interpreting assessment data. 
Another clinical psychologist, the field selection officer 
(FSO), made the final decision of whether the 
trainee was sent overseas (“selected”) or not sent 
(“deselected”). The FSO was assisted in his decisions 
by a selection board composed of himself, the FAOs, 
the training project director, the project psychiatrist, 
a training officer from Peace Corps headquarters, and 
a representative of Peace Corps administration in 
Peru. The selection board met at the midpoint of 
the 12-week training program and again at the close. 

A description of the quantitative or quantifiable 
assessment data gathered during training follows. 
The listing approximates the sequence in which the 
data were obtained, with exceptions noted. 

Test data. The Minnesota Multiphasic Personality 
Inventory (MMPI) and the Crutchfield Figures were 
administered at the start of the training, The MMPI 
was scored on the standard 13 scales plus the Barron 
Ego-Strength (Es) scale. 

The senior FAO, who was experienced in the 
clinical use of the MMPI, made a “blind” MMPI 
rating and wrote an interpretation for the selection 
board. Prior research has indicated low validity for 
the MMPI clinical scores as predictors of work 
performance of normals (Dicken & Black, 1965). 
Only Es was analyzed separately. The Crutchfield is 
an embedded figures series aimed at Measuring per- 
ceptual and cognitive flexibility* The FAOs also 
had access to the application tests cited above. 

Initial FAO impressions, During the first weeks of 
training, the FAQs conducted a structured interview 
with each trainee. Emphasis was on assessing motiva- 
tion, maturity and emotional stability, interpersonal 
relationships, leadership, competence, and psychiatric 


3 Variables used in the present study are italicized. 
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difficulties, if any. The interview rating of overall 
potential was then done with knowledge of the 
pretraining assessment and test data. 

Near the midpoint of training, a leaderless discus- 
sion group session was held with sets of five to seven 
same-sex trainees. Each group was asked to attempt 
to solve the problem of an “unsuitable Volunteer” 
overseas. The FAOs observed and rated overall 
potential as indicated by discussion behavior. 

Throughout the training, the FAOs made observa- 
tions of performance and social behavior. 

Training grades. The study uses grades made at the 
end of the program by the staff of four training 
components, Community development training per- 
formance was graded by examinations, discussion, 
and field work observations. The language rating 
was based on the Foreign Service Institute structured 
oral examination in Spanish. Physical education 
grades reflect attitude and effort more than athletic 
proficiency per se. Domestic skills training was pri- 
marily for the women trainees, but the males were 
given brief training and both sexes were graded. 
Male technical skills were not graded. The training 
faculty made qualitative appraisals of personality 
which were available to the selection board. 

Peer nominations. At the close of the program, 
each trainee was asked to nominate five others of 
each sex he would choose to work with overseas 
(assignment preference), five of each sex with the 
most leadership potential, and any number he be- 
lieved had overall potential for success in Peace 
Corps work. Two other nominations “adapt well 
overeas” and “can confide in” were obtained but not 
analyzed for the present study because of high inter- 
correlations with other ratings. An unlimited number 
of negative nominations was permitted on each of 
the five items (nominator prefers not to be assigned 
with S, sees lack of leadership potential in S, etc.). 
There were 65 nominators. Descriptive comments by 
peers were obtained and used by the selection board. 

Final FAO impressions. These were based on all 
data available to the FAOs during the training proc- 
ess but were not recorded until after the action of 
the final selection board. Board status was the 
FAOs’ impression of the intermediate and final 
selection board discussions of the trainee including 
controversiality. Development was based on pres- 
ence in the life history of “crisis experiences” and | 
successful resolution of same. Overall potential was | 
the FAQs’ global impression of the trainee. 

Final selection board rating. Trainees selected for 
overseas service were rated 4 (“is approximately in 
the lower quartile”), 5 (“approximately the middle 
two quartiles”), or 6 (“approximately the top 
quartile”), The psychiatrist contributed observations 
to the board based on all trainees’ performance in 
mental hygiene training and on interviews conducted 
with about one-fourth of the study sample who had 
been referred for evaluation. All assessment data 
was available to each board member. The FSO also 
used the results of a Civil Service background 
investigation of each trainee. 

Gough MMPI scales. Thirteen of the special MMPI 
scales developed by Gough (Dahlstrom & Welsh, 
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1960, pp. 448-467) were scored for the present study. 
These scores were not available during the assess- 
ment process and for this reason were analyzed 
separately from the other predictors. 


Subjects 


Eighty-four candidates accepted invitations to join 
the 12-week training project. Resignations and de- 
selection for suitability or medical reasons left 61 
candidates who were sworn in as Volunteers at the 
close of training and sent to Peru. Criterion data 
became available for 55 of the 61, which constitute 
the total sample of the study. The median age during 
the training period was 22. Two Ss had completed 
bachelor’s degrees and some postgraduate education, 
48 had completed the bachelor’s, four 2 years of 
college, and one a single year of college. 


Performance Criteria 


The Volunteers were, in the main, assigned to 
work in barriadas, relatively new owner-constructed, 
quasi-legal settlements which have grown up out- 
side established cities in the coastal regions of Peru. 
The task of the Volunteer was to stimulate and 
organize community action projects in housing, roads, 
education, youth activities, health, and related areas. 
An important component of the task was to foster 
communication between the barriada dweller and 
agencies of the Peruvian government. The job of the 
community development worker has been described 
as “unstructured” in that there is usually no clearly 
specified task for the worker when he arrives in 
the community. Results are judged as much in terms 
of increased cohesiveness in the community as by 
Specific projects completed. Providing technical assist- 
ance and instruction was frequently part of the task 
but usually not the main emphasis. 

Composite field rating. The main criterion was an 
average of independent ratings of Peace Corps ad- 
ministrators (associate representatives and deputy 
director) responsible for supervising the Volunteers’ 
work, The raters judged Ss on “Overall effectiveness 
as a Volunteer, taking site difficulty into account.” 

7-step scale was used: (1) Poor, (2) Below aver- 
age, (3) Probably below average, (4) Average, (5) 
Probably above average, (6) Above average, and 
(7) Superior. The raters were instructed to use their 
total experience with Volunteers in setting a frame 
of reference for “Average,” “Poor,” etc. A standard 
of at least two raters per S was set. Fifty-one Ss 
(23 male, 28 female) qualified for this criterion 
Measure. The ratings were made after 12 to 16 
months’ service by S. Nine raters participated and 
contributed a total of 134 ratings. There was an 
average of 2.9 raters per male S and 2.4 raters per 
female S, Reliability estimates for the composite 
ratings determined by Ebel’s (1951) technique are 
a (males), 50 (females), and .62 (total). The 
reliability estimates were computed for the case in 
Which the source of the ratings was not the same for 
all Ss, and were functions of the number of raters 
Der S and the degree of interrater agreement. The 
amount of contact between rater and S varied con- 
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siderably. No S was rated, however, unless the rater 
felt he had an adequate basis for determining the 
criterion as specified. 

The composite field rating on the 51 Ss is referred 
to hereafter as the “main” criterion. Two alternate 
criteria were available for some of the 51 Ss and in 
one instance for two additional Volunteers from the 
training project. 

Researchers’ rating. Two staff members of the 
Peace Corps research division briefly visited 30 
Volunteers and their supervisors. Two of the Volun- 
teers were not among the 51 on whom composite 
field ratings were obtained. Fifteen of the 30 were 
males, 15 females. The Volunteers had served 10 
months at the time of the visit. The researchers 
made a single performance rating of each S and 
later made these available to the author. 

Inspector’s rating. The second alternate criterion 
was furnished by an inspector charged with evalu- 
ating training and field programs, who visited 21 
of the Ss after approximately 10 months’ service. 
Subsequent to furnishing these ratings, the inspector 
became deputy director in Peru and contributed to 
the composite field ratings. 

The alternate criteria were based on fewer Ss and 
briefer contact than the main criterion. Their reli- 
ability cannot be estimated directly. The researchers’ 
rating correlates 42 with the composite criterion 
(N =28) and the inspector’s rating correlates .66 
(N=20). The alternate criteria intercorrelate .64 
(N = 19). 

In addition to the 53 Ss on whom performance 
ratings were available, two additional Ss from the 
original 61 were analyzed solely with regard to early 
termination (described later). 


Analysis 


Means, ranges, standard deviations, and Pearson 
intercorrelations of the variables were computed. 
N is maximized in each analysis; thus intercorrela- 
tions among the predictor variables uses N = 53; 
correlations between these variables and the com- 
posite field rating criterion uses N =51, etc. A tri- 
serial correlation was computed between the final 
selection board rating and the main criterion. A 
factor analysis was done on the 20 predictor vari- 
ables. The solution was reached by principal axes 
with unity in the diagonals and a Varimax rotation. 
Kaiser’s (quoted in Cooley, 1962, p. 160) criterion 
for the number of factors (no latent roots less than 
unity) was applied and resulted in extraction of six 
factors. 

Canonical (multiple) correlations between selected 
predictors and the main criterion were computed. 
Selection of predictors was done with several strate- 
gies: (a) all predictor variables; (b) best represen- 
tation of each class of data (pretraining data, test 
scores, initial FAO impressions, grades, peer ratings, 
final FAO impressions); (c) factor representation 
(most valid representative of each factor); (d) 
economy (use of measures available without psy- 
chologists’ effort), and (e) simplicity (pretraining 
assessment and peer leadership rating). 
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Possible Influence of the Criteria 
by the Predictors 


At the time of the study, detailed assessment data 
from the training site were not furnished to admin- 
istrators overseas. The selection board rating and a 
general, favorably-toned résumé of the Volunteers’ 
skills and characteristics were forwarded, and the 
overseas representative to the selection board had an 
opportunity to confer with some of the administra- 
tors at the time the Volunteers were assigned. Com- 
munication difficulties within the host country and 
inefficiencies in forwarding data sometimes meant no 
training data reached the administrators. 

The sparseness of available training data, the 
length of time between the arrival of the Volunteers 
in Peru and the point when the criteria were ob- 
tained, and the well-known pragmatism of Peace 
Corps administrators suggest that the field ratings 
were based on the Volunteers’ actual performance 
and influenced little if at all by information produced 
at the training site. A notable possible exception is 
the field data furnished by the representative (G.B.) 
who served on the selection board and who later 
contributed 19 (14%) of the 134 main criterion 
ratings. Two analyses were done to evaluate the 
plausibility of the assumption that the predictor 
data had little influence on the criteria and to 
investigate the exception noted. 

Survey of the raters. Seven of the eight main cri- 
terion raters other than G.B. were queried about 
their access to training data and their impression 
of the influence of such data on their field ratings.5 
Three of the seven, who furnished 18 (13%) of the 
ratings, stated they had no access to training data. 
A rater who made 12 (9%) of the ratings stated, 
“the feedback from training was too slow. We had 
very little data until the Volunteers were assigned. 
Unfortunately, we had to rely too heavily on the 
biographical sketch.” 6 A representative who made 4 
(3%) of the ratings said, “I do not recall what 
[training data] I had . . . prior to my last assess- 
ment, but I doubt that it influenced that assessment.” 
A representative who furnished 14 ratings (10%) 
stated, 


Tf there was [influence] it wouldn’t be very much. 
Usually 16 months is far enough—you pretty much 
go on what the Volunteer has done. I would say 
bias was minimal. You learn first impressions aren’t 
worth much. “Gold Stars” would come in, and 
they would not turn out—and vice versa. 


The deputy director, who furnished 45 ratings 
(34%), and who had earlier done the inspector 
ratings, responded as follows, 


I was influenced by nothing, to the best of my 
knowledge, other than the Volunteers’ performance 


5 This information was obtained after the ratings 
were received, One rater proved impossible to reach; 
he had contributed 22 (16%) of the main criterion 
ratings. 

C A short paragraph, neutral or favorable in tone, 
composed entirely from information furnished by the 
Volunteer. 


CHARLES DICKEN 


in Peru, in filling out the ratings you requested. 
The only data .. . that figured in my judgment 
was my own observation and [the] immediate field 
supervisors’ evaluations. I presumably had seen 
some training data on them, but I can assure you 
it had no influence on my ratings. 


Comparative predictability of criterion ratings 
made by “most influenced” rater. Since rater GB, 
had served on the selection board before assuming 
his administrative post in Peru, his field ratings 
would presumably be most open to contamination 
by the training data. Not only did he have complete 
access to all the predictors, numerical and qualitative, 
but he was also personally involved in the assess- 
ment process, Criterion contamination associated with 
this rater would, if present, be expected to result in 
greater predictability of his field ratings than of the 
overall composite field ratings, since the latter were 
based mainly on judgments by raters exposed to 
considerably less training data and who had no 
personal involvement in the assessment and selection 
of the Volunteers. To test this, G.B.’s field ratings 
were separately correlated with the final selection 
board rating and with the three best individual pre- 
dictors. The outcome of this comparison appears in 
the Results and Discussion section. 

Since many of the raters did have access to at 
least some information from the training site, despite 
the time lapse, the raters’ denial of influence, and 
the evidence from the comparison cited above, the 
possibility of some criterion contamination cannot 
be entirely ruled out. The findings of the study are 
subject to this limitation. Since no raters except 
G.B. had access to the detailed numerical assessments 
that are used as predictors, it seems probable that 
contamination, if present, has relatively little influ- 
ence on comparison of the relative efficiency of the 
various predictors, even though overall predictive 
validity might be inflated to some degree. 


RESULTS AND DISCUSSION 


Table 1 shows scale properties” of the 
major variables and the ranges, means, and 
standard deviations for all Ss and for males 
and females separately. Range is computed 
by (upper limit-lower limit); thus the maxi- 
mum possible range for a 7-step scale is six 
units. T scores were based on Peace Corps 
applicant norms except for Es, where Barron’s 
norms were used. 

The variabilities are in most instances sub- 
stantial. The ranges are often at or near the 
maximum possible. Thus restriction of range 
1s in most instances not a factor restricting 
correlation between variables. Males vaty 
more on the criteria than females and tend 
to do so on the predictors. There is no con- 

T Some of the original scales (e.g., the pretraining 


assessment ratings) were expanded to accommodate 
qualified” ratings (“3+4,” “3/4,” etc.), 
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TABLE 1 
PROPERTIES OF VARIABLES 
= ——————— == 
Total Males Females 
Steps (N = 53) (N = 24) (N = 29) 
Variables Scale in 
scale 
Range | M | SD |Range| M | SD |Range| M | SD 
Criteria® 
Composite field rating rating 7 54 | Aa) Sia 424s | 4s | aca as 
Researchers’ rating rating 5 4 28| 11 4 3.2] 10} 3 231.0 
Inspector’s rating rating 9 8 49| 24| 8 5.4] 23| 6 4.2] 2.3 
Predictors 
Pretraining assessment rating 6 4 39] 9] 4 3.9) 10) 4 3.912 9 
General Ability Test T score a 31 57.7| 7.1] 30 59.6 | 6.4] 30 S618 7.2 
Modern Language 
Aptitude Test T score ea E 57.1 | 7.4| 27 55.4] 8.0 | 24 58.5 | 6.4 
Crutchfield Figures raw score 1 | 20 11.2} 5.2) 20 13.2 | 5.4] 18 9.6} 4.5 
MMPI evaluation rating 5 4 3.6] 13) 4 3.4) 13| 4 3.8] 1.3 
MMPI Es T score fran 41 65.7 | 7.3} 31 64.7 | 6.9] 39 64.0 | 7.6 
Interview rating 8 7 sf LIATE 5.6} 16] 7 5.0} 1.9 
Group discussion rating rating 7 6 48] 17| 6 4.9} 16| 6 4.6) 18 
Community development 
grade rating 8j 7 60) 18| 7 58| 21| 5 61} 1.5 
Language rating rating 8 7 3.34 LD oF <i is UL 7 3.2] 1.8 
Physical education grade rating 6 5 41] 11 5 43) 11| 4 4.0} 11 
Domestic skills grade rating 5 2 45} 6] 2 45] 6] 2 44) 6 
Peer assignment preference | raw score} 64 | 23 9.4} 48| 20 11.0] 49] 18 8.1) 4.4 
Peer leadership nomination | raw score} 64 | 47 10.1 | 11.9 | 47 12.6 | 12.9 | 42 8.0 | 10.4 
Peer overall nomination raw score} 64 | 42 10.3 | 10.1 | 42 12.8 | 11.2 | 38 8.2| 8.5 
Peer negative nomination raw score | 321 29 42| 56| 14 3.6 | 39| 29 45) 6.6 
FAO rating “board status” rating 6 5 457 15) S5 47| 16| 4 44| 13 
FAO rating “development” | rating 8 6 5.4] 1.7 6 54] 15! 6 5.3} 18 
FAO rating “overall 
Potential” rating 8 6 5.6 | 1.8 5 59| 18] 6 5.4] 18 
Final selection board rating 3 2 48| .7| 2 SBN 7 2, 4.7 8 


* Ns for criteria (total, Male, Female) : Composite (51, 23, 28), Researcher (30, 15, 15), Inspector (21, 13, 8). 


sistent sex trend in mean values except in peer 
ratings where the males excel. 
1 Table 2 shows the intercorrelations of the 
predictors. The .05 and .01 significance levels 
for correlations are .27 and .35 (df = 51). 
A factor analysis of the predictors is presented 
later. Correlations and factor loadings are 
shown in the tables with decimals omitted. 


Predictive Validities 


Table 3 shows predictive validities of the 

_ Ssessments (7 e) and validities corrected for 
unreliability of the criterion (rpex). The co- 
efficients are shown for males and females 
Separately in the case of the main criterion. 
The most effective predictors of the main 
Ctiterion were the pretraining assessment, 
Community development grade, and peer 

, leadership rating. Each equals or exceeds the 


validity of the final selection board which 
used all data. Test scores and psychologists’ 
impressions have relatively low validity. 

The uncorrected correlations of the selec- 
tion board rating and the three best individual 
predictors with the criterion ratings by repre- 
sentative G.B. (N = 19) were final selection 
board, .09; pretraining assessment, .32; com- 
munity development grade, 35; and peer 
leadership nomination, .36. Comparison of 
these correlations with those in the first col- 
umn of Table 3 indicates that in each instance 
the outcome is opposite to that predicted by 
the contamination hypothesis. Failure of the 
final selection board ratings to predict G.B.’s 
field ratings indicates his impressions of the 
Volunteers’ field performance deviated quite 
distinctly from the appraisal to which he had 
been both privy and a contributor 16 months 
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TABLE 2 
INTERCORRELATIONS AMONG PREDICTORS 


1f2tsf4 [os] 62 a | 4 [is | a6 | az | se | ae | 2 
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a BASRA iaiahahasehhc bat aa 
v 
9 88° Jas 
a “a v as FER 
Ed g 3.88 
be w5 9 pee825 
EP TEE TEEEEEES 
B35 8 Eo aeR ESE Dog 
ME E £8 cs ,SaSseebes 
S\eeS BS 85 Phos akesss 
o/ 425 eS gu s8seSeoses 
agag SS 35 REEE EA 
D | w5 peu S23 Deg has EE 
SJE goria, gogo cess 
EERTE ELET 
EEEE TITE 
OS O22509 SeAccecess 
seis HUGE SHdddvicnas 
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Note.—Total sample N = 53. 


earlier. The data appear to justify the inclu- 
sion of G.B.’s ratings in the criterion and 
lend weight to the testimony of the raters 
and other considerations suggesting little 
criterion contamination. 

Except for the MMPI evaluation, females 
tend to be more predictable than males (con- 
ventional wisdom notwithstanding! ). This is 
true both of the level of the coefficients and 
in the larger number of assessments which 
show some validity. When the lower cri- 
terion reliability of the female sample is- 
considered, the effect is heightened. There 


(Seashore, 1962). 

Table 4 shows the multiple correlations ob- 
tained by various strategies of combining 
predictors. Uncorrected correlations were 
used in calculating the multiple correlation 
(R). Combinations of easily obtained data 
predict better than the selection board and 
much better than the assessment officers’ im- 
pressionistic combination of data. Multiple 
correlations capitalize on sampling fluctua- 
tions in the correlations, and there is no way 
to estimate the shrinkage which would occur 
on cross-validation. This limitation is espe- 
cially pertinent to the 16-predictor coefficient. 


Clinical and Actuarial Prediction 


From the pragmatic viewpoint, the most 
significant finding is that the pretraining 
assessment is virtually as valid as anything 
else, and that a combination of it with a 
single, easily obtained, in-training assessment 
(peer leadership) bests the validity of the 
selection board. Peer ratings and the prê- 
training assessment, while “clinical” in char- 
acter, are routine and relatively inexpensive. 
The generally poor showing of impression- 
istic modes of combining data is certainly. 
nothing new (Meehl, 1954; Sawyer, 1966). 

The alternate criteria are somewhat better 
predicted by the psychologists’ impressions 
and the selection board than is the main cri- 
terion. While the small W’s preclude any but 
tentative generalizations, there is the possible 
implication that a certain “superficiality 
of the quickly obtained alternate criteria 
is shared by the psychologists’ predictiv 
impressions, 
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TABLE 3 
CORRELATIONS BETWEEN PREDICTORS AND PERFORMANCE CRITERIA 
r Research- |. Inspec- 
Composite field rating ers’ tor’s 
rating rating 
Predictors 
Total Males Females 15M, 15F | 13M, 8F 
(N = 51) (N = 23) (N = 28) (N = 30) |(W = 21) 
Ypo Tpos Tpe Trew fpo T pew Ypo Ype 
Pretraining assessment 4i* aie 37 44* 44* 62** 21 38 
Test data 
General Ability Test 17 22 —07 | —08 37 52 32 —02 
Modern Language Aptitude Test| 11 14 04 05 23 32 —13 —23 
Crutchfield Figures 31* 39** 19 23 44* 62** 05 09 
MMPI evaluation 14 18 44* 5%" =t —16 00 27 
MMPI Es 32* 41** 24 29 48! 68** 17 24 
Initial FAO impressions 
Interview 23 29* 09 11 34* 48** 30 28 
Group discussion rating 23 29* 22 26 24 34 20 38 
Training grades 
}. Community development 43** Dogi 41* 49* 49** 69** 11 27 
f Language rating 17 22 —03 | —04 32 45 18 16 
Lf Physical education 32% 4i* 12 14 50** das. 07 24 
E Domestic skills 12 15 —13 | -15 33 47* 48** 45** 
Peer nominations i 
Assignment 22 28* 22 26 21 30 40* 30 
Leadership so | 64** ASEH MGS TEM: este 7a* 35 Sort 
Overall 40** =) ae 31 37 52** 72 44* 60** 
Negative —20 | —25 marsh = tons |) 20 | 5) 02 05 
Final FAO impressions 
“Board status” 27* 34* 06 07 47* 66** 46** 43* 
Development” 26 33* 19 23 33 47* 51 50* 
“Overall potential” 22 28* 13 15 30 42* 4gr* 51* 
Final selection board rating SY hse! 47* 24 29 47* 66** e. iial 51* 
(Triserial) 41* | 52** 27 32 bot 78** — — 
*P, <\ 
"P, S01: 


Prediction of Least Successful Performance 


The function of the assessment program 
and of the psychologists’ and selection board’s 
judgments was not primarily to predict degree 
of success among suitable prospects but to 
Separate the unsuitable. The “band” to which 
Me assessments were oriented was “barely not 
Suitable” to “barely suitable.” Since indi- 
Viduals found unsuitable were not measured 
as to overseas performance, there is no direct 
Measure of the accuracy of this discrimination. 

An indirect test of the effectiveness of the 
assessments in this band is shown in Table 5. 
Means and standard deviations of selected 
predictors and significance tests are shown for 
12 “least successful” Ss versus the remainder 
(“more successful”). The least successful con- 


sist of those five males and five female Ss 
receiving composite (main) criterion ratings 
in the range 1.0-2.99 (“poor” to the upper 
limit of “below average”) plus two additional 
early termination Ss. Six of the original 61 
Volunteers terminated service prematurely. 
The official “reason” for termination and 
anecdotal data from field raters were available 
for all six. Three of the six terminators met 
the standard for the main criterion and are 
in the study sample. All of these received 
composite criterion ratings less than 2.99, and 
so fall in the “least successful” group. While 
confidentiality forbids going into detail, it 
seemed clear to the writer that two additional 
Volunteers (one male, one female) terminated 
because of, or concomitant with, unsatisfac- 
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TABLE 4 
MULTIPLE CORRELATION oF PREDICTORS AND MAIN CRITERION 


CHARLES DICKEN 


= 


Strategy Predictors Be E R fs 
|——_ 
All predictors All shown in Table 2 except final FAO impressions and final i 
selection board 16 5208 
Best representation of | Pretraining assessment; Zs; interview; community development ag 
classes of data grade; peer leadership rating ki 62 EA 
Factor representation | Peer leadership rating; Crutchfield Figures; FAO board status; & 
PE grade; Es, pretraining assessment 6 ‘Ole 
Economy: Pretraining assessment; community development grade; language v 
grade; PE grade; peer leadership rating | 5 
Simplicity Pretraining assessment; peer leadership rating 2 
Note.—N = 51. 
tory performance. These two are included in formly the case, and only 5 of 18 comparisons 


the “least successful” group. Five of the six 
early terminators are thus included in the least 
successful group. The other early terminator 
appears to have performed satisfactorily until 
leaving. 

Were the assessments especially effective in 
the band at which they were aimed, it would 
be reasonable to expect relatively large dif- 
ferences in the means of the two groups rela- 


reach more than borderline significance,® The 


8 Because of the small N, the possibility of Type 1 
errors must be considered, especially in the sex- 
separated analyses. On the other hand, the variance 
homogeneity assumption for £ is probably nol 
justified in the case of the peer and selection board 
data. Despite the large differences, especially in the 
peer nominations, nonparametric analyses for the 
total sample (Mann-Whitney) and for the females 
(Kruskal-Wallis) were not significant for eithe 


i 


tive to variabilities. This is by no means uni- variable. 
TABLE 5 
Least SUCCESSFUL VOLUNTEERS Versus More SUCCESSFUL VOLUNTEERS: SELECTED PREDICTORS 
Total Males Females 
Predictors i 
Least More P, Least More P Least More 
(N = 12) |(W = 43) N = 6) (N = 19) | “| (WW = 6) |N = 24) 
Pretraining assessment am 
M 3.58 4,00 | ns 3.67 4.00 | ns 3.50 4.00 
SD 76 94 US 1.03 76 87 
MMPI evaluation 
M 3.42 3.72 | ns 2,33 3.79 |.05 4.50 3.67 
SD 1.44 1.28 1.11 1.10 -76 1.40 
Interview 
M 5.00 5.23 | ns 5.33 5.58 | ns 4.67 4.96 
SD 1.83 1.78 1.37 1.60 2:13 1.86 
Community development grade 
M 5.17 6.19 | 10 5.33 6.95 10 5.00 6.38 
SD 1.62 1.79 170 | 216 1.53 1.41 
Peer leadership nomination 
M 5.42 11.09 |.05 8.17 13.68 | ns 2.67 
' . i 9.04 
SD 5.69 | 12.63 549 | 14.01 4.42 | 11.00 
Final selection board 
M 4.42 4.81 05 4.67 484 |ns 
: al z 4.17 4,79 
SD 49 79 47 74 3F 82 


* Direction contrary to hypothesis, 
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TABLE 6 
Factor MATRIX FOR PREDICTORS 
Factor 
Variable 

Je I Ts IV vV VI hè 
Pretraining assessment 28 31 05 03 11 52 46 
General Ability Test 04 82 26 —10 —19 16 82 
Modern Language Aptitude Test —07 59 09 08 —01 57 69 
Crutchfield Figures 06 84 —10 14 17 —06 78 
MMPI evaluation 06 00 —02 —01 89 09 81 
MMPI Es —09 12 37 —08 33 59 57 
Interview 20 —02 61 —12 06 38 57 
Group discussion rating —03 —24 55 24 06 44 61 
Community development grade 44 -17 13 15 —30 61 72 
Language rating 71 —02 23 11 —11 —03 59 
Physical education grade 33 05 11 74 04 08 67 
Domestic skills grade 19 —07 43 49 10 =13 50 
Peer assignment preference 74 00 23 —09 07 06 62 
Peer leadership nomination 74 15 30 22 09 28 80 
Peer overall nomination 84 06 31 12 08 18 87 
Peer negative nomination —39 13 —12 61 —20 12 61 
FAO rating “board status” 32 07 79 21 —11 18 83 
FAO rating “development” 31 12 85 —01 03 03 83 
FAO rating “overall potential” 44 15 80 07 “il 06 88 
Final selection board 56 04 71 04 05 14 85 


Note.—Total sample N = 53. 
* Signs reflected. j 


findings of considerable overlap in mean 
predictor scores do not change when the 
six cases with main criterion scores of 2.00 
or less are compared with the remainder of 
the Ss or when the five terminated-unsatisfac- 
tory cases are compared with the remainder. 
Inspection does, however, indicate that the 
Overlap is partly a result of two or three 
cases in the least successful group scoring 
quite high on predictors while the others in 
the group score low. 

Inspection of the negative aspects of the 
assessment data on the least successful Ss 
yields the following impressions. Males: One 
of the six was assessed as essentially sound 
across the board and given a “5” rating by 
the selection board. A “clean miss” must be 
entered here. Of the other five, three had 
MMPI profiles rated “unfavorable” (Codes 
#9; 132'; and 83' with sharply elevated L 
and K) and one’s profile was rated doubtful 
(59'). Two of the males showed prominent 
Signs of anxiety in the interview plus psycho- 
Somatic illness in the life history. Two others 
Were seen as strong, active individuals but 
With evidence of opinionation and excessive 


individuality. All four last-cited males were 
seen by the selection board as having com- 
pensating strengths. The last of the five was 
perceived as immature and possibly lacking 
in staying power in pretraining assessment, 
community development training, and in the 
program generally. He was “controversial” at 
the midtraining selection board. 

Females: One of the six was assessed as 
essentially sound. Three of the other five were 
seen during training as rather clearly lacking 
in experience and maturity, as naive, and as 
having questionable staying power. Likeability 
and inoffensiveness seems to have resulted in 
favorable selection decisions in these cases. 
The other two females were seen as stable 
and mature but as somewhat slow and passive 
during training. 

In retrospect, there seems to have been a 
relatively clear basis for deselecting the one 
“poorest” male and the three “immature” 
females. The remaining eight least successful 
cases do not seem to have been clearly dis- 
tinguishable from many other “selected” Ss 
because of what seemed, at the time, to be 
substantial compensating assets. 
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TABLE 7 


CORRELATIONS BETWEEN GouGH SCALES 
AND MAIN CRITERION 


No.of} Total | Males | Females 
mele Items | (V = 50) | (N = 23) | (N = 27) 

‘Academic 

achievement (4c)| 18 | —03 00 —05 
Delinquency (De) 12 —19 -32 —08 
Dominance (Do) 28 27* 13 41* 
Femininity (Fe) 16 | —06 -09 07 
Honor point 

ratio (Hr) 16 23 11 34 
Intellectual 

efficienty (Ze) 39 04 12 —01 
Impulsivity (Im) 21 15 -04 —30 
Psychological 

interests (Py) 6 25 11 44* 
Social respon- 

sibility (Re) 32 19 —02 45* 
Social participa- 

tion (Sp) 25 21 —02 46* 
Social presence (Sr)| 21 30* 11 44* 
Social status (Si) | 34 12 -05 22 
Tolerance (70) 30 31* 12 45* 

* P, <05. 


Factor Analysis of Predictors 


The results of factor analysis of the pre- 
dictors are shown in Table 6. The frst factor 
obtained reflects overall merit, and loads most 
on peer ratings. Factor II is “tested mental 
ability.” Factor III is equally clearly an 
“assessment officer” factor. With validity data 
in mind, the high loading of the selection 
board on this factor may document an undue 
influence of the FAOs on the board. Interpre- 
tation of the fourth factor is uncertain. 
Factor V is an MMPI factor. The last factor 
seems to reflect personal strength as assessed 
by life history, social workers (community 
development), psychologists, and psychologi- 
cal tests, but not by peers. 


Validity of the Gough Scales 


Table 7 shows the predictive validities of 
the Gough scales. Only Dominance, Social 
Presence, and Tolerance are significant in the 
total sample. None of the scales, with the 
possible exception of Delinquency, shows 
promise in assessing males. Several measures 
of social attitudes and behaviors show 
substantial validity for the females. 


Comparison with Mischel’s (1965) Findings 


The self-report personality measure (Es) 
common to both studies yields similar validity 


2 One female S was inadvertently omitted from the 
analysis. It should be pointed out that the Gough 
scales used here are not the longer versions con- 
‘taining specially prepared items which Gough later 
developed to measure the same traits in his California 
Psychological Inventory. 
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in each even though the measure in Mischel’s 
study had the possibility advantage of being 
taken under “nonselection” (research) condi- 
tions. The present findings support Mischel’s 
optimism over self-report measures only in the - 
case of females. (Mischel’s Ss were two-thirds — 
male.) The interview yields similarly poor va- 
lidity in both studies. Academic grades (at 
least the “better” ones), peer ratings (ditto), 
and selection board ratings yielded consider- 
ably higher validities in the present study, 

Higher validity in the present study may 
result from several factors: (@) possible con- 
tamination of the criterion in the present 
study; (b) better criterion measures in the 
present study—Mischel’s criterion did not 
permit a reliability estimate and was admit- 
tedly crude; (c) Mischel’s predictive data 
used crude faculty and peer ratings, in that a 
composite was used in each case that may 
have pooled the effective with the ineffec- 
tive; (d) “hurried” conditions under which 
Mischel’s predictors were obtained; and (e) 
preponderance of males in his sample. 

Factors which might have favored higher 
validity in the Mischel study are (a) the 
considerably more intangible nature of the 
criterion performance in the present study 
and (b) less restriction of range of talent by 
deselection and other attrition during training 
in the Mischel study (only 4 of 41). 
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COMPARISON OF THE BINET AND WPPSI WITH 
LOWER-CLASS FIVE-YEAR-OLDS 


JOEN FAGAN,? ELIZABETH BROUGHTON, MILDRED ALLEN, BETTY CLARK 
AnD PATRICIA EMERSON 
Georgia State College 


The Stanford-Binet and WPPSI were administered to thirty-two 5-year-old 
lower-class children, 16 of whom were Negro and 16 white. A correlation 
coefficient of .80 was found between the Binet and WPPSI. Contrary to previ- 
ous research, significant differences were found in the IQ scores obtained from 
the measures, with the Binet mean of 95.2 being eight points higher than the 
WPPSI. No significant sex or race differences were found in IQs, A decided 
preference for the Binet was reported by the examiners on the basis of ease 
of administration, greater scoring clarity, and subjective assessment of accuracy. 


Increased emphasis on early education, 
especially the attempts to reach culturally 
disadvantaged and Negro children 1 or more 
years prior to the beginning of public school, 
has increased the need for adequate intel- 
lectual assessment instruments. The Stanford- 
Binet has been criticized for lack of Negro Ss 
in its standardization sample and its “middle- 
class” orientation which may penalize lower- 
class Ss. An extensive study aimed at pro- 
viding norms for Stanford-Binet (Form L-M) 
performance by Negro school children in the 
Southeast states yielded a mean IQ of 80.6 
(Kennedy, Van deRiet, & White, 1961). The 
publication of the Wechsler Preschool 
and Primary Scale of Intelligence (WPPSI) 
in 1966 made available another instrument 
which ostensibly corrects for the above faults. 
As yet there are few published studies which 
Compare the WPPSI with other individually 
administered intelligence tests. The only pub- 
lished reliability and validity studies are 
found in the WPPSI manual (Wechsler, 
1963), The study reported in the manual 
which correlated the WPPSI with the 
Stanford-Binet was administered to 98 chil- 
dren in a single elementary school in Cali- 
fornia. The mean Full Scale IQ obtained 
ftom the WPPSI was 89.6, and the Binet 
mean IQ was 91.3 (Wechsler, 1963, p. 33). 

he present study was undertaken to com- 


* Requests for reprints should be sent to Joen 
agan, Department of Psychology, Georgia State 


seuss, 33 Gilmer Street, SE, Atlanta, Georgia 


pare the WPPSI with the Binet with a group 
of lower-class Negro and white children. In 
addition to a statistical analysis, the Binet 
and WPPSI were compared on a subjective 
level for ease of administration, scoring, and 
S interest. 


METHOD 


The Ss were 32 children, between the ages of 
4 years, 11 months, and 5 years, 11 months, attend- 
ing day-care centers operated for children of working 
mothers living in low-income housing projects. Of 
these, 16 were Negro and 16 were white, with 8 boys 
and 8 girls in each of the two racial subgroups. The 
age range selected was such that the children would 
enter the first grade the following year. The day- 
care centers included two that were primarily white 
and four primarily Negro. 

The tests were administered by four graduate 
students in psychology who had had prior experience 
with tests used in this study. Each child was given 
the two tests in two sessions from 1 to 5 days apart. 
The order of administration of the WPPSI and the 
Binet was counterbalanced. Standard procedure was 
observed in the administration of the tests with one 
exception, The Geometric Design subtest was found 
to be so time-consuming as to extend the administra- 
tion of the WPPSI to two sessions, and the scoring 
was very difficult and subjective. Hence, the subscale 
was omitted and resulting performance IQs were 
prorated. 


RESULTS 


The means, standard deviations, and dif- 
ferences between means for the WPPSI and 
Binet are shown in Table 1. The highest mean 
IQ (95.2) was on the Binet. The Verbal, Per- 
formance, and Full Scale IQs of the WPPSI 
were almost identical, being between 87.1 and 
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TABLE 1 
Mean IQs on THE BINET AND THE WPPSI AND #’s BETWEEN MEANS 
| | t values 
T N 4M SD 5 
es Y a aR WPPSI WPPSI WPPSI 
|. Verbal Performance | Full Seale l ' 
Binet 32 95.2 13.1 | 5.26* 3554 5.57* 
WPPSI Verbal 32 88.0 11.6 34 1.06 
WPPSI Performance 32 88.5 13/3 | 1.71 
WPPSI Full Scale 32 87.1 | 12.8 


s£ value on difference between means. 
*p <01 


88.5. The Binet score was significantly higher 
($ < .01) than all WPPSI scores. 

Table 2 shows the correlation coefficients 
between the Binet and WPPSI. All correla- 
tions for the group as a whole and also for 
the Negro and white subsamples are signifi- 
cant at the .01 level, with one exception 
with is significant at .05. 

A comparison of the means and standard 
deviations of the Negro and white children 
is shown in Table 3. The differences between 
means are negligible on the WPPSI. The 
Binet does show a 6-point difference in 
favor of the white children, but this does not 
reach significance (t = 1.26). The mean and 
standard deviation for the Binet scores of 
the white children approach those expected 


TABLE 2 
CORRELATION BETWEEN THE Binet AnD WPPSI 


ae 


WPPSI 
WPPSI 
Test Race Wann ee Full 
erba! orm- Si cali le 
ance 

Binet Negro 94 56* 82 
White} .79 74 .80 

Total 81 67 80 
WPPSI Verbal | Negro .67 91 
White 80 94 
Total 73 93 

WPPSI 

Performance | Negro 91 
White 95 
Total 93 


Note.—The correlations, except where noted, are significant 
begona ne «01 level, 


for the general population. The standard 
deviation for the Negro sample on the Binet — 
is consistently smaller than for white Ss, 
No significant differences were found be- | 
tween male and female Ss on the Binet or 
WPPSI Verbal, Performance, or Full Scale 
TQs. In fact, each of these scores showed less 
than a 1-point difference. y 


Discussion 


The Binet, in spite of the criticism found 
in the literature, resulted in IQ scores in the | 
average range for the lower-class children in 
this study. The WPPSI scores averaged 
in the dull normal range. In contrast to 
Wechsler’s results, which showed no signifi- 
cant differences between the Binet and the 
WPPSI for a group of children with a mean 
IQ of 90, the 8-point difference found in the — 
Present study was significant. It would ap- | 
pear, therefore, contrary to expectations, that 


TABLE 3 


Mean Scores or Necro AnD Wurte CHILDREN | 
ON THE BINET AND WPPSI 


Test Race M SD 3 

A A EO SST RN 

Stanford-Binet IQ Negro | 92.3 98 
White | 981 | 154 

WPPSI Verbal IQ Negro | 883 | 10.5 
White | 87.6 | 13.0 

WPPSI Performance 1Q| Negro | 86.6 | 127 
White | 905 | 144 

WPPSI Full Scale IQ Negro | 86,2 | 14.7 
White | 879 | 142 
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lower-class children earn higher IQ scores with 
the Binet than the WPPSI, 

The correlations between the Binet and the 
WPPSI approximate those found with the 
Binet and the WISC (Schachter & Apgar, 
1958), with Verbal higher than Performance. 
The lack of any IQ differences between races 
was a finding of some import, probably due 
to the effort to control socioeconomic factors. 
The lower standard deviations of the Negro 
sample are consistent with the findings of 
Kennedy et al. (1961) on the Binet (SD = 
12.4). The homogeneity of scores probably 
reflects the low education level of the Negro 
parents which contributes to a lessening of 
scores in the upper range. 

The lack of significant differences on the 
intelligence tests between male and female 
Ss was expected. 

Subjective comparison of the WPPSI and 
the Binet was also part of this study. Con- 
ceptually, the examiners recognized the ad- 
vantages of the Verbal and Performance 
division of the WPPSI. However, with respect 
to both administration and scoring, the 
examiners strongly preferred the Binet. The 
lower-class child is frequently shy, nonverbal, 
activity-oriented, and sensitive to failure. The 
frequent switching of materials, the briefer 
verbal demands, and the less apparent failure 
experiences made the Binet more appropriate. 
WPPSI disadvantages included the not infre- 
quent changes in instructions and materials 
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in midtest, and the length of the testing 
session, which, even with the Geometric De- 
sign omitted, usually took more than 1 hour 
and made finishing difficult for many of the 
children, The examiners found several of the 
questions on Comprehension objectionable 
because of ambiguousness or emotional load- 
ings. Having to ask for additional reasons on 
several of the questions made some children 
uncomfortable or resistive. Finally, the exam- 
iners found scoring of the WPPSI Geometric 
Design and Vocabulary difficult, and noted 
lack of scoring clarity on individual items in 
Comprehension. Overall, the children’s en- 
joyment of the Binet, together with the 
administrative advantages, enabled the exam- 
iners to achieve better rapport and led to their 
willingness to accept the Binet IQ score as the 
more accurate. 
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PARTICIPATION IN PSYCHOLOGICAL RESEARCH: 
RELATION TO BIRTH ORDER AND DEMOGRAPHIC FACTORS? 
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Some previous studies have indicated that research which depends on the 
voluntary participation of Ss may be seriously overrepresented by firstborn 
persons, who are thought to differ from later borns on a number of psycho- 
logical dimensions. The results of four independent surveys reported here 
suggest that social class and religion, but not birth order, are factors in selec- 
tive participation. The findings were (a) birth order had no evident effect on 
responding to a mailed questionnaire, either among college professors or stu- 
dents; (b) neither birth order nor sex predicted willingness to participate in an 
“extensive psychological interview”; (c) upper-class Ss were more agreeable to 
an interview than were lower-class Ss ( < .05); (d) independent of Finding c, 
Jewish Ss were more agreeable to an interview than were Protestants and 


: 
DAVID WINER 
| 


Catholics (p= .05). 


Rosenthal (1965) has summarized the so- 
ciological and psychological literature which 
describes characteristics of the volunteer S. 
His article served to reemphasize the point 
that the generality of much research is limited 
to those Ss willing and able to cooperate with 
a given study’s purposes. Caution about S 
selection must be particularly observed where 
the investigator has little control over factors 
affecting participation and, additionally, those 
same factors might influence his results. 

Various investigations suggest that birth 
order may be an important variable in the 
selective participation of Ss. It is contended 
that firstborn persons are more likely than 
later borns to take part in psychological 
studies (Capra & Dittes, 1962; Varela, 1964) 
and to fulfill Æ’s requests (Wuebben, 1967). 
If such findings are reliable, then any study 
which recruits Ss on a voluntary basis may 
be biased by overrepresentation of firstborn 
persons, This is problematic because the 
firstborn person is thought to differ from the 


1The studies were partly financed by the Faculty 
Research and Study Fund at Trinity College of 
Hartford. The data were collected with the help of 
Ethel Brooks and Unis Johnson at Hartford Hos- 
pital School of Nursing, Stanley L. Cohen at George 
Washington University, Harry L. Leonhardt at the 
University of Hartford, Evans Mandes of Hood 
College, and D. A. McCool at Rhode Island College. 

2 Requests for reprints should be sent to Edward 
H. Fischer, Department of Psychology, Connecticut 
Valley Hospital, Middletown, Connecticut 06457, 
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later born on a variety of psychological 
dimensions (Sampson, 1965; Warren, 1966). 

The selective cooperation of Ss became a 
concern for the authors in a recent study of 
the birth-order statuses of college professors. 
The result was a series of four questionnaire 
studies reported here. 


Participation of College Professors in 
Survey Research 


The original study was specifically planned 
to examine a hypothesis about birth order 
representation among college professors. Ques- 
tionnaires requesting birth order and other 
information were mailed to 48 female and 48 
male professors selected randomly from two 
college catalogues. Since only 62% of the 
faculty members returned the first question- 
naire, it would have been impossible to gen- 
eralize about their birth statuses. In light of 
the above-cited research, a high percentage of 
firstborn persons among those responding to T 
the initial questionnaire might have meant 
simply that the firstborn Ss were more con- 
Scientious about completing and returning 
the forms, 

The solution to this methodological problem 
was to insure a complete sample by sending 
a follow-up questionnaire, and then telephon- 
ing the remaining group of nonresponders. 
However, an analysis of the data revealed no 
bias in returning the forms, either by birth 
order or sex. Table 1 shows there was 
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TABLE 1 


NuMBER OF PROFESSORS WHO RESPONDED BY 
QUESTIONNAIRE OR TO TELEPHONE 


Response Firstborn | Later born 
Jnitial questionnaire 33 26 
Follow-up questionnaire 11 7 
Telephone 8 11 


Note.—x? = 1.53, df = 2, p <.50. 


no relationship between birth order and 
the manner of eliciting information from S 
(p < .50). 

It is conceivable that the birth-order bias 
was absent because the Ss, that is, professors, 
differed from those where a birth-order par- 
ticipation effect has been found previously, 
that is, among students. A second study was 
begun, therefore, to expressly test the propo- 
sition that firstborn college Ss are more re- 
sponsive to a mailed survey than are later- 
born college Ss. 


Participation of Students in Survey Research 


Ninety-six upperclassmen from a state 
college were sent a two-page questionnaire; 
ostensibly this was a follow-up to an earlier 
study in which birth-order and attitudinal 
data had been gathered. The birth-order vari- 
able had predicted interest in a college teach- 
ing career for these Ss (Fisher, Cohen, & de 
Milan, 1968). 

Half each of the 56 female and 40 male 
students were firstborn, half were later born. 
The instructions urged S to complete and 
promptly return the form. 

The hypothesis was that more firstborns 
than later borns would return the forms. In 
addition, one item was put in the question- 
naire to test the idea that firstborn persons 
tend to volunteer for events involving social 
affiliation (Dember, 1964; Suedfeld, 1964). 
This item read: “I would agree to an exten- 
sive psychological interview at a later time: 
yes, definitely ___; maybe —; no AN OE 
should be noted that this item occurred in 
the context of questions about teaching 
Careers, a vocation which Fischer, Wells, and 
Cohen (1968) maintain has special interest 
for the firstborn student, Therefore the item 
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was expected to draw more positive responses 
from firstborn than from later-born students. 

The results are summarized in Table 2. 
There was no evidence that firstborns were 
more responsive than later borns. In fact, 
more questionnaires were returned by the 
later borns, although this tendency was unre- 
liable (p < .25). 

Responses to the question about a later 
“extensive psychological interview” were also 
compared for firstborn and later-born Ss. 
While 50% of later-born Ss responded “yes, 
definitely” to the item, only 34.6% of the 
firstborn Ss did so. The difference is not sig- 
nificant (x? = 1.38, df = 1, p < .25), and the 
trend reverses the prediction. 


Soliciting Volunteers for Research 


A third survey was conducted in order to 
make the conditions for participation similar 
to the recruitment procedure used by research 
psychologists, and more similar to the condi- 
tions under which a relationship between 
birth order and volunteering has been found 
previously. 

Recruitment forms were handed out in two 
undergraduate psychology classes to a total 
of 142 male liberal arts students. The Ss 
were told, and also read, in their instructions: 


Two psychological research programs are being 
planned for the near future. We would like to 
identify persons who would be willing to take part 
in the studies. One of the studies will involve an 
extensive psychological interview dealing with per- 
sonal crises faced by college-age people; the second 
project will involve standardization of a psycho- 
motor (hand-eye coordination) task and the rela- 
tionship of performance to personality variables. 
Each would take about one hour. Please indicate 
your interest in participation. 


The students were asked to provide their 
names, campus addresses, and information on 


TABLE 2 


NUMBER OF COLLEGE STUDENTS RETURNING 
QUESTIONNAIRES BY BIRTH ORDER 


Response Firstborn | Later born 
Returned 26 32 
Did not return 22 16 


Note—x? = 1.57, df = 1,0 <.25. 
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TABLE 3 


SUBJECTS’ EXPRESSED WILLINGNESS TO BE 
INTERVIEWED By SOCIAL CLASS 


Epwarp H. FISCHER AND Davip WINER 


TABLE 4 


SUBJECTS” EXPRESSED WILLINGNESS TO BE 
INTERVIEWED BY RELIGION 


Socioeconomic class 


Response 
I,II m IV, V 
Definitely 49 38 41 


Maybe or no 30 45 52 


Note,—x? = 6.45, df = 2, p <.05, 


birth order, family size, social class,’ and 
religious background. They then checked a 
category indicating their willingness to volun- 
teer for one of the studies. 

Analyses of the birth-order and family size 
data produced no significant effects, Again, 
slightly more later-born than firstborn Ss ex- 
pressed definite interest in participation (56% 
and 47%, respectively, x? = 1.15, df=1, 
p> .25). Students from large (four or 
more children) as compared to small 
(three or fewer children) families did not 
differ in willingness to volunteer for research 
(x? = .67, ns). 

However, there were trends suggesting that 
both socioeconomic class and religion are re- 
lated to volunteering. Over 50% of the Ss 
coming from Classes I, II, and III (upper- 
and middle-class groups) expressed definite 
interest in participation, while less than 30% 
of Ss from Classes IV and V did so. A chi- 
square test of the frequencies approached sig- 
nificance (x? = 4.72, df = 2, p < 10). Also, 
Jewish Ss appeared to be somewhat more 
favorable to the idea of research participation 
than Protestants or Catholics, although the 
frequency differences were not statistically 
reliable. 

A final study, using a larger N, bore out 
these trends, 


Demographic Variables and Volunteering 
for Research 


The item concerning an “extensive psycho- 
logical interview” was presented once more 
in a study of 256 nursing school and college 
students. The context was an anonymous 

3 Social class was indexed by Hollingshead’s (1957) 


criteria applied to the occupation and education of 
‘S's father. 


Religion 
Response 
Jenin | Reames | Pa 
Definitely 27 41 47 
Maybe or no 13 49 53 


Note.—x? = 5.98, df = 2, p = .05. 


questionnaire which sought information about 
emotional problems of students. In view of 
the earlier research which suggests a corre- 
spondence of primogeniture and need for 
affiliation (Dember, 1964; Schachter, 1959), 
it was again expected that more firstborn 
than later-born Ss would be receptive to the 
interview. It might be said the item was 
loaded to attract the firstborn person. In 
addition to birth status, data were obtained 
on S’s socioeconomic class and religion. 

Chi-square analyses showed no effect of 
birth order (x? = .01, ms) or sex (x? =.97, 
df=1, p< -50) on willingness to have the 
interview. But both the social class and 
religion variables were significantly related to 
agreement to the interview. 

As can be seen in Table 3, the majority 
of Ss from the highest socioeconomic groups 
(I and If) expressed definite agreement to the 
interview, whereas this was not generally so 
for lower socioeconomic classes (IV and V). 
A chi-square test of the frequencies in Table 3 
showed significance at the .05 level. 

Table 4 shows that Jewish Ss were signifi- 
cantly more likely to agree to an interview 
than were Protestant and Roman Catholic Ss 
(= .05). A breakdown of the Jewish group 


TABLE 5 


Jewisx Susyects? WILLINGNESS TO BE 
RVIEWED BY SOCIOECONOMIC CLASS 


Socioeconomic class 


Response 
i It Iv,V 
tn er a ate Mase Beso 
Definitely 10 11 6 
Maybe or no 7 3 3 
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by social class (Table 5) indicated that the 
finding with respect to religion was inde- 
pendent of the finding regarding social class. 
That is, Jewish Ss were fairly uniform and 
positive in their agreement to an interview 
regardless of their socioeconomic class. This 
was clearly not the case for Protestants and 
Catholics. 


CONCLUSIONS 


The overall results of the four independent 
surveys corroborate the conclusions of Ward 
(1964) and Wilson and Patterson (1965) that 
a birth-order bias ordinarily need not concern 
the investigator. Birth order did not appear 
to be a factor in responding to a mailed 
questionnaire, either among college professors 
or students. Further, there was no evident 
birth-order selection effect in volunteering for 
psychological research even when the recruit- 
ment form was slanted to attract firstborn 
persons (e.g., implying that the volunteer 
would engage in extensive interaction with a 
psychologist). Where slight differences did 
exist, it was the later born who showed some- 
what more tendency to participate than the 
firstborn person. It may be that a significant 
birth-order effect occurs only when Ss are 
Pressured to volunteer, such as in face-to-face 
recruitment (Capra & Dittes, 1962) or where 
they can gain points in a psychology course 
by participating in experiments (Wolf & 
Weiss, 1965). 

The present data do indicate that at least 
for certain types of study, the investigator 
who uses volunteer Ss may be drawing dis- 
Proportionately on upper-level social status 
groups and on Jewish Ss. The social class 
effect may be explainable in terms of “‘better” 
Psychological-mindedness among high social 
Status persons. The effect of religion is more 
difficult to explain, but it may be related to 
Rotter’s (1967) finding that Jewish students 
express more interpersonal trust than Protes- 
tants and Catholics. In the current academic 
Setting where “psychological research” may 
have many dubious connotations, a funda- 
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mental trust of professionals and existing 
institutions may be requisite to volunteering 
for research, 
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Each participant in 45 patient-therapist pairs completed a structured response 
questionnaire after each of a series of individual psychotherapy sessions. In 
these questionnaires they reported on the feelings experienced by each during 
the session. A method of causal analysis of nonexperimental data was applied 
to these reports. These analyses suggested that for relatively inexperienced 
therapists, the patients’ positive feelings were a sufficient condition for the 
therapists’ positive feelings. The reverse was the case for more experienced 
therapists. For negative feelings, however, therapists’ feelings of uncertainty 
and apprehension were generally a sufficient condition for the patient to experi- 
ence dysphoric affect. Extremely positive or negative affective experiences of 
either participant tended to preclude and be precluded by opposite experiences 
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in the other. 


The feelings that patients and therapists 
experience during a psychotherapeutic session 
are events of central importance to the treat- 
ment process. Exploration of affect is a prime 
task of the patient and affective cues are used 
by the therapist in understanding and relating 
to the patient. Even aside from these 
functions, of course, the psychotherapy rela- 
tionship is usually affectively alive for the 
two participants. 

In a previous paper evidence was presented 
which indicated that patient and therapist 
share the dominant mood or affective tone of 
a psychotherapeutic session, that is, there 
tends to be a commonality of affective experi- 
ence (Howard, Orlinsky, & Hill, in press; 
Lennard & Bernstein, 1960). It does not 
seem reasonable, however, that common af- 
fective experience merely arises independently 
or spontaneously for both participants in psy- 


1 An extended version of this paper was presented 
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2 Also at the Institute for Juvenile Research. 
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Howard, Institute for Juvenile Research, 232 East 
Ohio Street, Chicago, Illinois 60611, 

3 Also at the Institute for Juvenile Research. 


chotherapy. Most likely, for a particular pair 
in a particular session, either the patient or 
the therapist affectively influences the other, 
so that the feelings of one are responded to 
affectively by the other, 

The research reported in this paper is aimed 
at providing a better understanding of affec- 
tive processes in individual psychotherapy. 
More specifically, it is concerned with the 
direction of affective influence. Two distinct 
and, in effect, contradictory positions have 
been reported in the literature. Fiedler 
(1953), Truax (1963), Wilson and Hannon 
(1968), Rausch and Bordin (1957), and 
Patterson (1967) all stress the prepotence of 
the therapist’s feelings. Their research seems 
to indicate that the therapist sets the affective 
tone of the relationship. On the other hand, 
Heller, Myers, and Kline (1963), Bohn 
(1965), Snyder (1963), and Moos and 
Clemes (1967) all stress the reactivity of the 
therapist’s feelings. Their research seems to 
indicate that the patient sets the affective 
tone and the therapist reacts to the patient’s 
feelings. This paper, using an adaptation of 4 
method discussed by Blalock (1964), in- 
vestigates the evidential support for, or 
contradiction of, these two positions. 

The ideal way to carry out such an investi- 
gation would be to conduct a controlled ex- 
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periment in which one participant’s affect 
were somehow manipulated and the other’s 
affective responses were assessed. The proba- 
tive power of this experiment, perfectly per- 
formed, derives from the control over the 
time sequence of changes in the indepen- 
dent and dependent variables and the sto- 
chastic minimization of external confounding 
(Krause, 1967). Experiments are basically 
intended to support or contradict propositions 
of the form, “A is a sufficient condition 
for B,” that is, whenever A occurs B will 
occur, although B may occur in the absence 
of A. Without control of the time sequence, 
the evidence for A as a sufficient condition 
for B cannot be distinguished from the evi- 
dence for B as a necessary condition for A. 
This latter proposition means that A will only 
occur when B has occurred, although B may 
occur at other times. If the time sequence 
between A and B cannot be specified, the 
choice between these different causal proposi- 
tions must be made on other evidence, for 
example, plausibility. This is precisely what 
happens in the case of causal analyses of non- 
experimental data (Blalock, 1964). 

It should be noted, however, that several 
causal propositions are contradicted by any 
data which support (equally) the two men- 
tioned propositions. We can construe the data 
as consisting of a simple count of the joint 
occurrence of the various combinations of A, 
not-A, B, and not-B (a fourfold table). If 
we find that cells A,B, not-A,B, and not-A, 
not-B are occupied and the cell A,not-B is 
hot occupied, these results would support the 
two propositions mentioned in the previous 
paragraph (see Figure 1). Figure 1 shows 
that B occurs in the presence or absence of A, 
but that A occurs only in the presence of B. 
A is thus a sufficient but not necessary condi- 
tion of B, or B is a necessary but not suf- 
ficient condition of A. However, the following 
additional propositions are contradicted by 
these results: (a) A is a necessary and suf- 
ficient condition for B; (6) B is a necessary 
and sufficient condition for A; (c) B is a 
sufficient condition for A; and (d) A is a 
necessary condition for B. Thus, causal analy- 
sis of nonexperimental data can at least limit 
the alternative explanations for that set of 
data (Blalock, 1964; Krause, 1967). It also 


not-B 


not-A 


Fic. 1. Evidence consistent with the proposition that 
A is a sufficient condition for B. 


represents a more complete use of correla- 
tional data. 

As noted above, experiments generally are 
concerned with documenting the sufficiency of 
an independent variable, A, not-A, for pro- 
ducing a certain set of states of the depen- 
dent variable, B, not-B. The corresponding 
necessity proposition, that is, that B is neces- 
sary for A, is ruled out by the physical control 
imposed. For nonexperimental data the choice 
between such corresponding propositions of 
necessity and sufficiency is generally based 
on plausibility. Regarding the analysis of 
affective influence, the choice was based on 
the following considerations. It seems im- 
plausible that the therapist’s mood is a 
necessary condition for some affective state of 
his patient, for example, that the patient can 
never be uncommonly sad unless the therapist 
is uncommonly sad. The necessity of certain 
patient moods for therapist affective experi- 
ence seems even more implausible, for ex- 
ample, that the therapist can never feel 
uncommonly good unless the patient feels 
uncommonly good. These notions of implausi- 
bility derive both from clinical experience and 
Knowledge of the volatility and multiple 
causation of affective states. 

In the light of the preceding considerations, 
the most plausible rendering of the two posi- 
tions (hypotheses) concerning direction of 
affective influence in psychotherapy would be: 
(a) therapist feelings are a sufficient condi- 
tion for patient feelings; and (b) patient 
feelings are a sufficient condition for therapist 
feelings.* An examination of the empirical 


4Data that are consistent with Hypothesis a are 
also consistent with the proposition that (c) the 
absence of a certain therapist feeling is sufficient for 
the absence of the corresponding patient feeling. Data 
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support (based on the data of the Psycho- 
therapy Session Project) for each of these 
hypotheses is the major focus of this paper. 


METHOD 


As part of a larger study, a panel of patients and 
therapists completed structured-response question- 
naires immediately after each of a series of psycho- 
therapy sessions. The questionnaires (Therapy Ses- 
sion Report), methods of data collection, and 
samples of therapists and patients have been de- 
scribed elsewhere (Howard et al., in press; Howard, 
Orlinsky, & Hill, 1969; Orlinsky & Howard, 1966; 
Orlinsky, Howard, & Hill, 1969). 


Sample 


The sample employed for the analyses reported 
in this paper consisted of 45 patient-therapist pairs. 
The members of each pair completed Therapy Session 
Reports (TSRs) after from 5 to 58 psychotherapeutic 
sessions, creating a pool of 893 sessions for which 
there were joint reports. 

The 45 patients were female and most were single 
and employed. They ranged in age from 19 to 60, 
but the majority were under 30. Fifteen percent 
were nonwhite, 88% had at least a high school edu- 
cation (65% had attended college), and 59% had had 
some previous psychotherapy. All patients were being 
treated in once-weekly individual psychotherapy, each 
session scheduled for 45 minutes. 

Of the 15 therapists in this sample 13 were male. 
The therapists had been trained in psychiatry, clinical 
psychology, or psychiatric social work; their median 
experience level was 6 years. Over half of the thera- 
pists had had personal therapy. Their theoretical 
orientations were generally “dynamic-eclectic” in that 
no therapist represented a single school or theory. 
Most therapists reported on sessions for from 1 to 3 
patients, but two therapists reported on as many as 
15 patients. 


Instruments 


There are two parallel forms of the TSR, one for 
patients and the other for therapists. Within each 
form are two sections which survey the feelings 
experienced by the participants during the session. 
An early section asks each to report how the patient 
felt, and provides a list of 33 feelings. A later section 
asks each to report how the therapist felt and pro- 
vides a list of 25 feelings. For each feeling the re- 
spondent is instructed to circle a number (0= No0; 
1= Some; 2=A Lot) to indicate the intensity of 
its presence during the session. 

In a previous paper (Howard et al., in press), the 
results of factor analyses of these feeling lists were 


consistent with Hypothesis b are also consistent with 
(d) the absence of the corresponding therapist feel- 
ing. Both c and d seem quite implausible with regard 


to fe like happiness and sadni it 
a feelings ! ppi sadness, which shall 
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reported. Although a more highly differentiated struc- 
ture could be defined, there was considerable support 
for a two-factor structure of each list. The two 
factors closely resembled the positive affect and 
negative affect dimensions reported by Snyder (1961), 
Patients’ positive affective experience (P+) was 
essentially defined by the following feelings: grateful, 
relieved, hopeful, pleased, relaxed, confident, affec- 
tionate, and sexually attracted. Therapists’ positive 
affective experience (T+) was mainly constituted by 
optimistic, cheerful, playful, effective, pleased, sym- 
pathetic, close, and affectionate. Patients’ negative 
affective experience (P—) was essentially defined by 
angry, impatient, cautious, inhibited, withdrawn, 
frustrated, tearful, depressed, inadequate, anxious, 
guilty, strange, and embarrassed. The feelings which 
comprised therapists’ negative affective experience 
(T—) were apprehensive, perplexed, unsure, and 
demanding. 

For each session reported on by each participant, 
four affect scores were computed: patients’ positive 
and negative affect (P+ and P—) and therapists’ 
positive and negative affect (T+ and T—). These 
scores were simply the sums of the participant’s re- 
sponses to the feeling words in each composite, and 
represent the data base for statistical analyses. 

Three pairings of judgments of positive or negative 
affective process could be derived. Patient-perceived 
positive affective process was embodied in her reports 
of P+ and T+. Therapist-perceived positive affec- 
tive process could be derived from his reports of 
P+ and T+. Finally, “actual” positive affective 
process could be obtained from the patient’s reports 
of P+ and the therapist’s reports of T+. The same 
three pairings were possible for negative affect. The 
analytic model for all comparisons, however, was 
the same, 

The major purpose of the analyses was to discover 
which, if either, of the two causal propositions re- 
garding direction of affective influence was consistent 
with the empirical data. The same procedure was 
followed for each of the six possible comparisons 
outlined above. For purposes of exposition “actual” 
positive affective process will be used to illustrate 
the method of analysis. The analyses were performed 
for each patient-therapist pair, separately, over their 
common sessions, 

The basic idea was to construct a fourfold table 
consisting of the frequency of joint occurrence of 
the four conditions: P+,T+; not-P+, not-T+; 
P+, not-T+; and not-P+, T+. If both off-diagonal 
cells (not-P+, T+ and P+, not-T+) were empty, 
this would be consistent with both hypotheses. If 
only not-P+, T+ were empty, this would be con- 
sistent only with the therapist influence (TI) inter- 
pretation, since the therapist’s feeling could occur 
without the patient’s but not vice versa. Finally, if 
P+, not-T+ were the only empty cell, only the pa- 
tient influence (PI) hypothesis would be consistent 
with the data, since the therapist would only feel 
good when the patient felt good. 

The data, however, were not dichotomized in 
terms of the presence or absence of T+ or P+; the 
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scores were continuous. Consequently, some criterion 
for dichotomization was necessary in order to create 
the basic fourfold tables required by the analytic 
model. Since the purpose of this analysis was to 
differentiate between causal propositions about the 
direction of affective influence, a criterion was needed 
that would maximize the chances of distinguishing 
cases of therapist predominance from cases of patient 
predominance. To this end, the criterion of “the 
largest empty off-diagonal quadrant” was devised. 
The implementation of this criterion was as follows: 
(a) For each patient-therapist pair, T+ and P+ 
scores were cross-tabulated over their common ses- 
sions.5 (b) These scatter plots were inspected to 
determine the largest off-diagonal area, including a 
corner, which had no entries. (c) A fourfold table 
was constructed using the cutoff points indicated 
by Step b. Finally, (d) the fourfold table was in- 
spected to determine which of the causal propositions 
was consistent with the results. 

The reader should note that this method of 
dichotomizing the data is based on the assumption 
of a certain underlying model for data generation. 
(a) The data for each patient-therapist pair are 
assumed to be an unbiased sample from a longer 
finite series of their interviews, that is, trends in 
direction of influence are precluded, but (b). each 
datum point is assumed to be free of measurement 
error. Thus, we assume a population distribution in 
cluster shape rather than (as has become conven- 
tional) a population line with measurement error. 
In the method of dichotomization, two points can 
determine the frame for defining quadrants, and 
single points can determine what shall be the largest 
empty quadrant. Under Assumptions @ and b, these 
characteristics of the method are quite reasonable. 
Finally, (c) it is assumed that one of the two op- 
posite hypotheses is true: either the therapist’s 
feeling is a sufficient condition for the patient’s or 
Vice versa. These assumptions then imply that what- 
ever dichotomization yields the best discrimination 
between the two hypotheses, in c, is the best esti- 
mate of the true or latent dichotomization, though 
it is subject to sampling error. (Note that there is 
no assumption here of the same hypothesis being 
true for all patient-therapist pairs.) 


RESULTS 


The above procedures were carried out on 
the data from each of the 45 patient-therapist 
Pairs for the three comparisons involving posi- 
tive affect and the three involving negative 
affect. In this section the focus is on the 
Implications of these results for therapy 
Process in general. The next section explores 


* The authors most gratefully acknowledge the dili- 
Sent assistance of Craig Barton, Len Sushinsky, Jim 
ttner, and Howard Ullan in performing this part 

Of the data analysis, 
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the differences in direction of influence associ- 
ated with patient and therapist characteristics. 


Positive Affect 


Comparisons of the therapists’ reports of 
positive feelings with the patients’ reports of 
positive feelings did not reveal predominant 
support for either of the hypothesized causal 
relationships. Seventeen pairs were consistent 
with the therapist influence (TI) hypothesis, 
19 pairs were consistent with the patient in- 
fluence (PI) hypothesis, and 9 pairs equally 
supported both. The perceived feeling process 
of the patients (their reports of their own 
positive feelings compared to their reports of 
their therapists’ positive feelings) yielded 
essentially the same results: 11 TI, 19 PI, and 
15 indeterminate. The perceived feeling proc- 
ess of therapists (their reports of their own 
positive feelings compared with their reports 
of their patients’ positive feelings) was more 
consistent with the patient influence hypothe- 
sis: 9 TI, 23 PI, and 13 indeterminate. Thera- 
pists, thus, tended to see patients’ positive 
feelings in a manner consistent with their 
being sufficient for the therapists’ own experi- 
ences of positive feelings. But the convergence 
of the three perspectives was weak, indicating 
that no general statement could be sustained 
for patient-therapist pairs. 


Negative Affect 


The comparisons involving negative (dys- 
phoric) feelings revealed a more consistent 
trend. Analysis of patients’ reports of nega- 
tive feelings and therapists’ reports of nega- 
tive feelings supported the therapist influence 
hypothesis: 26 therapist influence, 9 patient 
influence, and 10 indeterminate. The results 
obtained from the comparisons involving 
patient-perceived negative feeling process 
were: 27 TI, 10 PI, and 8 indeterminate. 
Therapist-perceived negative feeling process 
was also consistent with these results: 29 TI, 
10 PI, and 6 indeterminate. Binomial tests of 
the splits between patient influence and thera- 
pist influence were statistically significant for 
all three comparisons, beyond the .01 level. 
For negative affect, then, it appeared that the 
presence of negative affective experiences for 
the therapist may usually be sufficient to pro- 
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vide a negative affective experience for the 
patient; that is, although the patient may feel 
unusually bad in any particular session, she is 
considerably more likely to feel bad if the 
therapist does. 

The degree of case-by-case agreement be- 
tween the three perspectives, patient’s (P), 
therapist’s (T), and “actual” (A), was 38% 
(PT), 36% (PA), and 42% (TA) for posi- 
tive feelings and 58% (PT), 53% (PA), and 
56% (TA) for negative feelings. None of the 
six 3 X 3 tables from which these values were 
computed had a significant chi-square of 
association at the .01 level. Thus, the three 
perspectives may not be considered convergent 
means of assessing the same facts of influence. 
Because the “actual” perspective compares 
the results of two independent assessments, 
each by the person who has the best informa- 
tion about how he feels, we are inclined to 
treat that data as most valid for this study. 

Although the results differed for positive 
and negative feelings, direct comparisons re- 
vealed an interesting interdependency. When 
fourfold tables were constructed comparing 
patient negative feelings with therapist posi- 
tive feelings (P— vs. T+) and patient posi- 
tive feelings with therapist negative feelings 
(P+ vs. T—), there was strong evidence that 
the experience of a marked degree of one type 
of feeling (e.g., positive affect) for one par- 
ticipant precluded the experience of an ex- 
treme degree of the other type of feeling (e.g., 
negative affect) by the other participant. If 
one felt especially bad, the other tended not 
to feel especially good; if one felt especially 
good, the other tended not to feel especially 
bad. Unfortunately, both the necessity and 
the sufficiency interpretations of these find- 
ings are equally plausible, that is, the follow- 
ing two statements seem equally defensible: 
(a) the therapist cannot feel especially effec- 
tive unless the patient does not feel especially 
bad; (b) the patient will not feel especially 
bad when the therapist feels especially effec- 
tive. However, the results are inconsistent 
with propositions that (c) the absence of a 
particular extreme affective experience for one 
participant is sufficient to elicit the presence 
of the opposite extreme affective experience 
for the other, and (d) the presence of a par- 
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ticular extreme affective experience for one 
participant is necessary for the absence of the 
opposite extreme affective experience of the 
other. 


Results of Individual Differences Analyses 


Clearly there were individual differences as 
to which participant was affectively predomi- 
nant. The purpose of the analysis in this sec- 
tion was to determine what individual char- 
acteristics of patients or therapists were dif- 
ferentially associated with patient or thera- 
pist affective predominance. 

In all, 10 patient characteristics were ex- 
plored: age, education, marital status, re- 
ligion, family size, birth order, parental sepa- 
ration, age at familial disruption, diagnosis, 
and previous psychotherapeutic treatment. 
The 11 therapist characteristics investigated 
were: age, religion, social class of origin, mari- 
tal status, parental status, sex, family size, 
birth order, profession, experience, and per- 
sonal psychotherapy. Chi-square was employed 
to screen these 21 personal characteristics for 
significant associations with the various pat- 
terns of affective influence. Since the purpose 
of this analysis was exploratory, the .10 sig- 
nificance level was employed in screening. 
Only entries which contributed a dispropor- 
tionately large amount to a significant chi- 
square will be discussed. 

No patient characteristic was significantly 
differentially associated with direction of in- 
fluence in either affective process. No thera- 
pist characteristic was differentially associ- 
ated with direction of influence in negative 
feeling process, but three therapist character- 
istics were related to positive process. These 
were (a) experience (p < .01), (b) age ($ 
< .10), and (c) birth order (p < .10). For 
those pairs where the evidence was consistent 
with the proposition that the patient’s posi- 
tive feelings were sufficient for the therapist 
to experience unusually positive feelings (PI), 
74% of the therapists had had less than 6 
years’ experience. For the TI pairs, on the 
other hand, 76% had had 6 or more years’ 
experience in the practice of psychotherapy: 
Seventy-nine percent of the therapists in PI 
Pairs were under 35 years of age, while 69% 
of the TI therapists were 35 or older. Finally, 
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only 11% of the PI therapists had at least 
one older same-sex sibling, while 41% of the 
TI therapists had at least one older same-sex 
sibling in their family of origin. On the whole 
the 21 independent variables available in this 
study were not strongly predictive of affective 
predominance, with the exception of therapist 
experience. 


Discussion 


When a patient and her therapist indicate 
how they felt in each immediately preceding 
therapy session, they give evidence bearing 
on which of them is the more affectively influ- 
ential. It was found that for most of the 
patient-therapist pairs, the therapist was the 
more influential of negative feelings. This 
seems quite understandable, because the pa- 
tient, but not the therapist, is supposed to be 
feeling badly. The patient’s bad feelings are 
part of his task and are no reason for a com- 
petent therapist to actually feel badly him- 
self, certainly no reason for feeling inade- 
quate as a therapist. However, if he does 
feel this way, that is good reason for the 
patient to feel negative in and about her 
therapy. She has presumably come preoccu- 
pied. with her problem and expecting help 
from an expert. If he feels badly, that is, in- 
adequate, then how can she hope to get help 
from him? If he feels good, that may be re- 
assuring but not necessarily enough to raise 
her spirits, 

A second finding, that for less experienced 
(and younger) therapists, patients’ positive 
feelings are more likely to be sufficient for 
them to feel good, also seems quite under- 
standable. A less experienced therapist is more 
likely to be responsive to his patients’ posi- 
tive feelings, because he needs evidence of his 
Competence and of a good prognosis. Patients’ 
Positive feelings are such evidence. More ex- 
Perienced therapists probably tend to have a 
More secure concept of themselves as thera- 
Pists, and so are less affectively responsive to 
their patients in this way. It may also be that 
being more secure, they can convey a more 
Clear and certain sense of their extreme posi- 
tive affect than can the less experienced thera- 
pists. 

How a therapist feels about his treatment of 
a patient and the effect of these feelings on 
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the patient may be manipulable factors. If 
therapists could be made to feel unconfident 
and unsuccessful, for example, through critical 
supervision, stronger evidence of the suffi- 
ciency of their negative feelings for patient 
negative feelings could be had. Similarly, if 
patients could be made to feel more positively, 
for example, through an assessment inter- 
view, the second finding could be strengthened. 
Without such manipulations, of course, we 
must rest on the more inferential evidence 
available from the sort of retrospective data 
of this study or the somewhat better evidence 
derivable from process analysis of single ther- 
apy sessions. In the meantime, it would seem 
advisable for therapists to experiment with 
various ways of handling their dysphoric ses- 
sions. Perhaps canceling such sessions, clari- 
fying the patient’s perception of her thera- 
pist’s mood, taking special notice of the affec- 
tive interaction, or consciously managing one’s 
affective display would yield better results on 
such hours. Surely the treatment must be 
adapted to the therapist as well as to his 
patient. 
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PREDICTION ENHANCEMENT THROUGH THE USE OF 


MODERATOR VARIABLES 


JAMES E. COWDEN+ 
Wisconsin Department of Health and Social Services 


In order to increase the ability of clinicians to predict the subsequent behavior 
of inmates in a juvenile correctional institution, moderator variables (i.e. 
variables which were correlated with degree of predictability of inmates as 
measured by the relative magnitudes of the difference (D) scores between 
predictor and criterion variables) were developed using a construction sample of 
76 Ss. These moderator variables were then used to remove less predictable Ss 
from a cross-validation sample, resulting in a moderate increase in the corre- 
lation between predictor and criterion variables when one-third of the least 
predictable Ss were removed, and a more substantial increase when one-half 
of the least predictable Ss were similarly removed. Hence, this method showed 
considerable potential value as a means of facilitating more accurate classifica- 


tions of inmates. 


The problem of how to increase the ability 
to predict a given criterion is traditionally 
resolved by increasing the reliability and 
validity of the predictor and criterion vari- 
ables, by more precise measurements of these 
variables, or by increasing the number of pre- 
dictors. In most cases, the use of these tech- 
niques insures at least a moderate level of 
Prediction. However, when not more than two 
or three predictors of moderate reliability and 
validity are available, and it is important to 
achieve the maximal possible degree of pre- 
diction, other means of enhancing prediction 
must be found. Ghiselli (1960, 1963) has 
concerned himself with this problem, with the 
goal of predicting as accurately as possible 
which employees would subsequently show the 
best job performance. He pointed out that 
except in the cases where correlations between 
predictor and criterion variables are quite 
high, there is typically a significant variation 
between Ss in difference (D) scores between 
Predictor and criterion scores. For any given 
set of predictor and criterion variables, indi- 
viduals could thus be differentiated along a 
dimension of predictability, as measured by 
the magnitudes of these D scores. If other 
tests or ratings were then given to the same 
Ss and found to be correlated with these D 


*Requests for reprints should be sent to James 
Cowden, Department of Health and Social Ser- 
Vices, Division of Corrections, 1 West Wilson Street, 
‘adison, Wisconsin 53701. 


scores, these ratings or tests could be used to 
select subsequent samples of Ss (particularly 
if similar to the original sample) having rela- 
tively small D scores. Using this technique, 
Ghiselli (1956) was able to select certain 
Ss from a cross-validation sample who were 
presumed to be more predictable than average. 
In support of his hypothesis, he found that 
the correlation between predictor and criterion 
variables was significantly higher for this 
subsample than for the remaining Ss in the 
sample. 

In the present study, an attempt was made 
to use the above techniques in somewhat 
modified form to assess their efficacy in en- 
hancing the prediction of institutional and 
postrelease adjustment of delinquent boys. 
Clinicians in correctional institutions are typi- 
cally asked to classify and make recommenda- 
tions about newly admitted inmates based on 
judgments as to their probable subsequent 
adjustment within the institution. The results 
of previous studies conducted at two Wis- 
consin correctional institutions for delinquent 
boys suggested that age at first admission to 
the institution and global prognostic ratings 
by clinicians and social workers were signifi- 
cantly related to institutional and postrelease 
adjustment (Cowden, 1966, Cowden & Pacht, 
1967). While these two variables were among 
the best predictors of the subsequent adjust- 
ment of delinquent boys, the magnitudes of 
the correlations between the individual pre- 
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dictor and criterion measures were not very 
high (within the range of .26 to .54). Adding 
other predictors (using a multivariate ap- 
proach) increased the multiple correlation co- 
efficient significantly, to about .68. However, 
this approach did not appear very feasible 
mainly because the process of assigning 
weights to 6-10 predictors and using them in 
combined form to predict adjustment was 
found to be too complicated for routine clas- 
sification purposes. Second, the multiple cor- 
relation coefficients tended to show a signifi- 
cant decline when the multiple-regression 
equations were used on new (cross-valida- 
tion) samples, most likely because the use 
of multiple predictors allowed a variety of 
sources of error to operate. These considera- 
tions suggested that more attention might 
profitably be focused on methods of increasing 
the predictive value of individual predictors, 
as opposed to using a multiple regression 
approach, Hence, an attempt was made in 
this study to enhance the prediction of insti- 
tutional and postrelease adjustment through 
selecting more predictable Ss. 


METHOD 


Minnesota Counseling Inventory (MCI) scale 
scores (Berdie, 1957) and ratings of personality, 
social, and demographic variables derived from 
case folders of 152 consecutive first admissions to 
the Wisconsin School for Boys at Wales, Wisconsin, 
were used as sources of data, This sample was typical 
of the population of boys institutionalized as delin- 
quents at juvenile correctional institutions in Wiscon- 
sin (as well as in juvenile correctional institutions in 
most other states). All boys in Wisconsin between the 
ages of 12-18 who are committed as juvenile delin- 
quents to the State Department of Health and Social 
Services must be received at the Reception Center of 
the Wisconsin School for Boys at Wales, Wisconsin 
(the institution from which the sample was drawn). 
Approximately half of the boys are then retained 
at the School for Boys at Wales and half are trans- 
ferred to a similar juvenile correctional institution, 
the Kettle Moraine Boys School at Plymouth, Wis- 
consin. The boys at these two institutions are very 
similar in most respects, though more seriously dis- 
turbed boys tend to be retained at Wales. Statistics 
compiled by the Bureau of Research (June 1967 
and September 1967) reveal the following charac- 
teristics of the boys in this population during 1966. 
The median age of boys at the time of their first ad- 
mission was 15.9, they had completed 8.8 grades in 
school, their achievement grade level (California 
Achievement tests) was slightly below the seventh 
grade level, and 34.2% were nonwhite. The major 
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offenses for which they had been committed in- 
cluded auto theft, (28%), burglary (23%), truancy 
(11%), uncontrollability (10%), theft (9%), ag. 
gravated assault (5%), and sex misconduct (4%), 
Sixty-eight percent had been involved with one or 
more companions in their delinquent behavior, while 
32% had no such companions. Their mean length 
of stay within the institution in 1966 was 4.5 months 
and their median age at release was 16 years, 3 
months. Fifty-one percent of those in the sample 
came from homes in which both parents were mar- 
tied, while the remainder came from homes in 
which the parents were separated, divorced, not 
married, or widowed. Seventy percent of the boys 
came from lower socioeconomic class levels, while 
30% came from middle-class backgrounds. Fifty-five 
percent of those released in 1966 were released for 
the first time from a juvenile correctional institution. 

The sample assessed in this study appeared to be 
representative of the above population of delinquent 
boys in the characteristics described. Seventy-six Ss 
of the 152 in the sample were then randomly as- 
signed to the construction sample. Thirty-seven pre- 
dictor variables and several criterion measures of 
institutional and postrelease adjustment were then 
intercorrelated. Three predictor variables showing at 
least a moderate relationship with one or more of 
the criterion measures were then selected. The dis- 
tributions of these predictor variables and the cor- 
responding criterion variables were then transformed 
into standard (Z) distributions with means of 50 
and standard deviations of 10. The predictor and 
criterion score distributions were thus equally 
weighted in constructing D-score distribution, con- 
structed by subtracting the criterion Z scores (Ze) 
of each S from his predictor Z scores (Zp). The 
signs of the D scores were disregarded in deriving 
mean D scores, since either positive or negative D 
scores constituted errors in prediction. 

The data available for each S then included three 
D scores in addition to his 37 predictor ratings and 
three criterion ratings. These three D scores were 
then correlated with the 37 predictor variables in 
order to select the three predictor variables which 
correlated most highly with the D scores, that is, 
which of these predictor variables functioned most 
effectively as moderators. 

Three variables were selected in this manner to 
function as moderators for each of the three 
predictor-criterion combinations, These three moder- 
ator variables were then used in turn to subdivide Ss 
in the cross-validation sample into subgroups show- 
ing different degrees of predictability. Correlation c0- 
efficients as well as mean D scores were then calcu- 
lated between the three predictor and criterion 
variables in turn for the following: the entire cross- 
validation sample; the two-thirds of the cross-valida- 
tion sample which was most predictable, according 
to their scores on the moderator variable used; and 
finally, the one-half of the cross-validation sample 
which was most predictable according to their scores 
on the moderator variable used. This process WAS 
repeated for each of the three predictor-criterio 
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TABLE 1 
CORRELATIONS BETWEEN PREDICTOR AND CRITERION VARIABLES FOR ALL SUBJECTS 
AND FOR SUBJECTS SELECTED THROUGH THE USE OF MODERATOR VARIABLES 
Moderator Predictor Criterion ee tion eae uda on sampi 
variables variables variables w ~ 76) 
rt N= 76 N = 54 N= 38 
MCI V scale COPR SWFR 54 _ AT 69 73 
Xoiee = 7.99 | Xoin = 8.06 | oie = 7.45 
MCI V scale SWPR SWFR AT 41 ARSS .67 
Xoite = 8.03 | Xvi = 7.21 | Koi = 5.91 
COPR CLPR PAPR 44 44 46 83 
Xoiee = 8.80 | Xoin = 8.52 | nin = 4.47 


Note.—COPR = counselor prognostic ratings; SWPR = social workers’ prognostic ratings; CLPR = clinicians’ prognostic 
ratings; SWFR = social workers’ final ratings of institutional adjustment; PAPR = parole agents’ ratings of postrelease adjust- 


ment. 


distributions assessed, allowing an evaluation to be 
made of the degree to which each of the moderator 
variables increased the predictive capabilities of each 
of the three predictor variables. 


RESULTS AND DISCUSSION 


The three predictor variables selected for 
use included prognostic ratings by counselors, 
social workers, and clinicians (psychologists 
or psychiatrists). These were essentially pre- 
dictions, on 7-point scales, of the adequacy 
of the boys’ subsequent institutional and post- 
release adjustment, and were derived from 
reports written shortly after each boys’ com- 
mitment to the institution. They were found 
to be moderately reliable, with reliability 
Coefficients of .62, .68, and .76, The criterion 
variable associated with the first two predictor 
variables was the final rating of institutional 
adjustment (derived from social workers’ re- 
ports submitted just prior to the boys’ release 
from the institution). In the third case, the 
criterion variable consisted of ratings of post- 
telease adjustment, derived from probation- 
Parole agents’ reports of the boys’ postrelease 
adjustment for the first 3 months after being 
released from the institution. Both of these 
criterion ratings could be rated reliably, as 
shown by previously obtained reliability co- 
efficients of .74 and .70, respectively. The V 
Scale of the Minnesota Counseling Inventory 
(on which higher scores indicate greater de- 
fensiveness) functioned as a moderator vari- 
able in the first two cases, that is, higher 
Scores on the V scale had been found (on the 
Construction sample) to be correlated signifi- 


cantly with lower D scores (greater predicta- 
bility). Counselors’ global prognostic ratings 
functioned in the third case as a moderator 
variable, with higher scores (more positive 
prognostic ratings) correlated with lower D 
scores (greater predictability) on the con- 
struction sample. Hence, these ratings were 
thus used in the third case to selectively 
remove the less predictable boys from the 
cross-validation sample. 

As shown in Table 1, when the moderator 
variables were used to remove one-third of 
the least predictable boys from the cross- 
validation sample, the correlations between 
predictor and criterion variables showed a 
slight to moderate increase (.22, .12, and .02) 
in the three cases. When half of the least 
predictable boys were similarly removed, the 
correlation between predictor and criterion 
variables showed a much more consistent and 
marked increase (.26, .26, and .39) in the 
three cases. 

These results suggest that selectively re- 
moving less predictable Ss through the use of 
moderator variables markedly enhances the 
clinician’s ability to predict their subsequent 
behavior, particularly when half of the Ss are 
removed in this manner. One obvious limita- 
tion of this approach is that it excludes the 
one-third or one-half of the Ss who are rela- 
tively unpredictable. Although relatively un- 
predictable, these inmates must still be clas- 
sified by the clinician using these moderators. 
Clinicians could, of course, use this informa- 
tion as a rationale for being more tentative in 
classifying these inmates. On the other hand, 
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if it is important to make accurate predictions 
concerning this residual group of relatively 
less predictable Ss, additional moderator vari- 
ables could be used in similar fashion to 
enhance predictions concerning their subse- 
quent behavior. Moderator variables appear 
to be significantly correlated only with cer- 
tain predictor-criterion difference distribu- 
tions. Hence, if certain Ss are found to be 
unpredictable using one moderator, this does 
not mean that they will be similarly unpre- 
dictable using another. Using two moderator 
variables successively is, of course, a more 
complex and time-consuming procedure than 
using only one. However, this is less compli- 
cated than using a number of predictor vari- 
ables, as would be done using a multiple- 
regression approach, 

Although some problems thus exist in the 
use of moderator variables, this technique ap- 
pears potentially more useful under certain 
conditions than a multiple-regression ap- 
proach, not only because the resultant cor- 
relation coefficients are significantly higher 
than those achieved through the use of 
multiple-regression techniques, but also be- 
cause the method is simpler to use by institu- 
tional staff members in classifying newly ad- 
mitted inmates. Although the present study 
involved the selection of moderators simply 
on the basis of the magnitudes of their cor- 
relations with difference scores, it is possible 
that as more information becomes available 
concerning their stability and generality, a 
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theoretical rationale could be developed 
would facilitate their selection and use. Sy 
quent studies will be undertaken with 
goal in mind, focusing specifically on the 
of two or more moderators to maximize 
predictability of virtually all the Ss in tk 
cross-validation samples. 
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SEX DIFFERENCES AND VARIATIONS IN BODY ATTITUDES 


RICHARD 


M. KURTZ1 


Washington University 


In the present study a number of hypotheses were tested about the relation- 
ship between body attitude and sex membership. The Ss were 89 male and 80 


female undergraduate students. Three 
Osgood’s three attitudinal dimensions 
ential on 30 different body concepts. 


separate body attitudes corresponding to 
were measured by the Semantic Differ- 
Results indicated that women not only 


like their bodies better than men but also have a more clearly differentiated 
notion of what they like and dislike about their bodies. Men on the other 
hand judge their bodies as more potent and more active than women. 


The purpose of this study was to investigate 
the relationship between sex differences and 
variations in global body attitudes. A body 
attitude is defined as an individual’s general, 
overall global attitude or feeling about the 
outward form and appearance of his body. 
Three separate body attitudes will be studied 
which correspond to the three major factors 
Osgood, Suci, and Tannenbaum (1957) have 
isolated (Evaluation, Potency, and Activity). 

Of all the differences that exist between 
human beings, sex membership is presumably 
one of the most important. Sex and age are 
equally universal as reference points for es- 
tablishing status and assigning social roles 
(Lundberg, Schrag, & Larsen, 1954). Most 
societies prescribe different attitudes and ac- 
tivities to and develop different expectancies 
about men and women. The techniques used 
to effect and maintain such differences also 
vary from culture to culture (Mead, 1935, 
1949). In the ascription of occupation, educa- 
tion, political activity, and moral codes, sex 
difference is an important factor in most 
Societies. 

In light of the presumed general importance 
of sex membership, it is indeed surprising that 
there are few experimental studies which deal 
directly with sex differences and variations in 
body attitude. Except for the various psycho- 
analytic speculations about the importance of 
the anatomical differences between the sexes 
(Freud, 1938, 1959b, 1959c), there is only 
one study directly related to possible differ- 
ences in body concepts as a function of sex 


1 Requests for reprints should be sent to Richard 
M. Kurtz, Department of Psychology, Washington 
University, St. Louis, Missouri 63130. 


membership. Jourard and Remy (1957) at- 
tempted to investigate whether men and 
women differed in their degree of differentia- 
tion in global body attitude. They tested the 
differentiation hypothesis? by comparing the 
between-person variances of the males and 
females in terms of their total body attitude 
scores. No definite conclusions can be drawn 
from their study since comparison of between- 
person variance is not an adequate test of 
the differentiation hypothesis, In other words, 
insofar as between-person variance reflects 
only differences between the groups in overall 
attitude, it reflects only variance due to indi- 
vidual differences in total body attitude. The 
correct test would have been to compare the 
within-person variances of the males and 
females. 

Several studies are distantly related to the 
question of sex differences in body attitudes, 
For example, some researchers (Secord, 1953; 
Weinberg, 1960) have found that women gave 
more homonym associations to body concepts 
than did men. In a similar study using the 
TAT, Van Lennep (1957) reported that 
females exceed males in the number of body- 
related references found in their stories. In a 
study with male and female adolescents, 
Erikson (1951) found differences in the type 
of scenes which male and female adolescents 


2 Differentiation refers to intraindividual variation 
in attitudes toward specific body aspects. The con- 
cept of differentiation is identical with the semantic 
distance notion of Osgood et al. (1957). The more 
articulated a notion of what one likes or dislikes 
about his body, the clearer, the evaluative distinc- 
tions a person can be expected to draw. The greater 
the differentiation, the greater will be the differences 
between all body concepts. 
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constructed with play material. Females gen- 
erally made structures which were open and 
easy to enter, whereas males fashioned more 
upright “phallic” type structures. Erikson 
speculated that these differences reflected 
contrasting modes of experiencing their bodies 
which were related to their over all sexual 
differentiation. Using a task which called for 
the construction of a series of human forms 
by means of schematic body parts of different 
sizes, Katcher and Levin (1955) found that 
girls were able to match body parts to form 
representing self, mother, and father more 
accurately than could a group of boys of the 
same age. They interpreted their data to mean 
that girls may have a more realistic apprecia- 
tion of the smallness of their own bodies in 
relation to adults than is the case with boys. 

While these studies provide some fragmen- 
tary and tangential support for the existence 
of sex differences in body attitude, most of 
them serve even more to highlight the need 
for a direct “common sense” approach to the 
investigation of body attitude. It seems to this 
author that, in American society, differential 
attitudes toward the body are readily appar- 
ent. General and specific aspects of the female 
body, for instance, are held in high esteem. 
One need only glance through a few of count- 
less magazines, walk past movie marquees, 
beauty shops, or cosmetic counters to verify 
the importance placed on the female body. 
It is also a commonplace observation that 
women expend more effort, time, and money 
on personal grooming than do men. 

One hypothesis suggested by these con- 
siderations is that women should have a more 
clearly differentiated or articulated notion of 
what aspects they like and what aspects they 
dislike about their bodies than is generally the 
case with men. Females can, therefore, be 
expected to draw finer evaluative distinctions 
about the various aspects of their bodies and 
might be less inclined to respond to their 
bodily appearance in a global manner than 
men. If this is the case then one would pre- 
dict that all possible differences between all 
body concepts produced by females on the 
Evaluative factor of the Semantic Differential 
will be larger than all possible differences 
between the same concepts produced by males 
on the Evaluative factor. Stated in an equiva- 
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lent but more technical way, it is predicted 
that the within-person variance of the females! 
Evaluative factor scores will be significantly 
larger than the within-person variance of the 
males’ Evaluative scores, 

It is also predicted that women will evalu- 
ate their bodies more positively than will men, 
That sex differences are related to the atti- 
tudes and expectancies held about the body 
is also indicated by another common sense 
observation. Muscular strength (Jones, 1947) 
is an admired quality in men, but not in 
women. Aggressive behavior is less acceptable 
in females than in males (Lansky, Grandall, 
Kagan, and Baker, 1961). In interpersonal 
relationships the man is expected to be strong 
and dominant, the woman weak and submis- 
sive. Freud (1933, 1959a,) has seen the failure 
to accept these role prescriptions as involved 
in the genesis of neurosis. From these observa- 
tions, the following hypothesis is offered: 
Males will tend to judge their bodies as more 
potent or stronger than will females, that is, 
males will rate their bodies higher on the 
Potency factor than will females. 

Sex differences can also be expected on the 
Activity dimension of the Semantic Differen- 
tial. By the time girls reach puberty, most 
of them accept certain restrictions on their 
physical activities (Muuss, 1962). They are 
exposed to strong social pressures to relin- 
quish, among other things, tomboy behavior. 
Most girls, at least those from the middle 
class, come to recognize that it is not lady- 
like to be as rough and active as their 
brothers. A little later on, girls also learn 
that in social situations involving the opposite 
sex, it is unacceptable to be “forward,” that 
is, to take the initiative, be it sexual or other- 
wise. The female, in short, is generally as- 
signed a more passive role in many situations 
in our society than is the case with males. 

Women’s acceptance of a more passive role 
is seen by many psychiatrists (Deutsch, 1930; 
Fenichel, 1955) as a precondition of sexual 
adjustment and marital happiness. On the 
other hand, the reluctance or inability of a 
man to be assertive and accept the active role 
has been viewed as pathogenic in the forma- 
tion of sexual impotence and homosexuality 
(Ferenczi, 1950a, 1950b). For these reasons, 
a difference is expected between males and 


Sex DIFFERENCES AND Bopy ATTITUDES 


TABLE 1 


Tumrty Bopy CONCEPTS JUDGED BY SUBJECTS ON THE 
SEMANTIC DIFFERENTIAL IN ORDER OF 
PRESENTATION 


Color of my hair 

Size of my hands 

Shape of my fingers 
Size of my ears 

Shape of my ankles 
Shape of my head 

My profile 

Width of my shoulders 
Size of my bust (females) 
Size of my chest (males) 
Size of my hips 

Shape of my legs 

Size of my feet 

My posture 

My weight 

Texture of my hair 

My facial complexion 
Distribution of hair on my body 
Size of my waist 

Shape of my ears 
Length of my neck 

My body build 

My height 

Size of my arms 
Appearance of my facial features 
Texture of my skin 
Appearance of my teeth 
Shape of my knees 

My gait 

Back view of my head 
Color of my skin 


females in the way they rate their bodies on 
the Activity factor. College men will judge 
their bodies as more active than will college 
women. 


METHOD 


The Ss were 89 males and 80 female undergraduate 
Caucasians, predominantly middle-class students 
between 18 and 23 years old, 

Three separate body attitudes corresponding to 
Osgood’s three attitudinal dimensions (Evaluation, 
Potency, and Activity) were measured by the 
Semantic Differential on 30 different body concepts, 
such as “size of my arms,” “color of my hair.” The 
format for presenting this technique to Ss was iden- 
tical to that of Osgood. The body concepts are 
listed in Table 1. 

The task of S was to rate the body concept in 
question along nine bipolar 7-point adjective scales 8 


? The nine bipolar 7-point adjective scales used in 
this study were: Evaluative dimension, good-bad, 
awkward-graceful, beautiful-ugly ; Potency dimension, 
Weak-strong, hard-soft, thin-thick; Activity dimen- 
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which tap the three attitude dimensions. The sum 
of the judgments on the three adjective scales for 
each dimension of the Semantic Differential were 
summed separately for each concept to obtain an 
item score, Then the 30 item scores were summed 
for each person and the total sum was treated as an 
index of how a person felt about the overall 
appearance of his body. 

A generalizability study (Gleser, Cronbach, & 
Rajaratnam, 1965) was conducted on a pilot basis 
to see how well one could generalize from attitudes 
toward specific aspects of the bodily appearance 
to a hypothesized universe of observations, the global 
body attitude. The “wanted” variance due to dif- 
ferences among Ss over a universe of body attitudes 
and adjectival scales far exceeded the “unwanted” 
variance in the total scale score attributable to idio- 
syncratic responses of Ss to specific concepts on spe- 
cific scales. The obtained generalizability coefficients 
for all three dimensions ranged from .86 to .95. 


RESULTS 


The hypothesis that women would have a 
more clearly differentiated notion of what 
they like or dislike about their bodies than 
would men was confirmed. An F ratio com- 
puted between the within-person mean square 
for women (MS = 11.851) and the within- 
person mean square for men (MS = 8.307) 
yielded an F value of 1.425 which was signifi- 
cant beyond the .05 level. 

In addition to having a more differentiated 
body concept, females also like their bodies 
better than do men. The test of significance 
for differences between the mean for females 
( = 452.01) and the mean for males 
(X = 430.03) yielded a z of 2.27 signifi- 
cant beyond the .02 level. 

The hypothesis that the mean Potency 
factor score for men (X = 414.72) would be 
significantly larger than the mean Potency 
factor score for women (X= 351.11) was 
confirmed. A test of significance of differences 
between these means yields a z = 7.47 signifi- 
cant beyond the .001 level. 

The hypothesis which predicted that the 
mean Activity factor score for men (X = 
419.69) would be significantly larger than the 
mean Activity factor score for women (X = 
403.35) was confirmed. The test for signifi- 
cance yields a z = 2.20 which is significant 


sion, active-passive, cold-hot, fast-slow. The poles of 
the scales and the position of the dimensions 
were systematically counterbalanced in order of 
presentation. 
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beyond the .02 level. Pearson product- 
moment correlation coefficients computed 
separately for both sexes between the three 
attitude dimensions ranged from .01 to .09 
and were not significantly different from zero, 
thus indicating that all three factors were 
orthogonal. 


Discussion 


In the present study the hypothesis was 
confirmed that college women have a more 
clearly differentiated body concept than men. 
This finding suggests that greater awareness 
and concern over bodily appearance may be 
more acceptable in the female than in the 
male. This differentiation may stem from the 
fact that it is part of the females’ role pre- 
scription to focus attention on the details of 
their bodies, whereas men are expected to be 
more subdued in the interest they take in 
the appearance of their bodies. 

The finding that women have a more posi- 
tive evaluative attitude toward their bodies 
than men is consistent with the high esteem 
placed on the physical form of the female 
body in American society. It is interesting to 
contrast these findings with the psychoana- 
lytic contention (Kubie, 1937; Reik, 1957) 
that women have a lower evaluation of their 
bodily appearance then men because of “penis 
envy.” Sex differences in body concept may, 
of course, be related to more than obvious 
anatomical differences. It is possible that par- 
ticular cultural definitions of masculinity and 
femininity may be related to specific sex dif- 
ferences found in the three factors of the 
Semantic Differential. Thus, in a culture which 
emphasized the need for the female to be 
physically strong and assertive and the male 
weak and passive, there might be a reversal 
of the present findings. One might even 
speculate that greater body differentiation and 
high evaluation in the female is related to 
her lack of actual social or political power 
in American society. If this is true one might 
expect differences in body attitude between 
women who are occupationally successful in 
a “man’s world,” and those women who are 
content to accept a more traditional female 
role. The career women, therefore, might have 
a body attitude profile more closely resembling 
the typical male, that is, she might be lower 
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on Evaluation and higher on Potency and 
Activity, than would be the more traditionally 
oriented woman. 

On the other hand, it may be that the 
actual physical differences in the form and 
appearance of the male and female bodies 
may be the nucleus for the differences in 
body attitude (Freud, 1933, 1959b). In this 
case one would expect similar sex differences 
in body attitude regardless of cultural varja- 
tions in role prescriptions. These speculations 
about the underlying factors which produce 
sex differences in body attitude pose questions 
for future research. 

Sex differences in body attitude may also 
provide useful leads for studying several 
theoretical problems about sex roles within 
each sex. Sex differences in the multi-dimen- 
sional body attitude measure may, for ex- 
ample, reflect general feelings of masculinity 
or femininity. Perhaps body attitude is related 
to sexual adjustment. Do women who judge 
their bodies as very potent and very active 
have greater sex role conflict than women 
who see their bodies as weak and passive? 
Similar questions can be raised for men. Are 
men who judge their bodies as weak and 
passive more inclined to homosexual or latent 
homosexual adjustments? 

Thus, it seems possible that measures of 
body attitude can provide an additional tool 
to probe the masculinity-femininity dimension. 
This area is one that might prove very fruit- 
ful for future investigation since what counts 
as feminine or masculine in American culture 
So often seems associated with what may 
loosely be called a “good figure.” Association 
of the masculinity-femininity dimension with 
body attitude will probably be most pro- 
nounced in the adolescent and young adult 
age range. One might also expect social class 
differences here, with the lower classes relying 
more heavily upon a “good physique” to 
define and support the adequacy of their sex 
role identity, 
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BRIEF REPORTS 


EFFECT OF INTELLIGENCE, ANXIETY, AND DIAGNOSIS ON 
ARITHMETIC AND DIGIT SPAN PERFORMANCE 
i ON THE WAIS? 


CHARLOTTE J. CALLENS ? anp MALCOLM L. MELTZER 


George Washington University 


The purpose of this study was to investigate 
the effects of intelligence, anxiety, and diagnosis 
on the relationship between the Arithmetic (A) 
and Digit Span (DS) subtests of the WAIS. Of 
special interest was the hypothesis of Rapaport, 
Gill, and Schafer (1945) that schizophrenics 
show the “out of pattern” relationship of DS 
higher than A as compared to the more typical 
A higher than DS found in the anxious normal 
or neurotic, 

Eighty male Ss were selected from a larger 
pool of patients who had been hospitalized at 
either a large city hospital or a naval hospital. 
Forty carried discharge diagnoses of schizo- 
phrenia, 40 were diagnosed as neurotic. WAIS 
scores with A and DS subtracted were used to 
divide the sample into high IQ (105 or above) 
and low IQ (104 or below). The Welsh A scale 
of the MMPI was employed as a measure of 
anxiety (T score 55 or above) and low anxiety 
(T score 54 or below). This allowed for a 
2 X 2 X 2 factorial design composed of eight cells 
and with 10 Ss per cell. Analysis of variance was 
used to evaluate effects of intelligence, anxiety, 
and diagnosis on A, DS, and A minus DS. 

Intelligence was found to be related to A per- 
formance (F= 24.8, df=1/72, p<.001) but 


1This paper is based on a thesis submitted by 
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Department of Psychology, George Washington Uni- 
versity, Washington, D. C. 20006, or for a fee from 
the American Society of Information Science. Order 
NAPS Document 00579 from ASIS National Aux- 
iliary Publications Service, c/o CCM Information 
Sciences, Inc., 909 Third Ave., New York, New York 
10022; remitting $1.00 for microfiche or $3.00 for 
photocopies. Requests for reprints should be sent to 
Malcolm L. Meltzer at the above address, 

? Now at Prince Georges Community College. 


not to DS (F = 1.79). Higher A than DS score 
was also related to intelligence (F= 9.04, df= 
1/72, p <.01). 

No direct relationship was found between anx- 
iety and either A or DS scores. However, the 
difference between A and DS was related to 
anxiety, with high anxiety being associated with 
better A performance than DS performance 
(F = 6.02, df = 1/72, p< .02). 

Diagnosis did not have a direct effect on A, 
DS, or A minus DS. 

The Intelligence X Anxiety X Diagnosis inter- 
action approached significance (F = 3.61, df= 
1/72, p<.10). None of the low-anxiety, low-IQ 
schizophrenics showed A superior to DS, whereas 
this was the typical pattern for the high-anxiety, 
high-IQ neurotics. The ¢ test was significant at 
the .001 level (¢ = 4.73, df = 18). However, the 
A to DS relationship is clearly a function of 
intelligence and anxiety, and the DS above A 
pattern is to be expected in either diagnostic 
group when S is of low-intelligence or low-anxiety 
level. 

The findings of this study suggest that to 
interpret the relationship between A and DS one 
must consider the intelligence level of S, and 
that the relationship between A and DS is prob- 
ably a better indication of anxiety than is the 
level of DS alone. DS well below A may be a 
function of anxiety or intelligence, and the effect 
would be enhanced when S is both bright and 
anxious. Rapaport, Gill, and Schafer (1945) are 
probably right in assuming that the chronic or 
low-anxiety, dull schizophrenic will show the 
out-of-pattern relationship of DS above A, but 
so will nonschizophrenics who are dull or not 
anxious. 
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FIELD INDEPENDENCE AND EDUCATIONAL-VOCATIONAL INTERESTS + 


JACK ARBUTHNOT anp LEOPOLD GRUENFELD 


Cornell 


The purpose of this study was to determine 
whether field dependence-independence (FD-FI) 
is predictive of educational-vocational interests. 
It is hypothesized that (a) increasing FI will 
be associated with increasing interest in analytical 
and abstract areas, such as physics, and with 
decreasing interest in less analytical and abstract 
areas, such as the humanities; (b) since relative 
masculinity has been shown to modify relation- 
ships involving FI (see Gruenfeld & Arbuthnot, 
1969) for male Ss, the relationships predicted 
in a will be of greater magnitude for more mascu- 
line Ss; (c) the partialing out of effects due to 
verbal and mathematical intelligence will not 
affect the magnitude of the relationships pre- 
dicted in a. 

The sample consisted of 55 male freshmen 
engineering students, Level of FI was assessed 
by both the portable rod-and-frame test (RFT) 
and the group form Embedded Figures test 
(GEFT; Form CFV). Masculinity was assessed 
by the Guilford-Zimmerman M-F scale (GZMF). 
Verbal and mathematical intelligence were as- 
sessed by the Scholastic Aptitude Test (VSAT 
and MSAT). Interests were measured by the 
Business (BUS), Humanities (HUM), Social 
(SOC), Physical (PHY), and Biological (BIO) 
Sciences scales of Fricke’s (1965) Opinion, 
Attitude, and Interest Survey. Analytical inter- 
ests are considered to be PHY (most pure) and 
BIO; nonanalytical are HUM (most pure), BUS, 
and SOC, 

The results of Pearson product-moment cor- 
relational analyses were as follows: The RFT 


1An extended report of this study may be ob- 
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Information Sciences, Inc., 909 Third Avenue, New 
York, New York 10022; remitting $1.00 for micro- 
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and GEFT were independent (r= —.03) and 
lacked convergent validity; this replicates earlier 
findings of Gruenfeld and Arbuthnot. The GZMF 
was independent of RFT but predicted HUM, 
PHY, and BIO (r=—.42, .32, .32, p< .05). 
The interest scales intercorrelated negatively or 
not at all. Regarding the hypotheses, RFT (but 
not GEFT) predicted PHY, BIO, HUM, and 
SOC (r= —.33, —.28, 34, 32, p<.05) in the 
expected directions. With the effects due to 
VSAT partialed out, the relationships of RFT 
to PHY and HUM (the purest scales) remained 
significant, while BIO and SOC did not; MSAT 
was independent of RFT and all interest scales. 
When the 38 Ss who were given the GZMF were 
split into high (N =18) and low (N = 20) 
masculine groups, results opposite to those hy- 
pothesized were observed. For the low-masculine 
Ss, RFT (but not GEFT) predicted PHY, HUM, 
and BUS (r= —.56, .50, .45, p<.05). With 
VSAT partialed out, no changes in significance 
were observed; MSAT was independent of RFT 
and all interest scales. 

The results strongly support the hypothesis that 
FD-FI (as measured by RFT) is predictive of 
educational-vocational interests. Further, the no- 
tion that FD-FI merely reflects intelligence is 
largely dispelled for at least one portion of the 
population. Finally, the GZMF does not simply 
reflect predictability; it may be reflecting either 
a rigidity-flexibility or an instrumental-expressive 
dimension, particularly for FI Ss, but which is 
independent of FI. Further research on both the 
prediction of interests with a larger, more repre- 
sentative sample and the nature of the GZMF 
moderating effect is underway. : 
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In this study, also described elsewhere (Danet, 
1968, 1969), a beginning attempt to assess the 
impact of videotape playback on group therapy 
under relatively controlled conditions was made. 
Two university student groups from the Mental 
Hygiene Clinic, University of Minnesota Health 
Service, were videotaped under identical condi- 
tions. The experimental group (E) received play- 
back while the control group (C) did not. C con- 
tained four men and three women, E the reverse. 
The same co-therapists conducted both groups. 

The groups met weekly for ten 50-minute 
sessions, For E, 10-minute segments were ex- 
tracted from the previous meeting for playback 
at the beginning of the following session. 

A periodic questionnaire was administered. 
Sociometric items dealing with interpersonal at- 
tractions were aimed at assessing interaction 
processes and cohesiveness. Evaluative ratings 
and a mutual ranking by members on contribu- 
tion to the group’s progress were also included. 
The ranking yielded an “index of consensus on 
rankings in small groups” which made it pos- 
sible to compare their achievement of concor- 
dance. After the last session, students rated other 
members regarding the extent to which they had 
benefited from therapy. Two other procedures 
employed were an analysis of the seating ar- 
rangements across sessions and a frequency count 
of group members’ participation. 
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The results were as follows: In E the chairs 
were occupied by a significantly larger number 
of students across sessions than in C. This 
finding of a greater amount of mobility and 
shifting position from session to session was 
consistent with Geertsma and Reivich’s (1965) 
assumption that any self-observational experience 
is potentially anxiety producing. 

A tally of the number of first choices each 
member received on the sociometric questions 
showed that C “endorsed” one or two members 
and failed to endorse one or two others, while 
in E, first choices tended to fluctuate. Results of 
the evaluative and improvement ratings showed 
that E tended to have a more negative view of 
itself. 

The frequency of participation data revealed 
C as a well-defined group with a stable pattern 
of interaction. No pattern was apparent in E 
which was more variable. Only, on the task-type 
dimension of the mutual ranking on contribution 
to the group’s progress did E achieve a higher 
degree of concordance (unity). 

The results pointed to the a posteriori hypothe- 
sis that the videotape playback as provided 
during this study had a disrupting influence. As 
compared to C, E was hampered in establishing 
itself as a productive, cohesive unit. This finding 
supported Stoller’s (1967) warning that if video- 
tape feedback is not handled adequately by a 
skillful therapist, it can be disruptive of a psy- 
chotherapy group’s progress. 


REFERENCES 


Danet, B. N. Self-confrontation in psychotherapy 
reviewed, American Journal of Psychotherapy, 
1968, 22, 245-257. 

Daner, B. N. Video tape playback as a therapeutic 
device in group psychotherapy. International Jour- 
nal of Group Psychotherapy, 1969, in press. 

Grertsma, R. H, & Rervicu, R. Repetitive self- 
observation by videotape playback. Journal of 
Nervous and Mental Disease, 1965, 141, 29-41. 

STOLLER, F. H. Focused feedback with videotape: 
Extending the group’s function. In G. M. Gazda 
(Ed.), Basic innovations in group therapy and 
same, Springfield, Ill.: Charles C Thomas, 

967. 


(Received January 15, 1969) 


632 


| 


CONSULTING PSYCHOLOGIST 


Mature Ph.D. Psychologist with strong clinical skills. Should be experienced 
counseling with mature adults. Supervisory experience desirable. Prefer knowledge 
of guidance and industrial counseling. 


To counsel executives and management personnel in areas of personal develop- 
ment and career planning. Handle assignments in assessment, training and organi- 
zation. 


Starting salary $15,000 to $25,000 based upon experience and qualifications. 
Attractive benefits, growth opportunity. Little or no travel. 

Opportunities in New York, Chicago, Dallas, Los Angeles, San Francisco and 
other major cities. 


For consideration phone or submit resume in confidence: 
C. T. Deeble, Ph.D. 
American Management Psychologists 


200 South Michigan Avenue, Chicago, lilinois 60604 
312-427-9335 


Journals of the 
AMERICAN PSYCHOLOGICAL ASSOCIATION 
Subscription Rates for Calendar Year 1970 


AMERICAN PSYCHOLOGIST. Editor: Charles S. Gersoni, American Psychological. Association. Official papers 
of the Association and articles on Psychology. Monthly. The 1970 volume is Vol. 25. Subscription: $10.00 (foreign 
$10.50), Single copy $1.00. 


CONTEMPORARY PSYCHOLOGY. Editor: Gardner Lindzey, University of Texas. Critical reviews of books, 
films, and research material in the field of psychology. Monthly. The 1970-volumeis Vol. 15. Subscription: $10.00 (foreign 
$10.50). Single copy $1.00. 


JOURNAL OF ABNORMAL PSYCHOLOGY. Editor: Howard F. Hunt, Columbi ‘iversit; ‘Theoretical and 

research conditions in behavior pathology. Bimonthly; first issue appears in February. Beginning 1970: Two volumes per 

fae three oe per volume. The 1970 volumes are Vols. 75 and 76. Subscription: $20.00 for 2 vols. (foreign $20.50), 
ingle copy $4.00. 


JOURNAL OF APPLIED PSYCHOLOGY. Editor: Kenneth E. Clark, University of Rochester. Applications of 
psychology in business and industry. Bimonthly; first issue appears in February. The 1970 volume is Vol. 54. Subserip- 
tion: $10.00 (foreign $10.50). Single copy $2.00. 


JOURNAL OF COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY. Editor: Eliot Stellar, University of 
Pennsylvania. | Original contributions in the field of comparative and physiological psychology. Monthly. Beginning in 
1970: four volumes per year, three issues per volume. The 1970 volumes are Vols. 70, 71, 72 and 73. Subscription: $40.00 
or 4 vols, ($40.50 foreign). ' Single copy $4.00. 


JOURNAL OF COUNSELING PSYCHOLOGY. Editor: Francis P. Robinson, Ohio State University. A primary 
publication medium for research in counseling theory and practice. Bimonthly; first issue appears in January. The 1970 
volume is Vol. 17. Subscription: $10.00 (foreign $10.50). Single copy $2.00. 


JOURNAL OF EDUCATIONAL PSYCHOLOGY. Editor: Wayne H. Holtzman, University of Texas. Studies of 
learning and tehina measurement of psychological development, psychology of school subjects, methods of instruction, 
school adjustment. imonthly; first issue appears in February. The 1970 volume is Vol. 61. Subscription: $10.00 (foreign 
$10.50). Single copy $2.00. £e ü 


JOURNAL OF EXPERIMENTAL PSYCHOLOGY. Editor: David A. Grant, University of Wisconsin. Original | 
contributions of an experimental'character. Monthly. Four volumes per year, three issues in each volume. The 1970 
volumes aro Vols. 83, 84, 85 and 86. Subscription: $40.00 for 4 vols, (foreign $40.50). Single copy $4.00. 


JOURNAL OF PERSONALITY AND SOCIAL PSYCHOLOGY, Editor: William J. McGuire, University of Cali- 
ia eerie and ae i on Peay, Broamiice, Rae Fracan and the psychological aspects of structure. 
A » four issues in eacl b; Subscription: 

$30.00 for 3 vols. (Foreign $30.50). Single copy $8.00,’ THe 1970 volumes are Vols. 14, 15 and 16. Subscription 


PSYCHOLOGICAL ABSTRACTS. Executiye Editor: Philip J. Si i blogic jation. 
Nonecritical abstracts of the world’s literature in psychology and related subject, Monthly aian Aponoea] Association, 
Approximately 20,000 abstracts published annually. The 1970 volumeis Vol. 44. Subscription: $130.00 ($102.00 foreign), 


PSYCHOLOGICAL BULLETIN, Editor: James Deese, Johns Hopki į i i i h 
literature, discussions of research methodology in psychology. Mauthly ive won ter pe oer aes olama 
The 1970 volumes are Vols. 73 and'74. Subscription: $20.00 for 2 yols. (foreign $20.50). Single copy $2.00. 


PSYCHOLOGICAL REVIEW. Editor: Charles N. Cofer, Pennsylvania State University. Original contributions 


f a th ‘tical nature. Bi thly; first is i r m h soa 
Coreign $10.50). Sinple copy $2.00" irst issue appears in January. The 1970 volume is Vol. 77. Subscription: $10.00 


DEVELOPMENTAL PSYCHOLOGY. Editor: Boyd R. Mı iversi i 
development and their associated variables. Bimonthly. frst gs Speen o Se epee aes ete 
ar year, three issues per volume. The 1070 volumes are Vols. 2'and 3.. Subscription: $2000 for 2 poks Gonna fence 
ji y 00. i aoa 


Discounts: AGENTS and BOOK DEALERS receive a 10% discount on all orders. There is no club arrangement. 
‘There are no quantity, cash, library, or other special discounts, Payment in U. S. funds. 


x Subscriptions are ayailable on a January through December basis only x 


AMERICAN PSYCHOLOGICAL ASSOCIATION 
Subseription Department 
1200 Seventeenth Street, N.W. 
Washington, D. C. 20036 


JOURNAL OF CONSULTING 
and CLINICAL PSYCHOLOGY 


December 1969 à Vol. 33, No. 6 


` Contents 


ARTICLES 


Manipulation of Indications of Disturbed Thinking in Normal Subjects: Leonard S. 


Levitz and Leonard P, Ullmann........... Bebe: Be E ANE A 633 
Schizophrenia as a “Put-On”: Gerald Rosenbaum.......... toaono 642 
Schizomimetic Conditions and Schizophrenia: Loren F. Chapman............+.++ 646 

An Experimental Procedure for the Modification of Psychotic Behavior: Ivan Toby 
Rutnggand Charles Burle TOE E a: ona, eae eee Siete ithe bs SRL vue cals 651 


Perception of Disability by the Nondisabled: 4. P. MacDonald, Yr., and Janet Hall.. 654 
i Frequency of Dream Recall Estimated by Three Methods and Related to Defense 


Preference and Anxiety: David Benjamin Cohens... <... nianus ecreis oaen 661 
Intelligence Defined as Effectiveness of Approaches: Constance T. Fischer........... 668 
vOperant-Interpersonal Treatment for Marital Discord: Richard B. Stuart............ 675 
vPersonal Adjustment as a Determinant of Aggressive Behavior toward the Mentally Ill: 
Kenneth Ring-and AmtrigQMManenannn E E EE a ey oo sets 683 
Effect of Information, Subject Age, and Instructional’ Set on Predictive Judgments: 
James L. Knestrick and Leon Gorlow.............-+ ess EES i oc E 
¥Rorschach Prediction of Success in Clinical Training: A Second Look: Rae Carlson... 699 
Verbal and Overt-Behavioral Assessment of a Specific Fear: Anthony F. Fazio....... 705 
~ Relations between Personality and Smoking Behavior in Preadult Subjects: Gene M. 
Smith... 2 E E ARS 5 oo SUR a Sa 710 


Effects of Cues and Examiner Influence on Two Wechsler Subtests: Ferome M. Sattler 716 
Personality Correlates of Accurate Empathy in a College Companion Program: George 


F. Vesprattic x... <n RN eee oe a = > os aac clon es oe 722 
Identification of Clinically Relevant Dimensions of Children’s Behavior: Jacob O. Sines, 
Jerome D. Pauker, Lloyd K. Sines, and David R. Owen. +... 0.11... eevee even 728 
Relationshi Ther to Psychiatric Length of Stay and Readmission: 
Mowe Snare a js E A A. Mirck. Be EE EEES onis, AEA 735 
Some Personality and Aptitudinal Correlates of the “A-B” Therapist Scale: James 
E. Dublin, Charles F. Elton, and Juris I. Berzins... 20.06.20. irureun ences 739 
Distinctive Personality Attributes of Child-Abusing Mothers: Barry Melnick and ae 
John R. Hurley.........-+ SESE EAA at Si E eaters 
SPI Discriminators among Four Psychological Health-Sickness Levels: Don Spiegel.. 750 
Prospective Span as a Cognitive Ability: Louis S. Dickstein. ....... 2+... s00.0-000: 757 
Should WISC Subjects Explain Picture Arrangement Stories?: James M. Herrell and ws 
Jeffrey H. Golland tie ae MRM ete 2 a no oo he ate es es e 
Relationship of Manifest Anxiety to Repression-Sensitization on the MMPI: Patrick a 
F. Sullivan and Lynn K. Roberts. .....+- mo... 


List or Manuscripts ACCEPTED. . 2.00220 0c cece ie t Woes nie A Aidan 
(Volume title page and contents appear at the end of this issue.) 


© 1969 by American Psychological Association, Inc. 


Brief Reports 


The Journal of Consulting and Clinical Psy- 
chology will accept Brief Reports of research 
studies in clinical psychology for early publication 
without expense to the author. The procedure is 
intended to permit the publication of soundly de- 
signed studies of specialized interest or limited 
importance which cannot now be accepted because 
of lack of space. Several pages in each issue will 
be devoted to Brief Reports, published in the 
order of their receipt without respect to the dates 
of receipt of the regular articles. Most Brief Re- 
ports appear in the first or second issue to go to 
press following their final acceptance. 


An author who wishes to submit a Brief 
Report: 

1. Sends the Brief Report, limited to one printed 
page and prepared according to the specifications 
given below. 

2. Also sends to the Editor a full report of the re- 
search study, in sufficient detail to give a clear ac- 
count of its background, procedure, results, and con- 
clusions, which will be filed with the National 
Auxiliary Publications Service (NAPS) to insure 
indefinite availability. 

3. Prepares at least 100 mimeographed copies of 
the full report, which the author will send without 
race to all who request it as long as the supply 
lasts. 


4. Agrees not to submit the full report to another 
journal of general circulation. 


Specifications 
Brief Report. The Brief Report should give a 
clear, condensed summary of the procedure of 
the study and as full an account of the results 
as space permits. 


To insure that the Brief Report will be no 
longer than one printed page, its typescript, in- 


cluding all matter except the title and the au- 
thor’s lines, must not exceed 85 lines averaging 
42 characters and spaces in length. Set the type- 
writer margins for short lines of 42 characters, 
which are 3.5 inches long in elite typing, and 4.2 
inches long in pica. 


The manuscript of the Brief Report must be 
double spaced throughout. Except for its short 
lines, it follows the standard style of the 1967 
revision of the APA Publication Manual. Head- 
ings, tables, and references are avoided or, if 
essential, must be counted in the 85 lines, Each 
Brief Report must be accompanied by a footnote 
in the style below, which is typed on a separate 
sheet and not counted in the 85-line quota: 


1 An extended report of this study may be ob- 
tained without charge from John Doe (giving the 
author’s full name and address) or for a fee from 
the American Society for Information Science. Order 
NAPS Document from ASIS National Auxil- 
iary Publications Service, c/o CCM Information 
Sciences, Inc. 909 Third Avenue, New York, New 
York 10022; remitting $—— for microfiche or $—— 
for photocopies. 


Extended report. Because the extended report 
is intended for photoduplication, and is not copy 
to be sent to a printer, its style should differ’ in 
several ways from that of other manuscripts: 
(a) The extended report should be typed with 
single spacing for economy in duplication. (b) 
Tables and figures should be placed adjacent to 
the text which refers to them. A caption should 
be typed below each figure. (c) Footnotes should 
be typed at the bottom of the page on which 
reference is made to them. In other respects, the 
full report is prepared in the style specified by 
the Publication Manual. 


_ Style of Manuscripts. Manuscript: tbe 

in the style described in the Publication Monee of he 

American Psychological Association (1967 Revision) 

chesieete for $1.50 from the Washington office of 
e . 


Early Publication. Authors may secure the early 
publication of an acceptable article in the first issue 
to go to press after its acceptance by arranging for 
the article to be printed as extra pages, increasing 
the mumbar a pages mt subscribers receive. Informa- 

on about charges for early publication € 
tained from the Managing Editor, paige 


National Auxiliary Publications Service. To save 
printing costs, materials supplementary to articles may 

deposited with the NAPS. To obtain copies, order 
by Document Number, as cited in the reference foot- 
note, from ASIS National Auxiliary Publications 
Service, c/o CCM Information Sciences, Inc., 22 West 
34th Street, New York, New York 10001. The indi- 
Cated fee must be remitted in advance. 


Abstracts. Manuscripts must be accompanied by an 
abstract of 100-120 words typed on a separate sheet of 
paper. The abstract should conform to the style 
of Psychological Abstracts. Detailed instructions for 
preparation of the abstracts appeared in the American 
Psychologist (1961, 16, 833), or they may be obtained 
from the Editor or from the APA Central Office. 


Copyrighted material. ‘The following policies govem 
reprinting or other forms of reproduction of APA- 
copyrighted journal materials. Permission by the 
American Psychological Association must be requested 
for reprinting all tables, figures, and text of more than 
500 words (total from one article) in length. ‘There will 
be a charge of $5.00 per page for such permission except 
when the requester is the author of the article to be re- 
printed, or the article is to be reproduced in a limited 
number of copies solely for instructional purposes. 


Reprints. Authors receive 50 reprints gratis 
Zeon quantities may be ordered when returning 
proofs. 


rnal of Consulting and Clinical Psychology 
zit Hd SS No. E, 633-641 


MANIPULATION OF INDICATIONS OF DISTURBED 
THINKING IN NORMAL SUBJECTS? 


LEONARD S. LEVITZ? ann LEONARD P. ULLMANN 
University of Illinois 


A behavioral model of schizophrenia would predict that “normal” people 
emit indications of disturbed thinking (IDT) and that the rate of emitted 
IDT will be affected by environmental manipulations. Two 2 X 2 (presence or 
absence of reinforcement; use or nonuse of instructions) experiments were 
conducted with college students. In one, Holtzman inkblots (HIT) served as 
the pretest-posttest measure and a word association test (WAT) was used for 
training; in the second, the WAT was the pretest-posttest measure and HIT 
was used for training. Results indicated that (a) both reinforcement and 
instructions affected IDT rates on both HIT and WAT during training; 
(b) training results generalized to the pretest-posttest task; and (c) compared 
to reinforcement, instruction led to a more rapid but less durable increase 


in IDT, 


In contrast to other theories of schizo- 
phrenia (e.g., Buss, 1966, pp. 237-390; 
Millon, 1967; Page, 1966), a behavioral 
approach proposes that the designation of 
certain people or their behavior as schizo- 
phrenic is the result of differential social 
teinforcement. Among data cited by Ullmann 
and Krasner (1969, esp. pp. 186-217 and 
377-413) leading to such a formulation are 
the responsiveness of patients to different so- 
cial situations (Zarlock, 1966), experimental 
Conditions (Little, 1966; Meichenbaum, 
1966; Ullmann, Forsman, Kenny, McInnis, 
Unikel, & Zeisset, 1965), placebos (Haas, 
Fink, & Hartfelder, 1963), and direct thera- 
Peutic maneuvers (Atthowe & Krasner, 1968; 
Ayllon & Azrin, 1965; King, Armitage, & 
Tilton, 1960). 

A key question for any theory of schizo- 
Dhrenia is the specification of conditions that 
lead certain people to be hospitalized. Soci- 
Ologists such as Scheff (1966) have pointed 
Out that studies of the prevalence of mental 
illness indicate that there are far more people 
in the community who are “treatable” (ie., 


‘This study is based on a thesis submitted by the 
first author, under the direction of the second 
author, in partial fulfillment of requirements for the 
master’s degree at the University of Ilinois. The 
authors wish to thank J. S. Wiggins for his com- 
ments and Meryl Bernstein for her assistance with 
reliability, 

? Requests for reprints should be sent to Leonard 
sagin 312 West Springfield, Urbana, Illinois 


acting in a manner considered disturbed) 
than there are people who are “treated” (i.e., 
hospitalized or under psychiatric care). 
Scheff further notes that the tendency for 
psychiatrists when making decisions about 
hospital admission or court commitment is 
the “safe” one of designating the person psy- 
chotic. A related consideration is the unsat- 
isfactory reliability of psychiatric diagnosis, 
especially when the individual case is the 
focus. In this situation, very fine and even 
trivial behavioral manifestations may be used 
to justify a designation of a symptom, for 
example, disorganization of thinking, which 
in turn justifies a diagnosis, for example, 
schizophrenia, and therefore hospitalization of 
the person as the solution to a social prob- 
lem (Glasscote, Cumming, Hammersley, 
Ozarin, & Smith, 1966; Smith, Pumphrey, & 
Hall, 1963; Szasz, 1965). 

The person designated as schizophrenic is 
neither passive nor inert. Hospitalized psychi- 
atric patients engage in “impression manage- 
ment” (Braginsky & Braginsky, 1967; 
Braginsky, Grosse, & Ring, 1966) and, in 
turn, are affected by their hospital experience 
(McInnis & Ullmann, 1967). In a similar 
fashion, rather than the designation as 
schizophrenic being relatively fortuitous, it is 
possible to hypothesize that nonhospitalized 
people emit a low base rate of behaviors that 
might be called schizophrenic. If some valued 
change (avoidance or escape from unpleasant 
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stimuli or an increase of pleasant stimuli such 
as acceptance and understanding) occurs 
after emission of such “disturbed” behavior, 
the disturbed behavior will be emitted more 
frequently. 

The purpose of this paper is to present 
data consistent with this behavioral formula- 
tion. The Ss used were students attending 
college. The “schizophrenic” behavior selected 
was disturbance of language and thought, a 
variable a number of theorists (e.g., Meehl, 
1962) have made central to their formula- 
tions of schizophrenia. 

Two instruments that are frequently used 
to measure disturbance of language and 
thought are the Kent-Rosanoff Word Associa- 
tion Test (WAT) and the Holtzman Inkblot 
Test (HIT). The WAT provides a definition 
of disturbed thought based on response fre- 
quencies in the normal population, while 
certain indexes of the HIT, especially those 
of Pathognomic Verbalization, Integration, 
and Form Appropriateness, have been found 
to be significantly associated with scores of 
the process-reactive distinction (Steffy & 
Becker, 1961; Ullmann & Eck, 1965) and, 
by implication, with the severity of the dis- 
order. 

A number of recent investigations, some of 
which were noted above, have indicated that 
the language behavior of hospitalized schizo- 
phrenics may be manipulated directly through 
the use of operant conditioning techniques. 
Ullmann, Krasner, and Edinger (1964), 
Panek (1966), and Thompson (1967) all 
found that schizophrenics emit an increased 
rate of common associates to the WAT when 
reinforced for such behavior. Thompson 
(1967) and Meichenbaum (1967) have found 
that another effective method of increasing 
the emission by schizophrenics of common 
associates and abstract proverb interpreta- 
tions, respectively, was the provision of spe- 
cific instructions about the task such as asking 
the patient to “act as most people would.” 

Palermo and Jenkins (1964), in their tabu- 
lations of response frequencies to the WAT, 
have shown that across all age levels there 
are normal people who give responses that 
are relatively rare or unusual. Holtzman, 
Thorpe, Swartz, and Herron (1961) found 
that college students emitted a low number 
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of HIT responses that were scorable as 
Pathognomic Verbalization, poor Integration, 
or poor Form Appropriateness. Morgan 
(1968) replicated this finding using a group 
of nonhospitalized applicants to a psycho- 
logical clinic. As such, a base rate of indica- 
tions of disturbed thinking (IDT) as mea- 
sured by the WAT and HIT may be found 
in the general population. The present inves- 
tigation, therefore, centered on whether 
instructions and response-contingent rein- 
forcement would significantly increase the 
emission of IDT as measured by the WAT 
and HIT. If the experimental manipulations 
were effective, an additional line of investiga- 
tion would be whether such altered behavior 
generalized to a different but related task. 


METHOD 
Instruments 


Word Association Test (WAT). In this investiga- 
tion an IDT was defined as a response that was 
given by less than 5% of the normative sample of 
500 male college sudents tested by Palermo and 
Jenkins (1964). Because of the nature of the defini- 
nition and of the normative sample, it was neces- 
sary to eliminate four words from the original 
word list.3 

Holtzman Inkblot Test (HIT). On the basis of 
pilot work, 48 inkblots were selected from Forms 
A and B to maximize the base rate of disturbed 
responses found among college students.* A response 
was scored as an IDT if the response could be 
scored positive on any one of three variables: 
Pathognomic Verbalization, Integration, and Form 
Appropriateness. The categories and standards used 
were those presented by Holtzman et al. (1961, pp. 
45-66). 


Treatments 


Upon entering the testing room, each S was given 
the following general instructions: 


I am sure you have heard about psychologists 
being interested in inkblot cards and word associa- 
tions. The purpose of today’s meeting is to obtain 
such responses. To each item that I present, I 
want you to answer as quickly as possible. In the 
word association part, I will orally present @ 
common word to which I want you to tell me the 


3 The most frequent associate of “city” was “Min- 
neapolis” due to the locale of the normative sample; 
high” was eliminated because of its possible con- 
fusion with its homonym; and the common associ- 
ates of “trouble” and “house” accounted for less 
than 40% of the total responses of the sample. 

*For a list of the HIT cards used see Levit? 
(1968). 
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first word that comes to you. When I give you 
an inkblot card, I want you to tell me what the 
card might look like or what it might represent. 
You may see several things in each inkblot but try 
to give me only one response. 


Group R (reinforcement only). After a base-rate 
period, and during the training period only, Ss in 
this group received positive reinforcement contingent 
on the production of indications of disturbed think- 
ing, IDT. The reinforcing stimulus consisted of a 
show of attention and a variety of encouraging 
remarks. 


Group I (instructions only). After a base-rate 
period, Ss in this group received the following 
specific instructions: 


I think your responses would be more spontane- 
ous if you let yourself go: your first thoughts are 
best. In general, this can be done by acting more 
like the common stereotype of a mentally ill 
person. In other words, act like a person who 
doesn’t care what impression he makes on anyone 
but says whatever he feels like saying. 


No reinforcement was given and the instructions 
were not repeated. 

Group RI (reinforcement and instructions). After 
a base-rate period, Ss received the same specific 
instructions as Group I. During the training period 
only, these Ss received positive reinforcement con- 
tingent on the emission of indications of disturbed 
thinking, IDT. 

Group C (control). The Ss were given the intro- 
ductory instructions but no further information, 
teinforcement, or feedback. 


Procedures 


This study consisted of two experiments alike in 
Procedures and treatments but differing in the ways 
the dependent variables were used. 

Prior to testing, each S$ was randomly assigned 
to one of the four treatment conditions described 
above and to one of the two experiments.5 In each 
experiment the variables were reinforcement or no 
reinforcement and instructions or no instructions. 
Tn the first experiment, the HIT was used as a 
Pretest and a posttest measure and the WAT was 
Used as the training material. In the second experi- 
Ment, the pretest and the posttest material was the 
WAT and the training material was the HIT. 

Experiment I. The 48 Ss, 12 in each of the four 
treatment groups in this experiment, received a pre- 
test of 12 Holtzman inkblots, then a base-rate period 
of 24 words. There followed a training period of 
ne, 


* Of the 96 male Ss in this study, 56 were paid 
Volunteers and 40 were obtained from the intro- 
ductory psychology course S pool. The Ss were 
randomly assigned to one of the treatment groups 
With one restriction: in each group there were 7 Ss 
ee a paid volunteer group and 5 Ss from the 

ool. 
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72 word associations, which was later divided into 
three subperiods of 24 words each for the purpose 
of statistical analysis. The training period was fol- 
lowed by a posttest consisting of 12 Holtzman ink- 
blots. Thus, the WAT was used to test whether 
emission of IDT could be increased, and the HIT 
provided the measure of response generalization. 

Experiment II. The 48 Ss, 12 in each of the four 
treatment groups in this experiment, received a pre- 
test of association to 24 words and a base-rate period 
of 12 Holtzman inkblot cards. The training period 
consisted of 36 inkblots, which was later divided for 
statistical analysis into three subperiods of 12 ink- 
blots each. The training period was followed by a 
posttest of association to 24 words. 

In Experiment I, the words used as the base-rate 
measure and the inkblots used in the pretest were 
identical to the words used in the pretest and the 
inkblots used in the base-rate period of Experiment 
II. Further, in Experiment I, the words used in the 
last training period and the inkblots used in the post- 
test were identical to the words used in the posttest 
and the inkblots used in the last training period of 
Experiment II. 


RESULTS 
Reliability of Dependent Measures 


To determine whether the HIT provided a 
reliable measure of IDT, Ss’ responses were 
scored by two independent raters: the senior 
author and an undergraduate student without 
training in clinical psychology. Prior to 
being scored, each response was numbered 
and transcribed to an index card so that the 
identity of the treatment or test period would 
be unknown to the raters. 

The interrater reliability coefficients, each 
based on responses by 48 Ss to 12 inkblots 
were as follows: pretest, .64; posttest, .57; 
base-rate period, .51; Training Period 1, .66; 
Training Period 2, .62; Training Period 3, 
.75. All these correlations were significant 
beyond the .001 level. 


Main Experimental Effects: Experiment I 


Before assessing the effects of the experi- 
mental treatments on the production of indi- 
cations of disturbed thinking as measured by 
the WAT, it was determined that during the 
base-rate period there were no significant 
group differences (F = 2.02, df = 3/44). 

In this investigation, it was not only the 
final level of disturbed thinking that was of 
interest but also the trend of acquisition that 
each experimental manipulation produced. 
The statistical model best suited for deter- 
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TABLE 1 


ANALYSIS OF VARIANCE OF THE MAIN EXPERIMENTAL 
Errects OF EXPERMENT I; WorD 
ÅSSOCIATION TEST SCORES 


Source df MS F 
Experimental groups 3 384 8.17* 
Experimental periods 3 221 34.53* 
Groups X Periods 9 26 4.06* 
Correlated error 44 47 
Uncorrelated error 132 6.4 


*p <.001. 


mining these effects was the two-way repeated 
measures ANOVA (Lohdahl, 1967). The re- 
sults of this analysis are presented in Table 1. 
Group, period, and the Group X Experimental 
Period interaction effects were all significant 
at the .001 level. By examination of Figure 1, 
and by a series of post hoc comparisons, the 
nature of these differences was determined. 

In Figure 1 it may be seen that immedi- 
ately after the special instructions were pre- 
sented to Group I, the number of indications 
of disturbed thinking increased sharply. How- 
ever, this higher rate was not maintained 
during the succeeding periods, and by the 
last training period the rate had returned to 
the base-rate level. The difference between 
the base-rate period and Training Period 1 
was significant at the .01 level, while the 
difference between the base rate and Training 
Period 3 was not significant. 

Group RI, to which the same instructions 
were given but which also received positive 
reinforcement, showed the same sharp in- 
crease as Group I. However, this higher rate 
was maintained to the end of the training 
periods. Thus, the differences between the 
base-rate period and Training Periods 1 and 3 
were significant at the .01 level. 

Receiving positive reinforcement alone, 
Group R showed a steadily increasing rate of 
IDT. The differences between the base rate 
and Training Periods 2 and 3 were significant 
at the .01 level. 


Main Experimental Effects: Experiment II 


It was determined that no significant group 
differences existed during the base-rate period 
(F = 2.38, df = 3/44). The results of the 
tepeated measures ANOVA are presented in 
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Table 2. The differences among experimental 
groups and among the test periods were sig- 
nificant at the .01 level. The interaction was 
not significant. 

Immediately after the special instructions 
were given, as may be seen in Figure 2, 
Group I evidenced a rapid increment in IDT, 
During Training Period 2, as in Experiment I, 
this rate declined. However, during the last 
training period, a second sharp increment ap- 
peared. The differences between the base-rate 
period and Training Periods 1 and 3 were 
significant at the .01 level. 

Group RI showed the same sharp incre- 
ment after the instructions were presented. | 
This increment was maintained through 
Training Period 2, but during the last train- 
ing period the rate decreased. The differences 
between the base rate and Training Periods 
1, 2, and 3 were significant at the .01 level. 

Group R showed some increment in the 
rate of IDT, although not to the extent found 
on the WAT, Still, the difference between the 
base rate and Training Period 3 was signifi- 
cant at the .01 level. 
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Fic. 1. Experiment I: Main experimental effects 
(Word Association Test scores). 
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TABLE 2 


ANALYSIS OF VARIANCE OF THE MAIN EXPERIMENTAL 
Errects*or ExrermeNT II: HOLTZMAN 
INKBLOT TEST SCORES 


Source df MS F 
Experimental groups 3 26.3 4.61* 
Experimental periods 3 23.3 9.71* 
Groups X Periods 9 2.9 1.20 
Correlated error 44 5.7 
Uncorrelated error 132 2.4 

*p <01. 
Bra 


Generalization of Experimental Effects: 
Experiment I 


Upon undertaking this investigation, the 
HIT was believed to measure approximately 
the same type of behavior (in the present 
paper, called indication of disturbed thinking, 
IDT), as did the WAT. The authors were 
unable to find a study reporting on the cor- 
telation between these measures, especially 
one using normal Ss. Unless a significant 
positive relationship between the two mea- 
sures did exist, it would not have been rea- 
sonable to expect the results of the training 
periods to generalize to the second variable. 
Furthermore, a stringent test of generaliza- 
tion would be to show that the degree of 
change due to an experimental manipulation 
on one measure was associated with change 
on the second measure when that manipula- 
tion was no longer being applied. 

A correlation coefficient of .46 (p < .001, 
one-tailed) was found between the base-rate 
Period scores of the WAT and the pretest 
Scores of the HIT. Thus, a moderate degree 
of association existed between the two vari- 
ables used as measures of disturbed thinking. 
It was also determined that no significant 
group differences existed during the pretest 
period (F = .13, df = 3/44). 

To determine differences among groups 
from the pretest to the posttest periods, the 
repeated measures ANOVA was again used. 
The results of the analysis are presented in 
Table 3. The only significant finding was a 
Pretest to posttest trial effect across all ex- 
perimental groups at the .05 level. By exami- 
Nation of Figure 3, and by post hoc compari- 
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sons, it was determined that the difference 
between Groups R and I combined, and 
Group C was significant at the .05 level. All 
individual comparisons were not significant. 
As a final test of generalization, the rela- 
tionship between the degree of IDT change 
exhibited on the WAT during training and 
the degree of IDT change from pretest to 
posttest HIT was determined. The resulting 
correlation of +.24 was in the predicted 
direction and significant at the .05 level 
(CR = 1.64), one-tail test. Considering the 
rater reliability of the HIT, the moderate 
degree of association between the two mea- 
sures, and such factors as time lapse differ- 
ences, this was not an unimportant finding. 


Generalization of Experimental Effects: 
Experiment II 


The correlation coefficient between the 
base-rate period of the HIT and the pretest 
period of WAT was found to be .34, signifi- 
cant at the .02 level, one-tail test. Thus, 
the corresponding finding in Experiment I 
was replicated. No significant group differ- 
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Fic. 2. Experiment II: Main experimental effects 
(Holtzman Inkblot Test scores). 
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TABLE 3 


ANALYSIS OF VARIANCE OF GENERALIZATION EFFECTS 
or EXPERIMENT I: HOLTZMAN 
INKBLOT TEST SCORES 


LEONARD S. LEVITZ AND LEONARD P. ULLMANN 


TABLE 4 


ANALYsIS OF VARIANCE OF GENERALIZATION EFFECTS 
oF EXPERIMENT II: Worp 
ÅSSOCIATION TEST SCORES 


Source df MS F Source df MS F 
Experimental groups 3 4.67 1.67 Experimental groups 3 40.33 2.20 
Pre-post periods 1 8.00 4.82* Pre-post periods 1 114.00 | 30.24* 
Groups X Periods 3 3.00 1.80 Groups X Periods 3 27.00 T16% 
Correlated error 44 2.80 Correlated error 44 18.30 
Uncorrelated error 44 1.66 Uncorrelated error 44 3.77 


*p <.05. 


ences were found during the pretest period 
(F = 1.16, df = 3/44). 

A summary of the results of the analysis 
of variance for changes from pretest to post- 
test is presented in Table 4. The pretest to 
posttest trials effect was highly significant, as 
was the Experimental Group X Test Period 
interaction. Inspection of Figure 4 reveals 
that Group I and Group RI showed a sharp 
pretest to posttest increment as compared 
with Groups C and R. Remember that during 
the training periods on the HIT it was 
Group I that showed the greatest final rate 
of IDT. The pretest to posttest change of 
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Fic. 3. Experiment I: Generalization effects 
(Holtzman Inkblot Test scores). 


*p <.001. 


Group I was found to be significant at the 
.05 level. 

An analysis of change scores identical to 
that described in Experiment I resulted in a 
correlation coefficient of +-.06 which was in 
the predicted direction, but not statistically 
significant. 


Discussion 


Both experiments in this investigation indi- 
cated that the emission by normals (non- 
hospitalized college students) of indications 
of disturbed thinking (IDT) may be signifi- 
cantly influenced by environmental manipula- 
tions, such as instructions and reinforcement. 
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On both experiments, the effects of such 
training influenced emission of IDT on a dif- 
ferent, but correlated task that was also a 
measure of IDT. These findings are consistent 
with, and offer an empirical demonstration of, 
some of the aspects of the behavioral ap- 
proach to schizophrenia. These data are not 
proof of the behavioral formulation; they do 
not prove that this is what necessarily hap- 
pens to people who are called schizophrenic 
and hospitalized. An experiment which ac- 
complished this would be patently unethical. 
Further, even if one did shape an experi- 
mental group to the point of incarceration, 
such a procedure would not logically rule out 
that other factors might also eventuate in 
people being placed in psychiatric hospitals. 
To repeat, the data presented are consist- 
ent with, but not proof of, the behavioral 
approach. 

To some extent, the present data are in- 
consistent with other theories of the etiology 
of schizophrenia. There is no reason to believe 
that the students underwent regressions or 
suffered from excessive variations of motiva- 
tion, attention, social censure, stimulus input, 
or biochemical substances. While the present 
work does not rule out the conceivable im- 
portance of these latter factors, it does indi- 
cate that instructions and reinforcement may 
also increase the emission of behavior called 
schizophrenic. 

There seem to be two ways in which the 
present investigation might be attacked. The 
first is to take the position that it is not the 
behavior but its antecedents that determine 
whether the behavior is pathological. This 
indeed is the position taken by Wegrocki 
(1939): 

The patient’s delusion is an internal resolution of a 
Problem; it is his way of meeting the intolerable 
Situation. That is why it is abnormal. It represents 


a spontaneous protective device of the personality, 
Something which is not learned [p. 169]. 


Aside from the difficulties involved in formu- 
lating behavior as spontaneous and unlearned, 
the empirical material necessary for a distinc- 
tion between clinical (or real) and experi- 
Mental (or unreal) pathology is far from 
readily available. The relevance of the present 
Work to a behavioral formulation of the etiol- 
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ogy of schizophrenia could be negated if one 
denied that the HIT and WAT are measures 
of IDT or that IDT plays an important role 
in the designation of a person as schizo- 
phrenic. Paradoxically, the formulation of 
schizophrenia that could most comfortably 
make these assumptions is the behavioral ap- 
proach which considers both the designator 
and recipient of the diagnosis as playing 
social roles. 

At a more directly empirical level, when 
the present results are compared with studies 
that used the same manipulations with dif- 
ferent tasks, the effects are found to be con- 
sistent. Levy (1967), finding that instructions 
led to an abrupt change in size constancy 
perception, concluded that verbal reinforce- 
ment was merely an “inefficient way of telling 
the subject what to do.” Ayllon and Azrin 
(1964) found, however, that instructions 
alone did not have enduring effects: only 
when they were used in conjunction with 
contingent reinforcement was a high rate 
of performance maintained. In the present 
study, by suggesting that students act like 
mentally ill people, the need for gradual 
shaping was eliminated: a discriminative 
stimulus was provided for a behavior that 
was already available. Compared to reinforce- 
ment, instructions were followed by a more 
abrupt increase in IDT. 

Only under specific conditions, however, 
should reinforcement be considered “a poor 
form of instruction”: when the target behav- 
ior is clearly in the person’s repertoire. It is 
likely that among people who do not possess 
a clear picture of how the mentally ill should 
behave (whether this conception is the correct 
one in the culture or not), such as very young 
children, the effects of instruction and rein- 
forcement would be quite different. Further, 
as demonstrated in both experiments of this 
investigation and by Ayllon and Azrin 
(1964), the conditions encouraging the emis- 
sion of a behavior may be different from those 
needed to maintain it. At a more general level, 
one might say that if the person does not 
know the role, he cannot “manage the im- 
pression,” and, in turn, that if the behavior 
is not (a) recognized as a particular social 
role, (b) designated as such by the appropri- 
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ate societal label, or (c) followed by valued 
consequences, its level of emission will not 
increase. This view is similar to that of 
Becker (1963) on marijuana smoking and of 
Benedict (1934) when she noted: “Even in 
trance the individual holds strictly to the 
rules and expectations of his culture, and his 
experience is as locally patterned as a mar- 
riage rite or an economic exchange [p. 77].” 
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Levitz and Ullmann found that instructions and social reinforcement enable 
college students to enact the “mentally ill” role on two common indicators of 
schizophrenic thinking and generalized their findings as a paradigm for how 
schizophrenic thinking is shaped. Evidence is cited that schizophrenics are 
unresponsive to social instructions and verbal reinforcement, and differ from 
normals in being unable to turn off disturbed thinking. The importance of 
determining reinforcement contingencies and the mechanisms responsible for 
schizophrenic thinking in order to modify central schizophrenic symptoms 


is emphasized. 


The notion that the schizophrenic person fails 
to learn appropriate social roles and patterns of 
consensually validated language has been well 
enunciated by Cameron (1944) and Sullivan 
(1947) in social learning theories of schizo- 
phrenia. These theories have attempted to ac- 
count for the pervasive social disarticulation of 
the schizophrenic in the face of rather consistent 
social nonreinforcement or punishment for the 
emission of such schizophrenic symptoms as 
thinking disorder, hallucinations, delusions, or 
social withdrawal. Other learning theories have 
attempted to derive the failure of schizophrenics 
to respond appropriately with social reinforce- 
ments from socially learned mechanisms, such as 
fear of social censure (Rodnick & Garmezy, 
1957), a predisposition to disorganize under the 
stress of high anxiety (Broen & Storms, 1966; 
Mednick, 1968), or an acquired deficit in adient 
social motives (Rosenbaum, Mackavey, & Gris- 
sell, 1957), and in maintaining a goal-oriented 
major task set (Shakow, 1962). 

A more recent “sociopsychological formulation 
of schizophrenia” which appears to take excep- 
tion to all extant psychological positions has been 
proposed by Ullmann and Krasner (1969). They 
suggest that enacting the “mentally ill” schizo- 
phrenic role may be reinforced by the short- 
term advantages of a deviant social role (e.g., 
avoiding army induction, being disturbing to 
others, and other means of avoiding unpleasant 
and increasing pleasant stimuli). When this de- 
viant behavior leads to the assignment of the 
schizophrenic diagnostic label, further training 
in the role will ensue by modeling and reinforce- 
ments in the psychiatric interview and in the 
hospital complex. The maintenance of appropri- 


1Requests for reprints should be sent to Gerald 
Rosenbaum, Department of Psychology, Wayne State 
University, Detroit, Michigan 48202. 


ate schizophrenic symptoms is assumed to be 
attributable to their selective reinforcement by 
the culture of the large psychiatric hospital, 
Typical of the evidence cited in support of this 
formulation are experimental studies, such as 
Levitz and Ullmann (1969), which are aimed at 
demonstrating that disturbed thinking or other 
primary behavioral indications of schizophrenia 
may be shaped by verbal operant conditioning 
or instructions. 

The Levitz and Ullmann study is certainly a 
rather neat demonstration that college students 
can learn to enact the mentally ill role on at 
least two common clinical indicators of schizo- 
phrenic thinking. It is also patently clear that 
these Ss can very readily follow instructions to 
stop “putting-on” like schizophrenia and can 
easily turn off these indications of disturbed 
thinking. Unfortunately, the same is not true for 
schizophrenics. 

While Ullmann and Krasner may regard schizo- 
phrenia as a “put-on” and Szasz (1965) may 
Perceive schizophrenic hospitalization as a form 
of social injustice perpetrated by a system which 
Punishes deviance, the tragedy for the schizo- 
phrenic individual is his inability to turn off his 
disturbed thinking and to opt for himself. The 
Social psychological critiques of hospital prac- 
tices by writers like Goffman (1961) and 
Dunham and Weinberg (1960), and the develop- 
ment of token economies (Ayllon & Azrin, 1965) 
have contributed positively to the growth of 
more active programs for returning schizophrenic 
persons to the community. Nevertheless, the 
Peculiar learning history and genetic, neurological, 
Perinatal, or biochemical predispositions of the 
schizophrenic appear to seriously interfere with 
his response to well-conceived social reinforce- 
ments even in this era of community participa- 
tion in the treatment program through the 
unit system. 
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In view of these considerations and the in- 
tractibility of schizophrenic symptoms to durable 
modification, it is difficult to understand how 
careful investigators like Levitz and Ullmann 
(1969) can seriously generalize their results to 
an approximation of “what necessarily happens 
to people who are called schizophrenic and hos- 
pitalized [p. 639]” or to a paradigm enabling 
one to “shape an experimental group to the point 
of incarceration [p. 639].” They found that 
instructions, verbal reinforcement, and preferably 
their combination produced statistically signifi- 
cant increases in the frequency of uncommon 
word associations and disturbed Holtzman Ink- 
blot responses emitted by college students, as 
well as some intertask transfer. The combined 
experiments permit comparisons of the different 
training procedures on both tasks and generate 
the conclusion that instructions lead to more 
rapid acquisition but less durable increases in 
these responses, The study demonstrates that 
college students can simulate the mentally ill 
role well enough to produce the relevant indica- 
tions of disturbed thinking under the conditions. 
Despite Levitz and Ullmann’s concern for the 
unethical implications of unduly shaping dis- 
turbed thinking in their Ss, their data indicate 
that their Ss were in no danger of becoming 
schizophrenic as their emission of these responses 
was primarily under the control of self-instruc- 
tions to play a deviant role on these tasks. With 
the resumption of normal roles, these Ss might 
be expected to resume appropriate responses. By 
Contrast, schizophrenics do not resume normal 
roles, and they do continue to emit disturbed 
thinking in spite of instructions, reinforcement, 
and all of the contemporary therapeutic maneu- 
vers that are available for shaping appropriate 
responses, 

It should be noted that the sociopsychological 
thesis that the schizophrenic is an individual 
Playing the mentally ill role for reasons of 
short-term self-interest is tantamount to concep- 
tualizing the schizophrenic as a malingerer. The 
crux of this proposition requires evidence that 
schizophrenics do respond to self-instructions or 
Social instructions to play more appropriate roles. 
The present evidence suggests that Ss who could 
Conform to this criterion for malingering are the 
Students in the Levitz and Ullmann study, not 
Schizophrenics. The theoretical problem of schizo- 
Phrenia is the explanation of the failure of social 
Instructions and reinforcement to appreciably 
Modify symptomatic behavior in approximately 
1% of the population. The sociopsychological 
formulation solves this problem by denying it. 
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The recent interest in social psychological 
analysis of schizophrenia appears to be partly 
attributable to growing enthusiasm for operant 
analysis of social processes. The particular posi- 
tion of Levitz and Ullmann appears to rest also 
on certain assumptions concerning learning effects 
and thought disorder in schizophrenia. 


Learning and Schizophrenia 


In contrast to mental retardates, diminished 
learning rate does not appear to be a primary 
parameter of schizophrenia. Both clinical and ex- 
perimental data on a number of tasks (Jones, 
1961; Lindsley, 1956) indicate that schizo- 
phrenics show normal learning functions when 
reinforcements are appropriate. The fact that 
schizophrenics show instrumental learning of both 
symptomatic and more appropriate behaviors is 
cited as central to the Ullmann and Krasner 
(1969) thesis that schizophrenia is primarily a 
“put-on” in which certain individuals are tagged 
with this pejorative label because their deviant 
role playing is disturbing to others. Yet Meehl 
(1962) concludes his penetrating analysis of this 
problem with the conviction “that schizophrenia, 
while its content is learned, is fundamentally a 
neurological disease of genetic origin [p. 837].” 
Instrumental learning in schizophrenia is not 
surprising and continues to occur in all orga- 
nisms with both acquired and physiological 
deficits as they adapt to their environments. 

The treatment issue in schizophrenia is the 
determination of the conditions for and types of 
reinforcement that will increase the operant rate 
of appropriate responses and minimize symp- 
tomatic behaviors, regardless of etiology. Work 
with token economies (Ayllon & Azrin, 1965) has 
leaned heavily on the use of candy, cigarettes, 
and other amenities of life as reinforcers, rather 
than verbal and social approval. Premack (1959) 
has ingeniously suggested that the opportunity 
to perform high-frequency behaviors, even symp- 
toms, may be employed as reinforcers in behavior 
modification. In summarizing the literature, Buss 
and Lang (1965) note that biologically noxious 
stimulation is the only contingency that has led 
to complete elimination of schizophrenic deficit 
(eg., Rosenbaum, 1967), while motivating in- 
structions (Cohen, 1956) and positive verbal 
reinforcement (Cohen & Cohen, 1960) have 
been least effective in modifying schizophrenic 
performance. 


Thought Disorder in Schizophrenia 


As Levitz and Ullmann (1969) suggest, most 
investigators regard language and thought dis- 
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order as central to schizophrenic pathology and 
therefore as the most important behavior to 
shape. To date, the powerful control and rein- 
forcers used in token economies have been em- 
ployed to shape improvements in grooming, 
participation, cooperation, and ward morale (e.g., 
Atthowe & Krasner, 1968), but there have been 
no reports of the elimination of thought disorder. 
The Levitz and Ullmann demonstration that 
normal Ss can be shaped to make more uncom- 
mon associations, coupled with the findings by 
Ullmann, Krasner, and Edinger (1964) that 
schizophrenics can be trained to make more 
common word associations, are empirical con- 
tributions toward this objective. The failure of 
schizophrenics to maintain such experimentally 
induced improvements in language and of nor- 
mals to retain trained thought disturbances prob- 
ably reflects the long-standing differences in lan- 
guage hierarchies and socialization implied by the 
psychological learning theories of Cameron 
(1944) and Sullivan (1947). Effective shaping of 
schizophrenic thinking is likely to require more 
than the verbal reinforcement of common associ- 
ates. It seems reasonable to assume that some 
theoretical formulation of the habit strengths and 
reinforcement histories of schizophrenic language 
hierarchies and the mechanisms leading to dis- 
turbed verbal behavior would be critical to 
durable shaping. 

Recent work on schizophrenic language by 
Chapman, Chapman, and Miller (1964) suggests 
that thought disorder in schizophrenia results 
from the associative intrusion of strong dominant 
meaning responses in the mediation of overt 
language behavior. Cromwell and Dokecki (1968) 
have lately reviewed the principal theories of 
schizophrenic language and offered their own dis- 
attention interpretation, that schizophrenics are 
deficient in the ability to withdraw attention 
from prior stimuli. Meehl (1962) has even 
suggested that “cognitive slippage” is the result 
of a central neural integrative deficit which inter- 
acts with specific genetic, biochemical, and 
learning mechanisms. The development of sound 
programs for shaping disturbed thinking in the 
varied conditions designated as schizophrenia is 
likely to use a number of these theoretical 
formulations. There may even be some schizo- 
phrenics who are best understood as a “Dut-on.” 
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Levitz and Ullmann have added operant reinforcement to the list of allegedly 
schizomimetic conditions that have been used to generate theories of schizo- 
phrenic thought disorder. The diversity of such allegedly schizomimetic condi- 
tions casts doubt on the validity of interpreting schizophrenic symptoms on 
the basis of any one of them. The very power of operant procedures to mimic 
any behavior, regardless of its origin, argues against invoking its schizo- 
mimetic power as evidence concerning the nature of schizophrenia. The poten- 
tial usefulness of operant methods to alter schizophrenic symptoms need not be 
predicated on an operant origin of the disorder. 


Levitz and Ullmann (1969) have interpreted 
their findings as consistent with their hypothesis 
that schizophrenic symptoms are caused by 
operant reinforcement. The present discussion is 
focused on their general approach to the problem 
of schizophrenia, that is, their citing performance 
of normal Ss in a schizomimetic condition as 
evidence concerning the origin or nature of 
schizophrenic thought disorder. 

Levitz and Ullmann’s study was well designed, 
and it demonstrated that an operant etiology of 
schizophrenic symptoms is at least plausible. The 
investigators concluded, with appropriate cau- 
tion, that their results are “consistent with” but 
“not proof of? the operant origin of schizo- 
phrenic thought disorder. I agree, except in a 
matter of relative emphasis on the two sides of 
the authors’ conclusions. There are additional 
reasons, not discussed by Levitz and Ullmann, 
for viewing their results as very weak support 
for the hypothesis that reinforcement history 
accounts for the origin of schizophrenic thought 
disorder. 

Operant methods are capable of producing 
almost any conceivable change in voluntary 
behavior and even, perhaps, some involuntary 
(autonomic) responses. Verbal responses are 
simply one more kind of Tesponse that can be 
altered operantly. Could anyone doubt after 
reviewing the successes of operant workers that 
one could train college students to rave as 
paretics, speak inadequately as motor aphasics, 
or even to quack as ducks? However, would 
anyone seriously suggest that such a demonstra- 
tion of the power of operant methods would 
constitute support for the hypothesis that pa- 
retics, aphasics, and ducks fail to speak as normal 
adult human beings primarily because of un- 
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fortunate reinforcement histories? The operant 
method has the power to shape responses that 
mimic almost any behavior in its natural state, 
regardless of its origin. Such power rules out 
any one such demonstration as convincing evi- 
dence that behavior of unknown origin occurred 
primarily because of reinforcement contingencies. 

There is a second good reason for caution in 
viewing the schizomimetic power of reinforce- 
ment as support for a primarily operant etiology 
of schizophrenic thought disorder. Many other 
schizomimetic conditions have been reported as 
producing schizophreniclike verbal behavior. In- 
vestigators of these conditions have often, like 
Levitz and Ullmann, argued that the similarity 
of schizophrenic cognition to that of Ss in some 
one schizomimetic condition is evidence that the 
naturally occurring schizophrenic symptom has 
a cause like that of the behavior elicited in the 
schizomimetic condition, The very multiplicity of 
these conditions that produce behavior at least 
superficially akin to schizophrenic symptoms 
casts doubt on the validity of the argument for 
any one of them. The following is at least a 
partial list of these allegedly schizomimetic con- 
ditions that have been invoked as evidence for 
some particular theory of schizophrenia. 

1. LSD, mescaline, and other drugs. Recently, 
the most widely discussed schizomimetic condi- 
tions have been LSD, mescaline, and other drugs. 
Numerous investigators have reported schizo- 
phreniclike cognitive behaviors in response to 
drugs (Hoffer & Osmond, 1967). These behav- 
iors include disturbed association, pressure of 
thought, neologisms, blocking, hallucinations and 
delusions. Several investigators have inferred 
from such evidence that schizophrenia is caused 
by substances similar to one or another of these 
drugs. This hypothesis has stimulated a vast 
amount of research on the biochemistry of these 
drugs, and has led to the identification of new 
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schizomimetic drugs (notably adrenochrome, 
adrenolutin, and sernyl) and, concomitantly, to 
revised hypotheses of the biochemical basis of 
schizophrenia. 

2. Sensory deprivation. The McGill investi- 
gators, who initiated the sensory deprivation re- 
search tradition, reported that a prolonged low 
level of sensory variation produces a deteriora- 
tion in the ability to think in an organized, 
goal-directed manner, as well as increased day- 
dreaming, hallucinations, distortion of body 
image, and other perceptual aberrations (Bexton, 
Heron, & Scott, 1954; Heron, Bexton, & Hebb, 
1953). Several other investigators have recon- 
firmed these observations. 

Rosenzweig (1959) contended that the sensory 
deprivation reaction more closely resembles 
schizophrenia than either the LSD or mescaline 
psychoses. He suggested that both schizophrenia 
and the sensory deprivation reaction are caused 
by a loss of meaning to stimuli. In sensory 
deprivation, meaning is removed from the en- 
vironment by the experimental procedures, 
whereas in schizophrenia, it results from an 
internal derangement of communication between 
the patient’s “abstract system” and his “affect 
system.” 

Gaarder (1963) also cited the similarities be- 
tween schizophrenia and the cognitive disorders 
in sensory deprivation and suggested that schizo- 
phrenia is the result of lack of input of necessary 
data about the environment. 

3. Sleep deprivation. Many investigators have 
noted that prolonged sleep deprivation produces 
cognitive disorganization, hallucinations, and oc- 
casional delusions. The suggestion that schizo- 
phrenia has its origin in a mechanism similar 
to that of the symptoms of sleep deprivation 
has been made by Tyler (1947), Luby, Grisell, 
Frohman, Lees, Cohen, and Gottlieb (1962), and 
West, Janszen, Lester, and Cornelisoon (1962). 
Of these writers, Luby et al. were the most 
specific. They found in sleep deprivation a dis- 
turbance of the synthesis of adenosine triphos- 
phate and they proposed a similar defect as the 
cause of schizophrenia. Bliss, Clark, and West 
(1959) contended that naturally occurring schizo- 
phrenic episodes are often the direct result of 
sleep deprivation, pointing to similarities in 
cognition between schizophrenics and sleep- 
deprived Ss to bolster their contention. 

4. Hypnosis. King (1957) gave a proverbs test 
to normal Ss, both during hypnosis and in the 
wakeful state. Because he found that hypnosis 
Mcreased concrete and bizarre answers, he con- 
cluded that schizophrenia, like hypnosis, is a 
Manifestation of heightened suggestion. 
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5. Speeded performance. Flavell and Draguns 
(1957), following a lead of Schilder (1951), 
argued that each normal thought passes through 
a series of stages of development, and that 
schizophrenic cognition represents prematurely 
expressed early stages in the development of 
normal cognition. Flavell and Draguns (1958) 
offered as supportive evidence of this theory 
their finding of some similarities between the 
various idiosyncrasies of schizophrenics’ responses 
on a word association test and those exhibited 
by normal Ss when required to respond very 
rapidly. 

Phillips and Framo (1954) reached a similar 
conclusion about schizophrenic cognition by 
noting a similarity between the responses that 
schizophrenics typically gave to the Rorschach 
blots and those of normal adults presented with 
brief (tachistoscopic) exposures of the Rorschach 
blots. 

6. Distraction. Jung (1919) studied the effects 
of distraction on the responses of normal Ss in 
a word association test; his manipulation of dis- 
traction consisted of requiring S to mark a 
paper with a pencil in time with the beat of a 
metronome. Jung found that distraction more 
than doubled the production of rhyme responses 
and more than quadrupled the production of 
other clang associations. Later, Jung (1936) in- 
voked the similarity of these responses to those 
typically found in dementia praecox patients as 
evidence that the responses of the patients are 
deviant because of insufficient attention. 

Hassol, Magaret, and Cameron (1952) found 
that normal Ss showed an increase of “scattered 
speech” in telling TAT stories while simultane- 
ously listening to a recording of their own pre- 
vious story of the same TAT card. “Scattered 
speech” included imprecise speech, the use of 
substitute or approximate words or phrases, and 
interruptions in the sequence of the material. 
The investigators concluded that the findings 
supported Cameron’s hypothesis that schizo- 
phrenic disorganized speech results from the 
intrusion of personal fantasy material. 

7. Relaxed attention. Jung (1936) reviewed at 
length a study by Stransky, who obtained samples 
of continuous speech from normal Ss under the 
condition of “relaxed attention.” Stransky gave 
a stimulus word and instructed his Ss to speak 
for 1 minute, but not to pay attention to what 
they said. The Ss showed many features of 
speech that are commonly observed in schizo- 
phrenics, including perseverations, repetitions, 
sound associations, and condensations of se- 
quences of ideas into brief phrases or into single 
words (neologisms). Jung interpreted these find- 
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ings, and his own distraction studies, as proving 
that dementia praecox is characterized by a 
diminution of attention. 

8. Disruption of perception. Aaronson (1964), 
using hypnotism, induced a disturbance in depth 
perception in a single normal S, who responded 
with psychotic ideation and mood changes. The 
author inferred that disorder in depth perception 
may produce schizophrenia. Aaronson (1965), 
again using a single S, reported that he produced 
psychotic ideation, hallucinations, and with- 
drawal by communicating to the hypnotized S 
that time had stopped. Aaronson concluded that 
“Schizophrenia seems specifically a response to 
the elimination of some conventional dimension 
of perception [p. 81.” 

9. Anoxia. Several investigators, including 
Freeman (1931), have noted schizophreniclike 
cognitive aberrations that accompany oxygen 
deprivation. McFarland’s (1932) study was 
especially thorough. He reported that normal 
young men with reduced oxygen intake manifest 
several schizophreniclike symptoms, including de- 
fective judgment, perseveration, an abandonment 
of moral inhibitions, and, subsequently, partial 
or complete amnesia. In more severe oxygen 
deprivation, hallucinations and delusions were 
found to appear. Hinsie, Barach, Harris, Brand, 
and McFarland (1934) pointed to these phe- 
nomena as support for the hypothesis that 
schizophrenic symptoms might reflect deficient 
oxygen supply to brain cells. They took their 
hypothesis so seriously that they converted a 
patients’ dormitory into a large oxygen chamber 
for three months in an attempt to find a treat- 
ment for the disorder, 

10. Brain damage. Several influential clinicians 
have suggested that schizophrenic cognition re- 
sembles that of brain-damaged patients, and 
that schizophrenia is probably, therefore, a simi- 
lar condition, Kraepelin (1919) named the dis- 
order “dementia praecox” because he believed 
that these patients show a mental enfeeblement 
resembling in part that found in paresis, senility, 
and epilepsy, Many other writers have pointed 
out specific similarities in cognitive deficit, be- 
tween schizophrenics and brain-damaged pa- 
tients, and have inferred that schizophrenics also 
have cerebral organic malfunction. The deficit 
cited in common has most often been one of 
concept formation (Babcock, 1933; Bychowski, 
1935, 1943). Goldstein (1939) also took this 
view, but later reversed himself (Goldstein, 
1959), 

11. Childhood. Many writers, ciall; = 
choanalytic theorists, pointing tothe ey 
process” in the thinking of schizophrenics and 
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that of young children have inferred that schizo. 
phrenia results from a regression to an early 
stage of development (Fenichel, 1945). 

12. “Primitive” racial development. Another 
version of the regression hypothesis is that 
schizophrenics have regressed to an earlier stage 
of man’s development such as that of prehistoric 
man or members of contemporary “primitive” 
societies. Storch (1924), Arieti (1955), and 
Werner (1948) have pointed to a number of 
similarities of cognitive functioning that they 
believe support this hypothesis. 

13. Dreams and sleep. The ancient observation 
that dreams resemble madness has been exploited 
most fully by psychoanalytic theorists who see 
“primary process” in both. A number of psycho- 
analysts have inferred that schizophrenic thought, 
like that of dreams, reflects an abandonment of 
the regulative and censoring functions of the 
ego (Freud, 1953; Rosen, 1953). 

Despite the current impasse reached by investi- 
gators employing the method of schizomimetic 
conditions, the method is not logically unsound. 
If two syndromes are truly identical, one must 
take seriously the hypothesis that they have a 
similar cause. However, a major difficulty faced 
by research workers is the crudity of most mea- 
sures of schizophrenic thought disorder. Apparent: 
similarities may vanish when specific details of 
two error patterns are considered. Cameron 
(1938a, 1938b) is well known for making this 
point in an attempt to refute both the conten- 
tion that schizophrenic thought is regressive and 
that it shows organic deterioration, Chapman, 
Burstein, Day, and Verdone (1961) have also 
elaborated this argument in reference to the 
regression hypothesis. Writers too numerous to 
mention have recently made the same point in 
relation to the allegedly schizomimetic drugs. 
It must be noted that Levitz and Ullmann’s 
measure of similarity on the word association test 
is one of the least specific in the literature, 43 
they merely trained atypical responses, without 
any scrutiny of the nature of the atypical re- 
sponses. However, their measure on the Holtzman 
Inkblot Test was more specific. 3 

A second difficulty faced by research in this 
area is the crudity of current systems of diag- 
nosis. Almost all investigators of schizomimetic 
conditions have not attempted to specify what 
types of schizophrenics are mimicked. If there 
are meaningful subdisorders within schizophrenia, 
one might expect the cognitive disorder of some 
of them, but not others, to be mimicked by one 
or another schizomimetic condition. Tutko and 
Spence’s (1962) study exemplifies the promise of 
this approach. They found that on the passive 
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sorting portion of the Object Sorting Test, 
process and reactive schizophrenics depart from 
normal performance by distinctly different error 
patterns, and that the errors of brain-damaged 
patients resemble only those of the process 
schizophrenics. 

Levitz and Ullmann have added to the long 
list of schizomimetic conditions one more condi- 
tion that produces deviant verbal behavior like 
that of schizophrenics. One might ask whether 
anything that is known about reinforcement 
should lead one to take it more seriously as a 
basis of schizophrenic thought disorder than the 
dozen or so other schizomimetic conditions. The 
contrary seems true. Reinforcement may change 
behavior in any direction, toward health as well 
as illness, while the other schizomimetic condi- 
tions almost always debilitate cognition. More- 
over, some schizomimetic conditions may pro- 
duce only changes that are identical to those of 
some variety of schizophrenia. 

Many operant workers might contend that the 
very power of reinforcement to change behavior 
lends support to the presumption that in the 
absence of contrary evidence, any behavioral 
variation is a learned one. This is a hypothesis 
worthy of investigation. The most appropriate 
way to test the hypothesis in reference to schizo- 
phrenia is probably by means of longitudinal 
studies of family interaction. Such studies would, 
unfortunately, be difficult to perform. 

The paucity of evidence for an operant origin 
of schizophrenic symptoms should not, however, 
discourage workers who seek to modify schizo- 
phrenic symptoms by operant methods. Promising 
first steps in modifying schizophrenic cognition 
have already been reported by several investi- 
och as pointed out by Ullmann and Krasner 
1969). 
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AN EXPERIMENTAL PROCEDURE FOR THE 
MODIFICATION OF PSYCHOTIC BEHAVIOR 
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A hospitalized patient exhibiting hallucinatory behavior was subjected to a 
self-monitoring procedure with and without social reinforcement. After 16 
days of treatment, hallucinations were extinguished with no remissions reported 
for approximately 6 months. Other aspects of her behavior also showed 


improvement. 


Of the various new techniques used in 
modifying behavior, perhaps the most promis- 
ing are those which emphasize self-control. 
Self-control can be thought of as a process 
by which an individual sets up conditions in 
his environment to bring about specific be- 
haviors in himself (Goldiamond, 1965a). 
Self-control techniques have been demon- 
strated to be effective in modifying behaviors 
ranging from poor studying habits (Sulzer, 
1962) to stuttering (Goldiamond, 1965b). 

The technique with perhaps one of the 
simplest applications is self-monitoring. Kan- 
fer (1967) contends that self-control is facil- 
itated by immediate positive feedback. He 
feels that behavior may be brought under con- 
trol when accurate feedback is available. By 
self-monitoring, the individual is provided 
with this feedback and becomes aware of the 
effect and consequences of his behavior. 
Essentially this technique consists of an in- 
dividual keeping a record of the frequency in 
Which he engages in the behavior to be con- 
trolled. This provides the feedback which 
Makes the individual cognizant of this be- 
havior and enables him to bring it under his 
Control. Self-monitoring has demonstrated 
effectiveness in modifying eating (Stollak, 
1966) and smoking (Rutner, 1967) behavior. 
Tt has been shown that the effectiveness of 
self-monitoring may be increased when the 
) Procedure is made a “public event” subject to 
: environmental consequences (Rutner, 1967). 
| The present study was undertaken to deter- 
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mine if self-monitoring with and without social 


T “Requests for reprints should be sent to Ivan 
a Rutner, who is now at the University of Mani- 
oba, Department of Psychology, Winnipeg, Canada. 


reinforcement could function as a modifier of 
psychotic behavior. 


PROBLEM 


Eva was a 47-year-old female volunteer resident 
of a state mental hospital diagnosed as schizophrenic. 
She had been hospitalized for a period of 13 years. 
Eva sought hospitalization as a result of frequently 
occurring auditory hallucinations. These hallucinations 
contributed to a condition of severe depression and 
anxiety. What was particularly distressing to Eva 
was the extent to which her behavior was con- 
trolled by her “voices.” She reported that she was 
not even permitted access to the bathroom unless it 
was ordered by the “voices.” What was especially 
intriguing about this problem was Eva’s strong 
desire to resume control of her behavior. She felt that 
if her “voices” could be brought under control 
and eliminated she would be able to leave the hospital 
and function adequately in the community. This 
opinion was also shared by the hospital staff. 


PROCEDURE 


As a result of Eva’s desire to control her behavior 
and because of the presence of “adequate personal 
resources” available to her, a procedure using self- 
control was selected. These personal resources (Kanfer 
& Phillips, 1966) include an ability to give reliable 
self-reports on the basis of self-observation, the 
capacity of learning some degree of self-control, and 
a history of responding to the therapist as a posi- 
tive reinforcer. 

Of the self-control techniques that could have 
been used, self-monitoring was selected because of 
the relative simplicity of its administration. Eva was 
instructed to privately record the frequency of her 
hallucinatory behavior for a 3-day period. She was 
next instructed to publicly record the daily fre- 
quency of hallucinatory behavior on a recording 
chart which was provided. The chart was strategically 
placed on the ward, directly behind the nursing 
station, in full view of everyone on the ward. As a 
result of the chart placement, anyone on the ward 
could be aware of Eva’s progress at any time. The 
chart, beside providing immediate feedback to Eva, 
also functioned as a stimulus for social reinforcement 
on the part of the nursing staff, experimenters, and 
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fellow residents of the ward. The social reinforcement, 
consisting of praise, attention, encouragement, etc., 
was contingent on the recording of a reduction in 
reported hallucinatory behavior. When Eva failed 
to report a reduction, social reinforcement was 
withheld. 


RESULTS AND DISCUSSION 


Figure 1 represents the results of the 
present investigation. The chart identifies re- 
ported hallucinatory behavior on a daily basis. 
An inspection of Figure 1 reveals a marked 
reduction in this behavior. Reported hallucina- 
tions decreased in frequency from 181 on the 
first day, to 11 on the third day of private 
recording. No hallucinations were reported on 
the first day of public recording. Reported 
hallucinations then increased in frequency to 
16 and decreased and leveled off at 0 on the 
sixteenth day. Reported hallucinatory be- 
havior has remained at 0 with no remissions 
reported for approximately 6 months, 

The above results raise a very provocative 
question: What could have accounted for this 
reduction in reported hallucinatory behavior? 
Since reported hallucinatory behavior steadily 
decreased in frequency during the 3 days of 
“private” recording, would “private” recording 
have been sufficient to maintain this reduc- 
tion? While this question cannot be definitely 


answered on the basis of the present study, it 
seems reasonable to assume that the chart 
placement and subsequent social reinforcement 
were responsible for maintaining the reduction 
in reported hallucinatory behavior. During 
the 3-day period prior to the recording chart 
being placed on the ward, Eva’s hallucinations 
and their subsequent recordings were “private 
events” furnishing Eva with feedback ex- 
clusively. However, once the chart was pro- 
vided, Eva’s behavior became a “public event” 
subject to social approval and reinforcement. 
The remission reported on the fifth experi- 
mental day was most probably a reaction to 
this change in conditions. Concomitant with 
this reduction in reported hallucinations, a 
dramatic change was noted in other aspects of 
Eva’s behavior. She appeared more relaxed, 
began to socialize more with other patients, 
her appearance improved, and she asked for 
a job. Perhaps more important, although 
certainly less quantifiable, she appeared to be 
happier. The results indicate that the present 
Procedures were successful in reducing the 
frequency of reported hallucinatory behavior 
and in maintaining this reduction for an e% 
tended period of time. While these results 
are encouraging, they must be interpreted 
with some qualification. There is no evidence 
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to suggest that the results obtained with the 
present patient can be generalized to all pa- 
tients exhibiting similar behavior. Self-mon- 
itoring, like other forms of self-control, is at 
present limited in its application. Since the 
self-control techniques require patient co- 
operation and participation in carrying out the 
procedures without direct supervision, it is 
necessary that the patient be capable of ex- 
hibiting the necessary behaviors to meet such 
requirements. Another variable is the degree 
of aversiveness of the maladaptive behavior 
to the patient. Unless this behavior is aversive 
to the extent that its reduction or termination 
is reinforcing to the patient, it is highly un- 
likely that patient participation can be main- 
tained in the absence of the therapist. 

The present report has been concerned 
with the frequency of reported hallucinatory 
behavior. It was this report of hallucinations 
that was observed and recorded as the overt 
maladaptive behavior. Since hallucinations 
are rarely if ever observable events, it might 
be appropriate to ask what, in addition to 
such reports, do clinicians mean when they say 
“the patient hallucinated?” If they mean only 
that the patient gave a verbal report of an 
hallucination, then it must be assumed that it 
is this verbal report that is considered to be 
the psychotic behavior. 

It has been strongly suggested that in 
mental hospitals much of what is identified as 
Psychotic behavior is nothing more than 
bizzare verbal behavior which is inadvertently 
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shaped up and maintained by the social re- 
actions of the staff (Ayllon & Haughton, 
1964). It would seem that if psychotic verbal 
behavior can so easily be shaped up and 
maintained by bootleg social reinforcement, 
then perhaps such behavior is more a reflection 
of the reinforcement contingencies found in 
the social environment than of some under- 
lying intrapsychic conflict. 


REFERENCES 


Aytton, T., & Haucuton, E. Modification of sympto- 
matic verbal behavior of mental patients. Be- 
haviour Research and Therapy, 1964, 2, 87-97. 

GOLDIAMOND, I. Self-control procedures in personal 
behavior problems. Psychological Reports, 1965, 
17, 851-868. (a) 

Gorpramonp, I. Stuttering and fluency as manip- 
ulatable operant response classes. In L. Krasner & 
L. Ullmann (Eds.), Research in behavior modifica- 
tion. New York: Holt, Rinehart & Winston, 1965. 
(b) 

Kanrer, F. H. Self-regulation: Research, issues and 
speculations. Paper presented at the Ninth Annual 
Institute for Research in Clinical Psychology, Uni- 
versity of Kansas, April 1967. 

Kanrer, F. H., & Pumps, J. S. Behavior therapy: 
A panacea for all ills or passing fansy? Archives 
of General Psychiatry, 1966, 15, 114-128. 

Rurver, I. T. The modification of smoking behavior 
through techniques of self-control, Unpublished 
master’s thesis, Wichita State University, 1967. 

Srottack, G. E. Weight loss obtained under various 
experimental procedures. Unpublished manuscript, 
Indiana University, 1966. 

Sutzer, E. S. Research frontier; Reinforcement and 
the therapeutic contract. Journal of Counseling 
Psychology, 1962, 9, 271-276. 


(Received for early publication March 5, 1969) 


Journal of Consulting and Clinical Psychology 
1969, Vol. 33, No. 6, 654-660 


PERCEPTION OF DISABILITY BY THE NONDISABLED* 


A. P. MacDONALD, Jr.,2 anD JANET HALL 


Rehabilitation Research and Training Center, West Virginia University 


Fifty graduate students (30 males and 20 females) rated the seriousness of 
four disability types for six personal-social areas as they would affect an 
adult male, 28 years of age, married, with two preteenage children. Considerable 
support was obtained for the notion that certain disabling conditions are seen 
as more debilitating to some areas of life than others. Using Duncan's 
multiple-range test, 70% (i.e. 42 out of 60) of the pairs of personal-social 
dimensions were found to differ significantly. As predicted, locus of control and 
ratings of the seriousness of emotional disorders were significantly correlated 
(r= —.30). That is, emotional disorders were seen as more debilitating by 
internally controlled Ss. No support was found for the hypothesis that dis- 
abilities other than emotional disorders would be viewed as more debilitating 
by externally controlled persons. Implications of the findings for rehabilitation 


programs are discussed. 


Numerous approaches are continually tried 
in an effort to rehabilitate greater numbers of 
clients. Notable among these is the approach 
directed at the discovery of especially relevant 
psychosocial variables. Once identified and 
understood, such variables may be instru- 
mental to the development of more efficient 
intervention strategies, as well as to the refine- 
ment of existing strategies. 

Though it is widely recognized and ac- 
cepted that the disabled are disabled people 
rather than disabled people, remedial pro- 
grams seem to focus more on the latter. Major 
emphases seem to be placed on (a) the 
offering of vocational rehabilitation services 
(which typically include vocational training 
and job placement) and (b) the treatment of 
the disabling condition itself (e.g., prosthetic 
devices and surgery). A broadening of focus 
may do much to facilitate the rehabilitation 
process. 

A disabling condition may have, in addition 
to vocational impact, a measureable effect on a 
number of psychosocial dimensions. It seems 
proper to question the impact on such areas as 
family, social interaction, parenthood, marital 


2 This research was supported in part by West 
Virginia University Regional Rehabilitation Re- 
search and Training Center, through the Social and 
Rehabilitation Service, Department of Health, Educa- 
tion, and Welfare. 

2 Requests for reprints should be sent to A. P. 
MacDonald, Jr., Research and Training Center, Old 
Mountainlair, Room 41, West Virginia University, 
Morgantown, West Virginia 26506. 


relationships, self-concept, etc., separately and 
in their various combinations. For example, if 
a specific disability has a greater effect on 
marital relationships than on vocation, effec- 
tive vocational training may be inhibited by 
the use of in-residence training programs. In 
such cases, a more successful approach might 
be to keep clients within their families and to 
couple vocational training with marriage 
counseling. 

It is also important to know how others 
view the disabled, as well as how the disabled 
view themselves. Both questions have been 
addressed. For example, the disabled and 
others have been found to perceive disabled 
persons as “different” from the nondisabled 
(Neff & Weiss, 1965). The perception of the 
disabled as different may have important im- 
plications for the ways in which society will 
treat them. One consequence is that they are 
often given social status comparable to that of 
an underprivileged minority group (Wright, 
1960). | 

Apart from society’s perception and treat- 
ment of the disabled, there is the issue of bis 
perception of himself. Personality charactet- 
istics may in part determine the nature of 
that perception. Characteristics such as Pes 
simism, low self-concept, etc., may be 4 
crippling as any physical impairment, and 
may do much to inhibit adjustment to dis- 
ability. For example, denial (ie., refusing t0 
accept the fact that one is disabled) is viewed 
by some investigators as the primary PSY | 
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chological mechanism which interferes with 
the rehabilitation process (Alger & Rusk, 
1955; Ashenhost, Hurwitz, & Gruen, 1960; 
Barker & Wright, 1952; Barnes, 1952; Fisher, 
1958). An important contribution would, 
therefore, be made if personality character- 
istics peculiar to denial of disability could be 
isolated. 

One personality variable which would ap- 
pear to be especially relevant to adjustment to 
disability is locus of control (Rotter, 1954, 
1966). Locus of control deals with the extent 
to which persons view themselves as having 
some control over their reinforcements. Those 
who believe that they have control are con- 
sidered to be internally controlled (I’s). Those 
who believe that their destinies are controlled 
by luck, chance, fate, or powerful others, 
are externally controlled (E’s). I’s have been 
found to be (a) more willing to engage in 
remedial behaviors to confront their person- 
ality problems (Phares, Ritchie, & Davis, 
1968), (b) more likely to stop smoking after 
the release of the research report that linked 
cancer to smoking (James, Woodruff, & 
Werner, 1965), and (c) superior to externals 
in actively seeking (Davis & Phares, 1967) 
and learning (Seeman, 1963) information 
televant to problem solution. 

Though internals have been found to be 
More open to remedial programs, they have 
been reported to be somewhat more defensive 
regarding negative professional interpretations 
of their personalities. That is, I’s were found 
to recall less negative information about 
themselves than did E’s (Phares et al., 1968). 
One explanation of this finding might be that 
Ts see emotional disorders (assuming that 
negative personality interpretations suggest 
emotional disorders) as relatively more 
threatening to themselves. To the extent that 
emotional disorders imply a loss of inner con- 
trol, they might be more threatening to I’s, 
who believe that they have control to lose. 
On the other hand, E’s might be more 
threatened by other disabilities, the stimulus 
characteristics of which they would view as 
eliciting negative reaction in the external 
world. 

The following study was done to investigate 
the relevance of personal-social dimensions to 
disability groups as well as to test the notions 
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that (a) I’s perceive emotional disorders as 
more debilitating than E’s and (b) E’s view 
disabilities other than emotional disorders as 
more debilitating than I’s. 


METHOD 
Subjects 


Data used in this study were collected from 50 
(30 males and 20 females) graduate students at West 
Virginia University. All Ss had been administered the 
Rotter Internal-External locus of control scale 
(Rotter, 1966) 1 month earlier, as part of another 
study. 


Procedure 


A disability scale was developed for the measure- 
ment of Ss’ perceptions of the seriousness of dis- 
ability types for several personal and social di- 
mensions, The scale was designed to tap five major 
classes of disability: (a) internal disorders—heart 
condition, back condition, and diabetes; (b) sensory 
disorders—blindness, deafness, and speech loss 
(muteness); (c) disfigurements—scarred face, being 
extremely overweight (obesity), and hunched back; 
(d) amputations—amputated leg, amputated arm, 
and amputated hand; (e) emotional disorders— 
having irrational fears, being extremely depressed, 
and being withdrawn. 


The Ss rated each disability on a four-point (from 
“extremely debilitating” through “not much debilitat- 
ing”). The score for each disability type was com- 
puted by summing the ratings of the three specific 
disabilities contained within it. 

The questionnaire was preceded by the following 
instructions: 


Very little is known about people’s views toward 
disability, and we need your help in collecting in- 
formation in this area. We would like YOUR 
OPINION about the seriousness of different dis- 
abilities. This is not a test. There are no right or 
wrong answers; we are interested in your opinion 
only. Below is a list of disabilities. For each 
disability, please mark an X in the category which 
best describes how serious each disability is. Do 
not omit any of the disabilities. Rate each one only 
once (mark only one X for each disability). Do 
not place any X’s on the lines separating the 
categories. 

In that disabilities affect different people in 
different ways, we will be asking you to in- 
dicate how serious each disability is on the job, 
in the marriage, in the community, and so on. 

In answering, assume that the person disabled is: 
(1) the male head of a family; (2) 28 years old; 
(3) a high school graduate; (4) earning a modest 
income; and (5) the father of two pre-teenage 


children. 


The Ss were asked to rate the seriousness of 
each disability separately for six personal and social 
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TABLE 1 
DIFFERENCES BETWEEN THE MEANS OF EACH OF THE PERSONAL-SOCIAL DIMENSIONS 
FOR ĪNTERNAL DISORDERS 

Dimension Vocational | Marital | Social Parental | Familial | Personal 
Vocational ps Er ai : 
Marital ++ $ 
Social bad = * 
Parental 
Familial 

M* 7.32 6.40 | 5.04 | 6.76 6.54 | 6.98 


Note.—Data were analyzed by Duncan's multiple-range test for correlated means, N = 50. 


à The higher the mean, the more debilitating. 
* p< 05. 
KD S01, 


dimensions (each dimension was rated on a separate 
page). The dimensions were as follows: 

1. Vocational. “In your opinion, how seriously 
debilitating are the following for the adult male in 
his job?” 

2, Marital. “In your opinion, how seriously 
debilitating are the following for the adult male in 
his relationship with his wife?” 

3, Social. “In your opinion, how seriously debilitat- 
ing are the following for the adult male in his 
relationship with his friends and neighbors?” 

4. Parental. “In your opinion, how seriously 
debilitating are the following for the adult male in 
his role as a father?” 

5. Familial. “In your opinion, how seriously 
debilitating are the following for the adult male in 
his relationship with his family in general?” 

6. Personal. “In your opinion, how seriously 
debilitating are the following for the adult male in 
his feelings about himself?” 

As aforementioned, it had been decided a priori 
that the 15 disabilities might be collapsed into five 
major classes or disability types. To test the validity 


of the a priori clusters, intercorrelations between all 
the items (within the personal dimension only) were 
computed, The correlations were computed on data 
collected from 78 cases, of which the present sample 
was a part.’ 

Correlational analysis provided considerable sup- 
port for the a priori decisions. Only two adjustments 
were needed: (a) blindness had to be dropped from 
the sensory disorder cluster, as it correlated too 
generally, and (b) disfigurements and amputations 
were collapsed into one category, which was called 
“cosmetic disorders.” Subsequent to these adjust- 
ments, no item correlated more highly with an 
item of any cluster than it did with the items within 
its own cluster. 

Reliability estimates (Cronbach’s alpha coefficient) 
(W=50) were computed for each disability cluster. 
The mean coefficients for each cluster averaged over 
the six dimensions were as follows: internal dis- 


® The 78 Ss were taken from disparate settings. To 
insure homogeneity of sampling, it was deemed ap- 
Propriate to use the data collected from one class 
of 50 graduate students for the major analyses. 


TABLE 2 


DIFFERENCES BETWEEN THE MEANS or EACH OF THE PERSONAL-SocIAL DIMENSIONS 
FOR SENSORY DISORDERS 


Dimension Vocational | Marital k Social | Parental | Familial | Personal 
Vocational +k * 
Marital ee ** ** +* 
Social ** * + 
Parental 
Familial 

Wo 5.96 | 4.76 | 5.44 5.94 5.82 | 6.16 


Mte ae were analyzed by Duncan's multiple-range test for correlated means. N =50. 
g. 


the mean, the more debilitating. 
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TABLE 3 
DIFFERENCES BETWEEN THE MEANS OF EACH OF THE PERSONAL-SOcIAL DIMENSIONS 
FOR Cosmetic DISORDERS 
Dimension Vocational | Marital | Social Parental | Familial | Personal 
Vocational * ** x% ee ** 
Marital t siji 
Social ai wie s 
Parental +x 
Familial ae 
Ms 13,26 12.04 | 10.80 12.04 11.90 | 15.82 


Note.—Data were analyzed by Duncan’s multiple-range test for correlated means. N = 50. 


* The higher the mean, the more debilitating. 
*b < 05. 


> = 101: 


orders, .64; sensory disorders, .78; cosmetic disorders, 
81; and emotional disorders, .85.4 


RESULTS 
Perception of Disability 


A one-way analysis of variance was done 
for each disability type across the six personal- 
social dimensions separately. All Fs were sig- 
nificant at well beyond the .005 level of con- 
fidence. To test for the differences between the 
means of each disability type across the di- 
mensions, the data were analyzed with Dun- 

4The reliability estimates for each disability type 
for each of the dimensions (i.e, vocational, marital, 
social, parental, familial, and personal, respectively) 
were (a) internal disorders: .60, .69, .72, .62, .57, 
and 65; (b) sensory disorders: .65, -77, .83, .74, 
83, and .84; (c) cosmetic disorders: .74, .82, .81, 
84, .79, and .86; and (d) emotional disorders: .77, 
80, 86, .84, .91, and .92. 


can’s multiple-range technique for correlated 
means. 

The mean ratings obtained for each dis- 
ability cluster, within each dimension, and 
the results of the Duncan multiple-range 
analyses (which tested the significance of the 
differences between the means for all pairs) 
are reported in Tables 1 through 4. 

Inspection of the data reported in Tables 
1 through 4 reveals considerable support for 
the notion that people alter their perceptions 
of disabilities according to specified dimen- 
sions. Seventy percent (i.e, 42) of the 60 
pairs of dimensions tested were found to differ 
significantly. 

The combinations that were found to differ 
significantly are too numerous to present in 
detail here—though they are, of course, sum- 
marized in Tables 1-4. Certain molar com- 


TABLE 4 


DIFFERENCES BETWEEN THE MEANS OF EACH OF THE PERSONAL-SOCIAL DIMENSIONS FOR 
EMOTIONAL DISORDERS 


Dimension 


Vocational | Marital 


Social | Parental | Familial | Personal 


Vocational i 


Marital 
Social 
Parental 
Familial 


+e eK 
ee 
ee 


** 


ak 
ee 
ex 


+ 


ek 


Me 7.74 | 8.86 | 


8.98 10.10 


9.78 i 8.98 


Note.—Data were analyzed 
ae higher the mean, the more debilitating. 


>< lo 


by Duncan’s multiple-range test for correlated means. N =50. 
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parisons may be briefly referred to, however. 
For convenience, they will be summarized by 
disability type: 

Internal disorders: The vocational dimen- 
sion received the highest mean rating. The 
mean for vocation differed significantly from 
the means given to all dimensions except the 
personal dimension. Internal disorders were 
seen to be least important socially. The 
mean rating for the social dimension was sig- 
nificantly lower than that of any other di- 
mension. 

Sensory disorders: The mean rating for the 
personal dimension was found to be highest. 
The personal dimension differed significantly 
from all dimensions except the vocational and 
parental. Seen as least affected was the marital 
dimension, which differed significantly from 
each of the other dimensions. 

Cosmetic disorders: The mean rating as- 
signed to the personal dimension was sig- 
nificantly greater than that given to any other 
dimension. Though the vocational area was 
rated as significantly less affected than the 
personal, it perhaps should be noted that it 
was rated as second highest and significantly 
higher than any of the remaining dimensions. 
The social dimension received a significantly 
lower rating than any of the other dimensions, 

Emotional disorders: Contrary to the pat- 
tern reflected in the ratings assigned to the 
internal, sensory, and cosmetic disorders, the 
vocational dimension was rated as least af- 
fected by emotional disability. The mean 
rating assigned to the vocational dimension 
was significantly lower than those of all of the 
remaining dimensions. The parental dimension 
was seen as being significantly more affected 
by emotional disability than any of the other 
dimensions. 

The investigators were somewhat concerned 
about the independence of the marital, 
parental, and familial dimensions, Since these 
dimensions all fall within the purview of the 
general rubic “family,” it was conceivable 
that Ss would not distinguish between them. 
That is, though they were conceptually dis- 
tinct, they may not have been empirically so. 
The data indicate that the Ss did distinguish 
between these dimensions. Within sensory dis- 
orders, the mean rating for the marital di- 
mension differed significantly from those given 
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to the parental and familial dimensions, Al] 
three dimensions differed significantly from 
one another within the category of emotional 
disorders. 


Locus of Control 


Pearson product-moment correlations were 
done for the ratings of each disability type 
and the locus of control scale. To control for 
the effects of the six personal-social dimen- 
sions, as well as to minimize the number of 
statistical comparisons, the sum of the cluster 
scores over all dimensions was used. That is, 
the internal disorder score was the sum of the 
six ratings of internal disorders which each 
individual made. The same procedure was fol- 
lowed for the remaining disorders. Locus of 
control scores were available for 47 of the 
50 Ss. As predicted, I’s reported that they 
perceived emotional disorders as more debil- 
itating than E’s A —.30 (df= 45, p< .05, 
two-tailed test) correlation obtained between 
the scores for emotional disorders and the 
locus of control scale (high scores are as- 
sociated with externality). No significant 
relationship was found to obtain between locus 
of control and internal, sensory, or cosmetic 
disorders. The correlations were .16, —.05, 
and .01, respectively. 


DISCUSSION 


It remains for future research to determine 
how closely the attitudes of the nondisabled 
toward disabilities approximate those of the 
disabled. One interesting finding from these 
data is that the nondisabled generally per- 
ceived disabilities as being less debilitating 
socially than elsewhere. If these perceptions 
mirror those of the disabled, we may be ovet- 
emphasizing the social area at the expense of 
more crucial areas. If, on the other hand, these 
perceptions are highly discrepant from those 
of the disabled, these data are reflecting gross 
insensitivity to the problems experienced by 
the disabled. Validation of the former notion 
would indicate that emphasis might be best 
placed elsewhere. Validation of the latter 
would indicate a need for educating the non- 
disabled about the problems of the disabled. 
Such a program might do much to facilitate 
social adjustment among the disabled. 


PERCEPTION oF DISABILITY 


In this study, Ss were asked to report their 
perceptions of the debilitating effects of vari- 
ous disabilities on six personal-social dimen- 
sions. This is a departure from the usual re- 
search practice of having Ss report their feel- 
ings about the disabled. The present method 
(although it may have implications for de- 
termining attitudes toward the disabled) was 
chosen with specific interests in mind. Apart 
from the obvious concern for measuring at- 
titudes of the nondisabled toward disabilities 
per se, the investigators were interested in 
gaining information about perceptions of the 
nondisabled regarding the effects of disabling 
conditions on important aspects of life. To 
the extent that such perceptions might de- 
termine one’s reactions to becoming disabled, 
we may be in a position to draw valuable im- 
plications.’ 

Self-reported attitudes about disabled per- 
sons have been regarded as indexes of be- 
havior toward the disabled. Similarly, self- 
reported attitudes about disabilities may be 
valid indexes of reactions to becoming dis- 
abled. It must be emphasized, however, that 
the validity of this assumption has not (to the 
authors’ knowledge) been tested. Therefore, 
interpretation of these data, from that point 
of view, must be regarded as too highly 
speculative to be engaged in here. It remains 
for future research to clarify this issue. 

The finding that Ss discriminated consider- 
ably between the personal-social dimensions 
Seems to be an important one. Though it may 
turn out that the pattern of discrimination 
differs between disabled and nondisabled per- 
Sons, it would seem that effective rehabilita- 
tion might be greatly facilitated if such dis- 
ctiminations were measured and considered. 


| For example, it might be found that some 


disabled persons view their conditions as 
most debilitating to their marital relationships 
and least debilitating vocationally. In such 
Cases the practice of bringing those clients 
into in-residence vocational training programs 


®It should be kept in mind that the Ss were 
asked to indicate how debilitating various disabilities 
Would be for another person. Caution should there- 
fore be exercised in generalizing from responses about 
others to responses about self. The authors are 
Presently conducting a study in which Ss are asked 
i ae the seriousness of the disabilities in relation 
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might be ill advised. Rehabilitation of such 
clients would most likely be greatly facilitated 
by permitting them to remain with their 
families and by coupling vocational training 
with marriage counseling or family therapy. 
Correspondingly, a finding that a disability 
was seen to be most debilitating to a client 
in “his feelings about himself” would suggest 
that personal counseling or therapy might be 
an extremely important adjunct to that client’s 
rehabilitation program. 

At the very least, the findings here (though 
far from definitive) indicate a need for future 
research to explore the differential impact of 
disabilities on various personal and social as- 
pects of life. Resulting knowledge may prove 
extremely valuable to the refinement and de- 
velopment of rehabilitation programs. 

The hypothesized relationships between in- 
ternal-external locus of control and reactions 
to the classes of disabilities received only mild 
support from these data. Though a significant 
correlation was found to obtain between locus 
of control and ratings of emotional disorders 
(as predicted, I’s perceived emotional disorders 
as more debilitating than E’s), none was found 
to be significant for the remaining disability 
types—though two of the three were in the 
predicted direction. Even the significant cor- 
relation obtained (—.30) was rather low, how- 
ever. In that locus of control is an intraper- 
sonal variable, the hypotheses might have 
been more efficiently tested had the Ss been 
asked to judge the debilitating effects of the 
disabilities in relation to themselves, rather 
than to another (i.e., the adult married male) 
(see Footnote 5). 
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Until quite recently, the focus of clinical 
and experimental investigation of dreaming 
has been on (a) interpretation and standardi- 
zation of dream content (Freud, 1955; Hall, 
1966); (b) psychophysiological concomitants 
of “dreaming sleep” (Hartmann, 1967; Luce, 
1966; Snyder, 1963); (c) psychological as- 
pects of sleep and dreaming (Foulkes, 1966) ; 
and (d) hypothesizing about the possible bio- 
logical functions of various stages of sleep 
(Fisher, 1965; Shapiro, 1967; Ullman, 1959). 

In light of the large amount of empirical 
work on the psychophysiology of sleep and 
dreaming (Hartmann, 1967), there has been 
relatively little systematic investigation of 
Psychological factors contributing to differen- 
tial frequency of dream recall (DRF). In the 
absence of a definitive psychological theory 
of DRF, the few studies reported in the 
literature have focused on variables that seem 
logically related to DRF. A small but signifi- 
cant relationship has been found between 
high test anxiety and high DRF measured 
by questionnaire (Domhoff & Gerson, 1967; 
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FREQUENCY OF DREAM RECALL ESTIMATED BY 
THREE METHODS AND RELATED TO DEFENSE 
PREFERENCE AND ANXIETY? 
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Dream recall frequency (DRF) was measured by questionnaire (Q), diary 
(D), and sleep interruption via telephone (T). Q DRF was high for aggres- 
sion out, projection, and denial, and low for intellectualization and aggression 
in (p> .05), especially for Ss higher on test anxiety (p < .05) than the median 
of their particular defense group (high “relative anxiety”). Compared to Q 
DRF, D and T DRF were markedly higher for all groups; there were no 
defense differences. The Ss with higher relative anxiety had significantly more 
D and T DRF than Ss with low relative anxiety ($ <.05). There were no 
Q, D, or T DRF differences for high versus low “general anxiety” (anxiety 
relative to the overall sample median). The relationship of DRF to procedural, 
psychological, and psychophysiological factors is discussed. 


Strauch, Bliemeister, & Malzahn, 1966; Tart, 
1962), and high DRF measured by home 
dream diaries (Singer & Schonbar, 1961). 
Schonbar (1959) reported a similar relation- 
ship between test anxiety and diary DRF, but 
the unusually high correlation of .50 may have 
been due to statistical artifact (Domhoff & 
Gerson, 1967). 

Negative correlations between DRF and 
ego strength were found by Schonbar (1959) 
using diaries, and by Domhoff and Gerson 
(1967) using questionnaires. Tart (1962) 
and Domhoff and Gerson (1967) found low 
but significant negative correlations between 
questionnaire DRF and repression. In light 
of the nearly tautological nature of the pre- 
diction, the small proportion of the variance 
accounted for by repression is surprising and 
challenging. 

Other studies have reported low but signifi- 
cant correlations between questionnaire and 
diary DRF and cognitive control measured 
by the Embedded Figures Test (Witkin, Dyk, 
Faterson, Goodenough, & Karp, 1962), the 
rod-and-frame test (Schonbar, 1965), and the 
schematizing test (Lachman, Lapkin, & 
Handelman, 1962). Measures of “fantasy 
predominance,” “associative productivity,” 
and “introspective constriction” derived from 
Rorschach scores (Orlinsky, 1966) were cor- 
related with DRF measured by the rapid 
eye movement, sleep interruption method of 
Dement and Kleitman (1957). The first two 
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Rorschach measures correlated positively, the 
last one negatively with this measure of DRF. 

Empirical research has suggested a variety 
of psychological, psychophysiological, and 
methodological factors which seem to con- 
tribute to DRF: the method used (eg., 
checklist vs. verbatim diary), the utility of 
recalling dreams, consolidation of memory 
traces, attitude toward dream life, repression, 
the psychophysiology of the waking process, 
receptivity toward inner life, mislabeling 
dreams as thoughts, “depth” of sleep, stage 
of sleep, anxiety, confabulation of “recall” 
upon awakening. Therefore, in the study 
reported here, DRF is treated as a dependent 
variable, 


Relationship of DRF to the Method 


The questionnaire (Q) method is derived 
from a set of global estimations of a collection 
of past events, that is, estimation by S of a 
Preconscious, experiment-independent tend- 
ency to recall dreams upon waking from sleep. 
Diary (D) and telephone (T) measures of 
DRF are frequency counts of day to day, 
experiment-dependent (i.e., externally im- 
posed) and therefore conscious focusing 
of attention on the psychological aspects of 
the waking process. It was assumed that for 
most Ss the absolute amount of DRF would 
increase from initial Q estimates to D and T 
measures. Complex shifts, rather than mono- 
tonic increases, in the relative amount of 
DRF from Q to D and T as a function of 
personality differences would underscore the 
importance of specifying method when dis- 
cussing the relation of DRF to personality. 


Relationship of DRF to Defense Preference 
and Anxiety 


Psychoanalytic theory acknowledges the 
close relationship of defense and anxiety. De- 
fense is conceptualized as an adjustive means 
of containing (“countercathecting”) , binding, 
and/or “neutralizing” (i.e, sublimating) anx- 
jety. Defense styles could be thought of as 
the | Psychologically “bound” form of free 
anxiety in the sense that matter is a bound 
form of energy. There is strong theoretical 
justification for focusing on the relationship 
or interaction of “energic” (free anxiety) and 
“structural” (defense style) aspects of normal 
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and neurotic adjustment rather than looking 
at these factors separately. There is generally 
a similar approach to behavior analysis in the 
Hullian concept of the interaction of drive 
and habit strength. 

In most studies, DRF has been related, 
not only to isolated dimensions, but relatively 
undifferentiated “energic” and “structural” 
categories (e.g., anxiety, neuroticism, etc.) 
that tend to obscure potentially relevant indi- 
vidual differences. In the study reported here, 
the “structural” dimension, defensiveness, was 
operationalized with a defense preference test 
by which five groups of Ss, each characterized 
by a different defense style, can be obtained. 
The five defense groups are aggression toward 
objects, projection, intellectualization, aggres- 
sion toward the self, and denial-repression. 

Anxiety may be statistically independent of 
“defensiveness” or other general structural 
personality variables, but it is dependent, in 
both a statistical and a psychological sense, 
on particular defensive styles. Each style has 
an expected or “normal” level of anxiety, and 
it is this level, rather than some overall 
population value, which provides a meaning- 
ful criterion against which high or low levels 
of anxiety are predictive of behavior. For 
example, the median value of anxiety test 
scores of groups high on “intellectualization” 
might be in the vicinity of 20, while a broadly 
defined, structurally less differentiated group 
high on “defensiveness” might obtain a 
median value around 30. If nothing else were 
known about an S other than that he is high 
on “defensiveness,” his anxiety test score of 
30 would have little predictive value. If it 
were known that this S could be clearly clas- } 
sified as an “intellectualizer,” that same score 
of 30 might indicate high anxiety relative to 
what his particular defensive style can com- | 
fortably accommodate. The score of 30 might 
be diagnostically indicative of a serious 
weakening of defensive capacity. 4 

Therefore, in the present study, all Ss with 
anxiety test scores above the median of their 
Particular defense group were considered to 
be high “relative anxiety” Ss whatever the 
relationship of their score to the overall sample 
median, that is, whatever their level of “get 
eral anxiety.” The Ss with anxiety test scores 
below the median of their particular defensive » 


FREQUENCY oF Dream RECALL 


group were called low “relative anxiety” Ss. 

Accordingly, the major prediction was that 
DRF would be most clearly related to level 
of relative anxiety, and only minimally if at 
all related to general anxiety. No specific 
predictions were made as to the differences in 
the relationship under the three methods of 
obtaining DRF. 


MeEtTHOoD 
Subjects 


The Ss were 86 male undergraduate students most 
of whom were freshmen and sophomores, all but a 
few of whom lived in dormitories. They were 
selected from an S pool made up of students taking 
introductory psychology courses at the University 


of Michigan, and thus were neither paid nor 
volunteers, 
Materials 
Independent variables. The Defense Measuring 


Instrument (DMI) devised by Gleser and Ihilevich 
(1969) was used to select Ss for five defense prefer- 
ence (experimental) groups and one defense non- 
preferent (control) group. The five defense dimen- 
sions scored are turning aggression outward toward 
others (TAO), projection (PRO), intellectualization 
(INT), turning aggression inwardly onto the self 
(TAS), and reversal (REV, ie., denial). According 
to the authors of the test, these dimensions repre- 
sent a comprehensive and theoretically meaningful 
Way of dividing the field of defense styles. The test 
i an objectively scored, highly structured projective 
test involving responses to stories tapping various 
conflict areas. A number of construct validity 
me have been obtained (Gleser & Ihilevich, 
The IPAT Anxiety Test developed by Cattell and 
his associates was used to obtain anxiety scores. 
is an objective, 40-item questionnaire with 
very high reliability and “varied and impressive” 
evidence of validity (Buros, 1965). 
Dependent variables. Three measures of DRF 


f were used: (a) estimation of frequency of recall 


ftom questionnaire items chosen by S (Q DRF); 
frequency count of indications of dream recall 
0n a 30-day check list “diary” sheet (D DRF); and 
€) frequency count of indications of dream recall 
or seven completed calls during the period of about 
5 weeks (T DRF). Any perceptual, conceptual, or 
affective element presumed by S to be part of a 
Team was scored “dream recalled.” 
A discussion of the relationship of dream experi- 
ences, dream recall, and dream reporting is beyond 
© scope of this report.? It is assumed, however, 
at reports of dreaming more often than not are 
Sttuine indications of the presence of recall, and 


ER discussion of this relationship is planned in 
© context of another paper on the validity of the 


, telephone (T) method of obtaining DRF data. 
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that recall is generally a residue of an actual dream 
experience rather than some sort of arousal phe- 
nomenon (Foulkes, 1967) or confabulation to please 
E (Rechtschaffen, 1967). 

y The questionnaire (Q) was composed of nine 
items, but only the item on frequency of occurrence 
was used. The S was asked to choose one of four 
categories of frequency as most characteristic of 
himself: (a) just about every day or every other 
day; (b) about once a week or twice a week; 
(c) couple of times a month; (d) hardly ever. The 
S could pick a second-most-applicable item, Verbal 
estimations were converted into numerical estima- 
tions based on a month’s time. In a separate study 
of 23 Ss taken from a similar student population, a 
test-retest product-moment correlation of .906 was 
obtained for an interval of 4 weeks.* 

The dream “diary” (D) was an objective checklist 
composed of six of the nine questions from Q plus a 
question on length of time between waking and 
filling in the sheet. The S responded by putting a 
mark (X) under the appropriate “a versus b” 
alternative for each question within the column 
that represented that particular day. The sheet was 
divided into 30 columns, one for each of the 30 
days S was to fill out the sheet. As with Q, only 
the item contributing to DRF was used. 

A split-half product-moment correlation of .92 ob- 
tained for the 23 Ss of the control group indicated 
high stability over time (ie. from the first to the 
second 15-day interval). The odd-even correlation 
for the same was .65 (p < .001) indicating a certain 
amount of day to day fluctuation. There seems to 
be a general consistency of DRF under conditions 
that impose an ongoing task of reporting the psycho- 
logical aspects of waking. 

The telephone procedure (T) to be described was 
based on the same six questions from Q that were 
used in D. A “telephone diary sheet” was filled out 
by the research assistants (RAs) who made the calls.5 
While scoring the interview on the T sheet, RA 
recorded the interview with a Wollensak T-1500 
tape recorder by means of a pickup coil placed 
near the earpiece of the receiver. The transcriptions 
of these interviews were not used in this study. 
Only the item relevant to DRF was used. The T 
procedure was suggested by Arthur Shapiro,® and 
used in a study by Rosen (1967). 


Procedure 


The Ss were selected for the five defense groups 
if their score on a particular dimension was at least 
equal to or greater than the mean plus 1.5 standard 
deviations, and was the highest score of the five. 
The Ss were selected for the control group by 
randomly choosing 23 Ss, all of whose scores were 
within the expected range of +1.25 standard devia- 


4The author wishes to thank Gary Talpos for 
providing the data for the statistical test. 
5 Special thanks to Kurt Hecox and Andy Broder 


for their valuable assistance. 
6 A. Shapiro, personal communication, March 1967. 
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TABLE 1 
INTERCORRELATIONS OF DRF FoR 
THE THREE PROCEDURES 
Procedure Q D | T 
Q 018 20° 
D 4b 
a N = 83, 
bN =81. 
*p <.001, 


tions. After selection, Ss filled out the IPAT test and 
the dream questionnaire, and then received the 
dairy sheets. 

Finally the T method was outlined. The Ss were 
told that during the period they were filling out 
the diary, they would be called sometime during 
the night once or twice a week, but not on week- 
ends. The Ss were not informed that the calls would 
be made only between the hours of 5:00 and 7:30 
A.M, during which time Ss spent most of their time 
in dreaming sleep (Stages 2 and 1-REM sleep). 
The Ss were told to consider the D and T pro- 
cedures independent, and not to skip D on the days 
they received a call (unless, of course, they couldn’t 
go back to sleep). 

An extended discussion of the relationship of T 
to laboratory sleep interruption methods is being 
prepared for publication. Suffice it to say there is 
considerable evidence that in terms of both DRF 
and content of dream recall, there is great similarity 
in the data obtained by home and laboratory meth- 
ods. If one is interested in DRF or content of dream 
recall rather than the relationship of these to the 
particular stage of sleep from which S is aroused, T 
fairly well simulates the laboratory procedure. 


RESULTS AND Discussion 
Relationship of DRF to Method 


In order to explore the meaning of DRF 
as variously measured, intercorrelations and 
analysis of variance of DRF from the three 
methods were obtained. Table 1 shows the 
intercorrelations of DRF for questionnaire 
(Q), diary (D), and telephone (T) proce- 
dures. Only the correlation between D and T 
is significant. The variance shared by these 
measures is probably due to the fact that 
they both involve on the spot, continuous, 
externally imposed, conscious focusing of at- 
tention on an otherwise preconscious process. 
These intercorrelations suggest that Q repre- 
sents a different set of factors from D and HL, 
and that although D and T are far from 
identical, they do seem similar, 
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Table 2 bears out this initial impre 


methods when DRF is observed as a 
of defense preference and anxiety, 
estimations of DRF increased from abe 
times per month for Q to about 18 ti 
month (about 60%) for D, and to ap 
mately one out of every two calls ( 


for D and T. 

From the analyses of variance carri 
on the data of Table 2 (see Table 3), 
fairly clear that the D and T data sho 
greater similarity to each other than 
does to the Q data, both in terms of y 
effects are significant and the proporti 
variance accounted for. Specifically, the 
fense (just short of the .05 level) and 
action (p < .05) effects are significant 
account for about 19% of the variance 
Q DRF data. Ordinarily, an effect fa 
reach the .05 level would be most parsim 
ously reported as nonsignificant. Howeve 
light of the apparently close relationship 
tween anxiety and defense, both in te 


TABLE 2 


Mean DRF RELATED TO DEFENSE PREFERENCE 
ANXIETY FOR Q, D, anD T METHODS 


Defense preference groups 


Level of 
anxiet, 
i TAO | PRO | REV | INT 
Questionnaire 
High 82| 89] 7.7) 16 
Low 58] 3.9] 53] 3.2 
Diary 
High | 21.7 | 23.0 | 23.1 | 14.2 
Low 13.6 | 14.1 | 15.5 | 16.6 
Telephone* 
High | 14.6 | 15.0 | 19.3 | 13.7 
Low 10.7 | 129 | 9.4 | 13.7 


Note.—Cell Ns for the defense preference groups 
five to seven; for the control group, 10 to 11. 

af soil numbers are oa Dee De ee prot ai t 
seni per 30 calls. lows com] 
bers in the three tables, + 
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TABLE 3 


ANALYSES OF VARIANCE OF DRF RELATED To 
DEFENSE PREFERENCE AND ANXIETY FOR 
Q, D, ann T METHODS 


Source df | MS | F | è 
Questionnaire 
Relative anxiety 1 36.5 | 1.9 
Defense preference 4 | 46.1) 2.4* 09 
Defense X Anxiety 4 49.9 | 2.6** | .10 
Error (within groups) 52 19.2 
Diary 
Relative anxiety 1 | 362.6 | 4.4** | .07 
Defense preference 4 15.5] — 
Defense X Anxiety 4 | 104.1} 1.3 
Error (within groups) 48 | 83.0 
Telephone 

Relative anxiety 1 | 21.6] 4.6%} .07 
Defense preference 4 OB iti 
Defense X Anxiety 4 25] = 
Error (within groups) 46 4.7 


fi Note,—Cell Ns for the defense preference groups vary from 
ve to seven; for the control group from 10 to 11. e is an 
Unbiased estimate of the proportion of variance accounted for 
(Coben, 1965), 
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theory and data reported in this study, it is 
felt that this defense effect is significant. 

_ Neither D nor T data show such variabil- 
ity. For these methods, only the relative anx- 
lety effect is significant (p < .05), accounting 
for about 7% of the variance in each set of 
data, Apparently the demand characteristic 
Common to both procedures, that is, exter- 
nally imposed focusing on dream life, can 
account for much of the D and T results. It 
is clear, though, that DRF cannot be con- 
sidered independently of the measure used 
to obtain it, and that differences especially 
between Q versus D and T cannot be ig- 
nored, especially in studies which select Ss 
on the basis of high or low DRF. 

Q DRF seems to be a function of defense- 
related, perhaps attitudinal, experiment-inde- 
Pendent tendencies to focus or not to focus 
on dream life. D and T DRF seem to be 
p atively more independent of specific de- 
ense styles. They seem more related to an 
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experimentally induced, conscious tendency to 
recall dreaming. For all three methods, these 
“internally” or “externally” based tendencies 
are apparently exaggerated by high relative 
anxiety. 


Relationship of Q DRF to Defense Preference 


Defense Groups TAO (aggressive outward), 
PRO (projection), and REV (denial) showed 
significantly more Q DRF than Defense 
Groups INT (intellectualization) and TAS 
(turning aggression inwardly). When these 
groups were divided on the basis of high 
versus low relative anxiety, the relationship 
between defense styles and Q DRF increased; 
relatively anxious Ss in the aggression-out 
(TAO), projection (PRO), and denial (REV) 
groups had significantly more Q DRF (about 
eight times per month) than relatively anx- 
ious Ss in the intellectualizer (INT) and 
aggression-in (TAS) groups (about two per 
month). There was almost no differentiation 
in Q DRF across defense groups for rela- 
tively nonanxious Ss. Apparently, this lack of 
differentiation for low relative anxiety Ss 
contributed to the failure of the overall 
defense effect to reach the .05 level. 

The importance of valuing dreams was sug- 
gested by Domhoff and Gerson (1967). Evi- 
dence for the relationship of Q DRF to a 
positive valuation of dream recall was indi- 
cated in results from the Strauch et al. study 
(1966) showing that Q recallers more often 
believe that dreams have meaning and like 
to recall dreams. Q DRF might be a reflec- 
tion of the strength of S’s identification with 
specific class-bound or subcultural attitudes 
toward dream life. Intellectualization (INT) 
and aggression-in (TAS) could be seen as 
a tendency toward overcommitment (in the 
sense of conformity, bureaucratization, etc.) 
to social undervaluation of the utility or 
relevance of dream life. 

Perhaps dream recall is a frequently occur- 
ring, natural phenomenon of early childhood 
which is differentially reinforced by the soci- 
alization practices in societies that tend to 
value or undervalue dream recall. In our 
culture, does the presence of high Q DRF 
indicate a certain “primitivity” which may 
or may not be necessarily neurotic? In other 


666 


words, does high recall in some people indi- 
cate a certain freedom (e.g., nonconformity, 
creativity, capacity for innovation) from bu- 
reaucratic and “institutional” inhibition while 
indicating in others a neurotic rebelliousness, 
incompetence, or other disturbance? 


Relationship of DRF to Relative Versus 
General Anxiety 


It was predicted that DRF would be dif- 
ferentially related to relative anxiety rather 
than general anxiety. This prediction was 
borne out by the following results: (a) the 
relative anxiety main effects in the D and T 
data were significant (p<.05); (b) the 
interaction effect between relative anxiety and 
defense preference in the Q data was signifi- 
cant (p< .05); and (c) there were no sig- 
nificant differences in Q, D, or T DRF for 
high versus low general anxiety, either for the 
defense-preferent Ss as a whole (all of the Ss 
in the defense groups) or for the defense-non- 
preferent (control) Ss; none of the six ¢ tests 
reached significance. 

The differential effect of relative anxiety 
was also expressed in hostility elicited by the 
T procedure. Hostility was defined opera- 
tionally on the basis of RAs’ notes to the 
effect that S had hung up the phone without 
answering; expressed hostility verbally via 
sarcasm, impatience, exasperation; or was 
“not there” or “awake” on two or more sepa- 
rate occasions. A frequency count was made 
of Ss expressing hostility at least once during 
T. There were no defense differences. When 
the sample was divided on the basis of general 
anxiety, a chi-square of .29 (df = 1) was ob- 
tained, indicating no difference in the number 
of Ss high or low in general anxiety expressing 
hostility toward T. When the medians for 
individual defense groups were used to divide 
the sample, that is, when relative anxiety was 
used, a chi-square of 10 (df=1) was ob- 
tained indicating a significant difference 
beyond the .005 level. Fewer Ss high in 
relative anxiety expressed hostility than Ss in 
the low relative anxiety group. 
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INTELLIGENCE DEFINED AS EFFECTIVENESS OF 


APPROACHES 


CONSTANCE T. FISCHER * 


Duquesne University 


The need for a generally acceptable definition of intelligence is reviewed. Em- 
phasis is placed on the fact that any proposed definition should be consonant 
with research findings as well as current and previous conceptions. The follow- 
ing definition is offered: Intelligence refers to the effectiveness, relative to age 
peers, of the individual’s approaches to situations in which competence is highly 
regarded by the culture. This proposed definition opens the way to a theoretical 
and clinically applicable conception of intelligence, which is also presented. Its 
main tenet is that along with more widely acknowledged variables, a person’s 
available frames of reference or meaning networks are the limiting conditions 
of his possibilities for action. Within this scheme, frames of reference together 
with their behavioral aspect constitute the “approach” referred to in the 


definition. 


The term “intelligence,” as a referent to 
measured intellect, came into being in the 
wake of last century’s faculty psychology. 
Subsequent formulations of intelligence and 
measurements thereof have been explicitly 
described by Tuddenham (1963). The guid- 
ing concerns were primarily based on the 
empirical, pragmatic activities of Binet and 
Terman. Thus, such issues as IQ constancy 
and culture-fair testing are by-products of the 
prevailing emphasis on test scores. The nature 
of intelligent behavior itself has received sig- 
nificantly less attention. Similarly, discussions 
of the nature-nurture controversy have cen- 
tered on changes in test IQ rather than on 
the child and his environment. Factor ana- 
lytic conceptions of intelligence, still the most 
prevalent, are, of course, restricted by the 
limits of already constructed, usually prag- 
matic tests. Perhaps the most poignant re- 
minder of the difficulty in moving beyond the 
tests themselves is the continuing prominence 
of Boring’s (1923) definition of intelligence 
as that which intelligence tests test. 

The above operational definition has, of 
course, proved conducive to highly developed 
factor analytic conceptions of intelligence 
both as a hypothetical construct and as an 
intervening variable. During the development, 
definition and conception have become nearly 


1 Requests for reprints should be sent to Co: 
mstance 
T Fischer, Psychology Department, Duquesne Uni- 
versity, Pittsburgh, Pennsylvania 15219. 


synonymous. Definition is intended here as 
the identification of a phenomenon as it is 
most widely understood. Facts are highly 
reliable observations of relevant relationships 
and may be placed in appendixes to the 
definition. The facts are about the defined 
phenomenon, and, of course, should be taken 
into account by a theory of conception. The 
latter, which goes beyond the facts, serves 
as a guide to integration and further fact- 
gathering. A definition can be consonant with 
many conceptions; a concept, however, neces- 
sarily precludes assimilation of some facts. A 
major tenet of the present paper is that while 
definition and conception imply each other, 
treating them as identical is not conducive 
to the development of theory and the dis- 
covery of empirical relationships. 

Although there is, then, a need for an 
agreed-on definition of intelligence prior to) 
research and theorizing, there is none which 
is commonly held and utilitarian. Theoretical 
articles usually proceed without an explicit 
definition of their topic. Psychology and 
education textbooks typically discuss the 
problems of definition, make reference to 
common-sense notions of brightness, offer the 
operational alternative under the heading of 
intelligence as a hypothetical construct, and 
then present a sample of theories from Binet 
through the latest factor-analytic conclusions: 
Finally, a summary of research facts is give”: 
Where a straightforward definition of intel | 
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ligence is attempted, the usual outcome is 
either a list of traits, said to represent intel- 
ligence, or an outright conception or theory 
of intelligence. In all cases, an explicit, com- 
monly held definition is somehow by-passed. 
Where the operational alternative is given 
as such, it is not usually accompanied by what 
would seem to be a necessary discussion of 
intelligence tests, 

The purpose of the present paper, then, is 
to offer a definition of intelligence that will 
reflect its most basic common meaning. At 
the same time, it will neither include nor 
exclude specific theories or conceptions. The 
proposed definition, nevertheless, is intended 
to reflect recent trends in both research and 
theory, and to encourage further develop- 
ments. It is especially structured to allow for 
growing clinical conceptions of intelligence 
(eg, Glasser & Zimmerman, 1967; Lyons, 
1963), one of which is outlined. 


PROPOSED DEFINITION 


Intelligence refers to the effectiveness, relative to 
age peers, of the individual’s approaches to situ- 
ations in which competence is highly regarded by 
the culture. 


This definition is consistent with the history 
of the concept as comparative, quantitative, 
and operationally grounded in standardized 
tests. It is thus consistent with the prevalent 
philosophy of science position, as presented by 
Spiker and McCandless (1954). They argue 
that intelligence is a useful concept insofar 
as it relates to other psychological variables, 
and that it has no reality outside of these 
relationships. Confusion about the nature of 
intelligence, according to their view, arises 
through the multiplicity of measurement 
Operations. The proposed definition goes 
beyond abstract operational reference by 
Specifying what it is that intelligence tests 
test, as well as by implying that measurement 
1S not of an entity. 

Besides serving historical continuity, the 
definition is contemporary. It is consistent 
with Bouchard’s (1968) description of an 
emerging research emphasis on both the 
Process character of intelligence and on the 
Question of intelligence as “a sub-problem 
associated with the more general problem of 
determining the structure of cognition and 
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thought [p. 27].” Thus, since “structure of 
cognition and thought” may be subsumed 
under “approach,” such research topics 
as information-processing strategies (Hunt, 
1961), computer simulation of cognitive 
structures (Reitman, 1965), and a structure- 
of-intellect model (Guilford, 1967) may all 
be classified under intelligence as defined here. 
Piaget’s (1950) formulations of schema, con- 
cept, assimilation, and adaptation may also 
be encompassed by “approach.” 


Effectiveness 


The reference to effectiveness is consonant 
with both historical and current use of the 
term “intelligence”; that is, it brings to mind 
the early emphasis on problem solving, and 
is quite in line with recent proposals of intel- 
ligence as sets of abilities and/or adaptations 
(Hayes, 1962; Mischel, 1968; Tuddenham, 
1963; Wallace, 1966; Wesman, 1968). From 
varying perspectives, all these authors stress 
the roles of antecedent conditions and reward 
parameters in the acquisition and demonstra- 
tion of abilities (intelligence), The present 
definition’s reference to effectiveness under- 
cuts entity and constancy notions, which these 
theorists decry. It encourages reference to 
an individual’s effectiveness rather than to 
his potential or aptitude. Indeed, this termi- 
nology implies what in practice is meant by 
aptitude: that present levels of effectiveness 
are generally predictive of later levels insofar 
as extant equipment, maturation, and experi- 
ence are requisite to further development. 

The reference to age-peer comparison is a 
reminder that in practice, IQ is the estimate 
of intelligence, and that it has specific sta- 
tisical properties. Finally, reference to age- 
peer standing also suggests that type of 
effectiveness may change with age. 


Approaches to Situations 


Tn order for the definition to have maximal 
utility, the term “approaches” must be amen- 
able to several interpretations. Specifically, 
strict behaviorists will conceive of approach 
as response. However, while the definition is 
intended to be useful to the behaviorist and 
learning theorist, its major implications are 
for the psychologist with an existentially 
oriented, holistic perspective. For the latter, 
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approach connotes both the frames of refer- 
ence, or network of meanings, and intentional 
behaviors of the testee. Portions of this sec- 
tion, therefore, will not be relevant in their 
present form to behavioristic conceptions of 
intelligence. 

Use of the term “approach” implies that 
there may be alternative routes to effective- 
ness. Inasmuch as the present definition 
avoids any implication that intelligence is an 
entity, the possibility of change is more 
readily apparent. Furthermore, the definition 
directs attention to the specific approaches 
used by the testee. In turn, it becomes pos- 
sible to investigate ways of teaching him 
more effective approaches. 

This view is also consonant with the em- 
pirical finding that test scores are unstable 
while patterns of perception and behavior are 
developing; concretely, up to and through the 
early school years, approaches are themselves 
unstable. Liverant (1960), in his presentation 
of intelligence as a descriptive rather than 
explanatory concept, emphasizes the need “to 
examine first the nature of antecedent condi- 
tions and then the subsequent responses sub- 
sumed by intelligence [p. 102].” Such an 
examination, he points out, would indicate 
whether a testee’s failure of an item were due 
to expectancy and reward-value variables or 
to the absence of the correct response in the 
testee’s repertory. In the present paper, 
however, it is posited that description must 
go beyond operationally identified stimulus 
and response variables. In this view of ap- 
proach, the testee behaves in accordance with 
his personal frames of reference—the meanings 
of the stimulus object available and salient 
to him. It is, then, more fruitful to deter- 
mine the identity of the stimulus (its mean- 
ing) for the testee. Similarly, it would be 
more productive to ask, first, what are the 
essential components (including frames of 
reference) of the ability to earn credit for a 
given item? And then, which components are 
absent for this testee? 

The testee’s situation includes more than a 
physical setting and a relationship to the 
examiner. The meaning, for him, of being 
tested is also a highly relevant frame of 
reference. For example, when a person fails 
an item it does not necessarily indicate 
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“inappropriate expectancies” or “reward 
values,” or absence of the scorable response 
from his repertory. It may be the case that 
the testee has approached the test situation 
with a frame of reference quite different from 
that intended in the test manual. For instance, 
he may have viewed intelligence as creativity, 
and hence disdained to give obvious answers, 
Or, the presence of the examiner may have 
served the testee’s prevailing concern with 
interpersonal matters: perhaps he viewed the 
situation as one in which he must not venture 
guesses lest he be judged foolish. 

While the above discussion is consistent 
with Wechsler’s (1950) description of intel- 
ligence as a function of the whole personality, 
it is not consonant with his recommendation 
of factor analysis as a preferred technique for 
identifying components of intelligence. The 
inadequacy of this recommendation is that 
statistical techniques necessarily preclude 
attention to individual frames ot reference. 

Perhaps, if the above view proves fruitful, 
new intelligence tests may go beyond testing 
for knowledge of content and presence of 
“thinking processes” to include direct sampling 
of approach. Already there is a tendency 
to view creativity as an integral aspect of 
intelligence, and creativity may be seen as 
use of unusual, yet effective, approaches. 


Culture 


The definition’s reference to competence 
highly regarded by the culture serves as a 
reminder that tests designed to measure in- 
telligence are historically and culturally 
rooted. Test content reflects the dominant 
values of the test maker’s socioeconomic class 
at the time of the test’s construction. Thus, 
standard intelligence tests have stressed 
school-related skills. Now, in an age of com- 
puter technology, concern with remaining 
above mechanization has led to a renewed 
emphasis on creativity as a uniquely human 
aspect of intelligence. 

As it becomes more apparent that maturity 
and self-esteem are necessary for the mainte- 
nance of nonviolent, nonsuppressive interper- 
sonal relationships, these too may be directly 
Tepresented on intelligence tests. Relevant 
here is Wesman’s (1968) argument for peri- 
odic revision of both tests and school cur- 
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riculum in order to represent a wider spectrum 
of the abilities the culture has come to value. 
The present emphasis on culture is also con- 
sistent with the fact that scores decline with 
age. The decline may reflect increased spe- 
cialization at the cost of breadth. Further- 
more, maturity, which is associated with ef- 
fectiveness of approach, is not directly repre- 
sented in standard intelligence tests. Thus, 
many of the sources of adult effectiveness are 
not sampled by these tests. 

Moreover, it is apparent that by definition 
there cannot be a culture- or class-fair test. 
Instead of continuing efforts toward approxi- 
mating such a test, it would be better to rec- 
ognize that general intelligence tests indicate 
a person’s standing in his culture at large, re- 
gardless of the reasons. In this sense, these 
tests are valid for the culturally deprived in 
that low scores indicate low competitive 
standing. However, if the score is not to be a 
permanent label of low effectiveness, it must 
be accompanied by modifiers representing the 
tester’s evaluation of the bases of the low 
scores, for example, “hereditary retarded in- 
telligence,” or “low intelligence in a culturally 
deprived child.” Ideally, further data should 
describe the context in which the scores were 
earned: the testee’s approaches and their 
degree of effectiveness, as well as other rele- 
vant aspects of the testing situation, includ- 
ing the examiner’s role. This goes beyond the 
standard recommendation of stating whether 
an IQ is valid; it recommends a description of 
the ways in which the score is valid. In this 
Manner, scores are more clearly differentiated 
for research as well as theorizing purposes. 


Further, they are available for guidance 


eyond mere placement. 

A related asset of the definition’s emphasis 
on the role of culture in intelligence test con- 
struction is that the appropriateness of other 
types of tests becomes more apparent. At 
times, a person’s overall standing in culturally 
valued areas is irrelevant; for example, tests 
of specific abilities are more appropriate for 
determining school readiness, job training, or 
telative standing in a subculture. 

_ Finally, thematizing the role of culture has 
implications for the debated existence of a 
eneral factor underlying all intelligence. To 
extent that competencies valued by a 
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given culture at the time of the test’s construc- 
tion are closely related, “g” can be demon- 
strated through particular statistical tech- 
niques. 


EFFECTIVENESS OF APPROACHES AS A 
CLINICAL CONCEPT 


It is hoped that the proposed definition 
will facilitate emergent, more comprehensive 
conceptions and theories of intelligence. The 
purpose of the present section is to indicate 
the direction of one such conception as well as 
to sketch its applicability to both theory and 
clinical practice. The conception is holistic 
and existential. Specifically, it is based on the 
assumption that the only reality man can 
know is one that he co-constitutes. That is, 
insofar as man experiences things and events, 
he apprehends them through his own network 
of categories and meanings—his frames of 
reference. In turn, his behavior expresses his 
organizations of meaning. Frames of refer- 
ence, of course, are not necessarily cognitive 
or logical, but are rooted in man’s entire being, 
including its bodily aspect. This emphasis on 
frames of reference places the focus on the 
individual who is demonstrating intelligent 
behavior, rather than on abstract, discon- 
nected processes. Such a shift of focus implies 
that human processes may not be as auto- 
matic (and hence logical) as present research 
models would suggest. 

While the present position acknowledges 
the importance of neurophysiological struc- 
tures and functions for the development of 
frames of reference, it asserts that the latter 
may not be reduced to the former. It is also 
recognized that frames of reference are learned 
and that it is important to discover how such 
learning occurs. Such discovery, however, will 
remain in terms of statistical probability un- 
til Ss are viewed as individuals. Some persons 
respond in a manner different from the ma- 
jority because they perceive the material or 
event differently. The efficacy of obtaining S’s 
experience of the stimulus situation has been 
empirically demonstrated by Giorgi (1967; 
Giorgi & Colaizzi, 1966). 

The above perspective has concrete rele- 
vance for clinical use. In order to understand 
past behavior or predict future action, the 
perceptual and behavioral possibilities avail- 
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able to the individual must be known. In 
other words, a man’s approaches are the lim- 
its of his possibilities, since he can act only in 
accordance with the meaning a situation has 
for him. 

The method best suited for comprehending 
a testee’s frames of reference, and thus of his 
present possibilities, would seem to be a de- 
scriptive one. That is, behavior-in-context 
should be observed, and its major trends de- 
scribed as accurately as possible, prior to any 
theoretical or explanatory activities. For ex- 
ample, the statement, “When required to 
answer verbally, the testee never altered or 
corrected a response,” remains truer to what- 
ever frames of reference it reflects than would 
reductive allusions to anxiety, overconfidence, 
motivation, or lack of autocriticism, Further, 
through such a description, a reader can view 
the testee more directly than he could through 
an examiner’s derived constructs. It bears 
acknowledging, however, that the examiner’s 
and reader’s own frames of reference are the 
limiting conditions of their understanding of 
the testee. 

Descriptive analysis is dependent on the 
examiner’s understanding of which approaches 
are required for success on each subtest and 
each item, A form of “testing the limits,” as 
used in projective techniques, seems appropri- 
ate for identifying absent frames of reference. 
Glasser and Zimmerman (1967) have shed 
some light on the attitudes and behaviors nec- 
essary for success on various WISC subtests. 
Earlier publications on the clinical usefulness 
of individual intelligence tests have specified 
item requirements only as vehicles for the 
demonstration of psychoanalytic constructs 
(e.g., Fromm, Hartmann, & Marschak, 1957; 
Rapaport, Gill, & Schafer, 1946). 

Among intelligence tests, the Wechsler 
scales seem most suited to descriptive analy- 
ses, especially since the subtests require vari- 
ous extended relationships with the examiner. 
Further, they also tap different modes of re- 
lating available to the testee; for example, 
some subtests provide relatively clear stan- 
dards which the testee can see for himself 
(e.g., Block Design), while others require that 
the testee act according to personal standards 
(e.g., Comprehension). Finally, many items 
often tap highly personal meanings of com- 
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mon objects and events. Together, these fea- 
tures afford a sample of the testee’s approaches 
and their effectiveness. While projective tech- 
niques are better suited for ascertaining the 
richness of S’s frames of reference, an indi- 
vidual intelligence test is appropriate for 
gauging the effectiveness of concordant be- 
havior. The more restricted the testee’s avail- 
able frames of reference, the more severe is 
his disadvantage in our culture. Obviously, a 
testee’s available approaches as well as ef- 
fectiveness are best described through data 
from many sources. 

The following excerpts from an intelligence 
test report concretely exemplify the descrip- 
tive method. The account first provided 
identifying information, reason for referral, a 
list of supplementary tests administered, and 
WISC scores accompanied by a statement of 
IQ and its statistical meaning. This was fol- 
lowed by a paragraph captioned “Context of 
Earned Scores,” in which the testee’s general 
appearance, attitudes toward the examiner, 
and concerns with being tested were described. 
Then, along with other comments related to 
the testee’s uneven performance, the follow- 
ing statements were made: 


When she was uncertain of what was expected of het, 
Diane’s diligent manner gave way to sarcasm, melo- 
drama, and rhetorical comment such as, “You don't 
expect me to get THAT do you?!!” On the other 
hand, when Diane could see the full parameters of 
a task, and knew that she possessed the required 
tools, she worked steadfastly, quietly, and effectively. 
E example, on Mazes she exceeded the scoring 
scale, 


Quoted after this were supportive remarks 
made by the testee about her school work. 
Finally, in the Conclusions section of the re 
port, Diane’s present standing in her classroom 
was estimated, and it was suggested that: 


Diane’s growth and development will be facilitated 
most if her teachers understand that her occasioni 

professed disdain for a particular activity is probably 
an indication that brief individual instruction is ™ 
order, That is, at such points she would profit from 
an explicit demonstration of what is required an 

how it is done, an opportunity to try her skill, and 
finally, reassurance that she is on the right track. 


It was intended that readers of the above 
Teport would not only know Diane’s levels of 
effectiveness, but also something about the 
approaches (frames of reference and their be 


these various levels. The primarily descriptive 
method obviated the usual recourse to causal 
constructs and personality traits along with 
their multifarious meanings to readers. In- 
stead, it could become apparent that for 
Diane, the meanings of unfamiliar tasks and 
uncertain expectancies are ones of danger, 
of something to be avoided. From intratest 
and intertest data, as well as interview ma- 
terial, these meanings emerged as a major 
network in terms of which she views much 
of her environment. Most often, this frame 
of reference precludes efficient problem solv- 
ing. Perhaps, if her teachers follow the report’s 
suggestion, Diane can come to see the un- 
familiar in a new light, one less fraught with 
negative connotations. 

Returning to a theoretical discussion, it is 
acknowledged that since frames of reference 
are not directly observable, their ontological 
status is that of a construct. In the absence 
of strict S-R identifications, however, they are 
not intervening variables. Neither are they 
hypothesized to exist as entities, as causal 
forces or energies, nor as cybernetic processes. 
Instead, frames of reference are proposed as 
abstractions of man’s essential character when 
he is emphasized as a being in relation to 
others, objects, and events, That is, man is 
always oriented at some level to something, 
his frames of reference being a term for 
this possibility. Specific frames of reference 
are inferred through the question: What 
Meanings must be experienced by the person 
that such-and-such a behavior is possible for 
him? Finally, this construct is not intended 
as a reduction of other phenomena, nor is 
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havioral aspects) through which she reaches 


it to be imposed on behavior prior to a de- 
scription of the behavior itself. Justification 
for the last statement is use of the descriptive 
method, which is based on the following view 
of causality. 

Any activity, or approach, valued enough 
by society to be called for on an intelligence 
test always occurs in the company of other 
necessary components (relation with tester, 
etc.). Since all are necessary, no one com- 
Ponent has the status of an explanatory con- 
struct (anxiety, drive, etc.); without any one 
of the components, the situation does not 
ų &xist. On the other hand, if the total situation 
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could be described, then the “if-then” require- 
ments of explanation would be fulfilled: ¿f all 
these components are present, then such-and- 
such is occurring. These components fall into 
two stages: in the first are those that can 
be abstracted as the frame of reference; in 
the second are those that can be seen in the 
activity itself. 

In his article presenting an abilities con- 
ception of personality, Wallace (1966) sug- 
gested that if all of an individual’s abilities 
and ability deficits were known, as well as the 
antecedent conditions necessary for their dem- 
onstration, then motivational inferences would 
be superfluous. To go further, it is posited 
here that complete knowledge of an individ- 
ual’s approaches can serve to indicate his 
available frames of reference, and hence which 
activities are possible for him. Explanatory 
constructs would, then, be unnecessary since 
an explanation is already constituted by a 
complete description of necessary and suf- 
ficient conditions. Specifically, we have by- 
passed the need to posit interference of one 
construct (intelligence) by another construct 
(anxiety, impulsivity, etc.). 

For example, in the following statement 
anxiety assumes its usual explanatory status: 
“The boy was so anxious that he impulsively 
gave only partially correct responses.” A de- 
scriptive analysis, however, might reveal that 
something like the following were necessary 
aspects of the boy’s frames of reference: he 
may experience the test as something he must 
do well, while being uncertain of his ability, 
and see no effective way to stick with the test 
items (or, see his “impulsive” response as the 
best he can do). 

Elements essential to activity in addition 
to a particular frame of reference are better 
known and are merely suggested here: pres- 
ence of the item and physical tools, prior 
practice, unlimited time, feedback from the 
examiner, adequate vision, etc. 

The level of possibility opened by a par- 
ticular frame of reference should also be 
specified. For example, physical retaliation 
may have a place in a testee’s network of 
meanings, but in what way? Perhaps its focal 
meaning is one of being attacked. Or, it may 
be that while he is aware of instances of 
physical retaliation around him and, at times, 
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can fantasy himself as an avenger, it is liter- 
ally inconceivable to the testee that he might 
actually wreak physical vengeance on a real 
person. Or, the interpersonal meaning of being 
with another (the tester) may preclude men- 
tion of conceivable violence. Hence, if a 
test item asks what he should do to pro- 
tect himself from an armed assailant, none 
of the above frames of reference would open 
the way to a response involving aggressive 
self-protection. 

From the descriptive approach, more than 
just the testee’s level of intelligence may be 
reported. At least some of the ways he viewed 
the test items can be stated. These networks 
of meanings, when adequately submitted to 
“testing of limits,” may be understood as the 
current limits of his possible approaches. A 
starting point is thus provided for both retro- 
spective investigations of how a testee’s ap- 
proaches came to be and predictive efforts 
toward specifying the conditions under which 
the same approaches would again be evident. 
These data are, of course, relevant for teaching 
the therapeutic efforts. 
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Operant-interpersonal treatment of marital discord is premised on the assump- 
tion that successful marriages can be differentiated from unsuccessful marriages 
by the frequency and range of reciprocal positive reinforcements exchanged by 
both partners. Beginning with the clarification of behavior change objectives 
for each partner, a four-step treatment approach culminating in an exchange 
of positive responses on a reciprocal basis is suggested. The treatment of four 
couples complaining of low-rate conversational and sexual behavior is sum- 


marized, stressing the use of a token 


system as a prosthesis to facilitate the 


transition to increased positive reinforcement. 


The operant-interpersonal approach to 
marital treatment rests on three assump- 
tions concerning the character of marital 
interaction. First, it is assumed that the exact 
pattern of interaction which takes place be- 
tween spouses at any point in time is the 
most rewarding of all of the available alterna- 
tives. This implies that the interaction be- 
tween spouses is never accidental; it repre- 
sents the best balance which each can achieve 
between individual and mutual rewards and 
costs (Thibaut & Kelley, 1959, p. 12). Thus 
when a husband consistently fails to leave his 
friends in order to spend time with his wife, 
it may be concluded that his friends offer 
gteater relative rewards than his wife. 

The second assumption is that while the 
specifics may vary for each couple, most mar- 
tied adults expect to enjoy reciprocal rela- 
tions with their partners. Reciprocity has the 
general sociological connotation that “each 
party has rights and duties [Gouldner, 1960, 
P. 169]” and the specific behavioral connota- 
tion that each party to an interaction should 
dispense social reinforcement at an equitable 
Tate (Patterson & Reid, 1967, p. 1). In effect, 
a quid pro quo or “something for something” 
arrangement underlies successful marriage 
(Jackson, 1965, p. 591). The exchange of 
Tewards in marriage may be viewed as a 
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quasi-legal contract affording distinct safe- 
guards to each partner. Whenever one partner 
to a reciprocal interaction unilaterally re- 
wards the other, he does so with the con- 
fidence that he will be compensated in kind 
in the future. For example, if the husband 
agrees to entertain his wife’s parents for a 
weekend, he does so with the expectation that 
his wife will accompany him on a weekend 
fishing trip at some time in the future. 
Reciprocity develops as a consequence of 
a history of positive reinforcement. There is 
extensive empirical support for the proposi- 
tion that ego will be more attracted to alter 
and will reinforce alter more if he has 
been positively reinforced by alter (Bachrach, 
Candland; & Gibson, 1961; Brewer & Brewer, 
1968; Byrne, 1961, 1962; Byrne & Nelson, 
1965; Homans, 1961; Komorita, Sheposh, & 
Braver, 1968; Newcomb, 1955; Pruitt, 
1968).2 When disordered marriages are eval- 
uated in light of this reinforcement-attraction 
hypothesis, it is seen that each partner rein- 
forces the other at a low rate and each is 
therefore relatively unattractive to and un- 


reinforced by the other. 


3 Byrne and Rhamey (1965) have postulated a law of 
attraction magnitude which takes the following form: 


=(PRz X M) 
4 = m (SERXI INEX 2 TR 


. . . [where] X is a positive linear function of the 
sum of the weighted positive reinforcements (Num- 
ber X Magnitude) received from X divided by the 
total number of weighted positive and negative rein- 
forcements received from X [p. 887]. 
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The third assumption is that in order to 
modify an unsuccessful marital interaction, it 
is essential to develop the power of each 
partner to mediate rewards for the other. In 
support of this assumption, it can be shown 
that individuals will be more positively at- 
tracted to each other when each has been 
successful in influencing the other to comply 
with his wishes (Thibaut & Kelley, 1959, p. 
124; Thibaut & Riecken, 1955). Conversely, 
it can be shown that when one party to an 
interaction fails in his influence attempts, he 
becomes “socially bankrupt” (Longabaugh, 
Eldred, Bell, & Sherman, 1966, p. 87) as he 
lacks the resources needed to control the 
other’s behavior; failing to gain control 
through positive strategies, he resorts to nega- 
tive means of control. 

In successful marriage, both partners work 
to maximize mutual reward while minimizing 
individual costs. A reciprocal exchange of 
potent social reinforcement is established in 
which each partner controls sufficient rewards 
to compensate the other for the rewards which 
are expected or received from him. In an un- 
successful marriage, both partners appear to 
work to minimize individual costs with little 
apparent expectation of mutual reward. In an 
effort to trim costs, few postive rewards 
are dispensed; positive reinforcement, as a 
strategy of behavioral control, is replaced by 
negative reinforcement (removal of, an aver- 
sive event following the expected response). 

Either or both of two broad patterns of 
behavioral control, coercion and withdrawal, 
are likely to emerge in unsuccessful marriages. 
In coercion, one member seeks to gain posi- 
tive reinforcement from the other in exchange 
for negative reinforcement (Patterson & Reid, 
1967). As an example, a husband might wish 
his wife to express greater affection; following 
the failure of his amorous advances, he might 
become abusive, accusing his wife of anything 
from indifference to frigidity, abating his 
criticism when he receives the desired affec- 
tion. The three flaws in this approach are: 
first, to the extent that he makes himself 
unpleasant, he is less likely to receive affec- 
tion; second, to the extent that he is abusive 
or accusing, he debases his wife’s affection and 
simultaneously reduces its reinforcing proper- 
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ties for himself; and third, to the extent that 
her affection is offered in compliance to his 
demand, it will appear to be appeasement 
rather than a gesture of genuine affection 
(Haley, 1963, 1967). 

The withdrawal which is likely to occur in 
unsuccessful marriages is analogous to one of 
several strategies available in a prisoner’s 
dilemma game: 

If the structure of the situation is such that (a) the 
reward is small, (b) terminating the interaction is 
made difficult, and (c) the retaliatory response must 
take a form which is identical to the disliked other's 
harmful behavior, then cognitive consistency prin- 
ciples would lead one to expect an increase in ill will, 
but no retaliatory behavior [Swingle, 1966, p. 270]. 


Retaliatory behavior might require the hus- 
band to match his wife’s actions; but to the 
extent that he devalues his wife for so be- 
having, consistency would demand that he 
behave otherwise. Withdrawal has the advan- 
tage of denying satisfaction to his wife while 
at the same time creating a situation requiring 
her to continue to behave assertively. Thus 
it is a low-risk tactic of control. The reinforce- 
ment for approach behavior on the wife’s 
part would be termination of the husband’s 
withdrawal. At the same time that the hus- 
band is withdrawn from his wife, however, 
he may also find other social and nonsocial 
reinforcers in his cronies, mistress, or can 
of beer. 

Based on this formulation, the operant- 
interpersonal approach seeks to construct a 
situation in which the frequency and intensity 
of mutual positive reinforcement is increased. 
The effect of positive reinforcement in in- 
ducing positively “biased scanning [Janis & 
Gilmore, 1965],” or searching for assets 
rather than liabilities in the spouse, has been 
well demonstrated, particularly when the 
positive reinforcement is large and is offered 
under positive sponsorship (Elms & Janis, 
1965, p. 53). It is anticipated that this post- 
tive scanning will replace negatively bi 
attitudes, making positive responding more 
likely. Positive responses, in turn, are intended 
to augment the range and importance of social 
reinforcement mediated by each spouse for 
the other, leading to reestablishment of 
successful interaction patterns, 
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TREATMENT CONSIDERATIONS 


Operant-interpersonal treatment occurs in 
four orderly steps. The first step requires 
training the couple in the logic of the ap- 
proach, consisting of two self-evident prem- 
ises. The first premise is that the impressions 
which each spouse forms of the other is based 
on the behavior of the other. Accordingly, 
when one changes his behavior, there are 
corresponding changes in the other’s impres- 
sions and expectations of him. Spouses who 
lose sight of this typically attribute marital 
difficulty to the personality of the other. For 
example, the wife who believes that her hus- 
band “is passive” implicitly suggests that her 
husband’s personality must change before the 
marital disturbance can be overcome. Con- 
versely, the wife who believes that her hus- 
band “behaves passively” need only find ways 
to modify her behavior (as she is in control 
of his actions) in order to modify his prob- 
lematic responses. 

This leads to the second premise, which 
asserts that in order to change interaction in 
a marriage, each partner must assume initia- 
tive in changing his own behavior before 
changes can be expected in his spouse. The 
typical couple is “locked into” problematic 
patterns of interaction as long as each re- 
quires a change in the other prior to changing 
his own behavior. If coercion and withdrawal 
are in fact basic problematic processes in dis- 
cordant marriages, then spontaneous behavior 
change is highly unlikely to occur, as the 
response cost is too great and the potential 
teward too small. 

Clients benefit from an explanation of the 
logic of the approach for at least two reasons. 
First, such explanations may help to free 
each spouse from his inaccurate and nega- 
tively biasing prejudices. Second, when each 
Spouse is fully aware of the logic of the 
treatment, he can participate more fully in 
effective therapeutic planning and execution 
(Stuart, 1967). 

The second step in treatment consists of 
asking each spouse to list the three behaviors 
which he would most like to accelerate in the 
other. This task is often subject to four dif- 
ficulties. First, couples tend to begin by list- 
ing requests for decelerating negative behav- 
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iors, which may be expected in view of the 
fact that much of their interaction prior to 
seeking treatment has been negative. The dif- 
ficulty with attempting to decelerate behavior 
is that its attainment would require the use 
of aversive stimuli or extinction patadigms, 
and the typical unhappy couple is already 
disproportionately committed to these nega- 
tive strategies. Second, each spouse is likely 
to phrase his “three wishes” in molar rather 
than molecular units, and many of these are 
likely to be formulated in preverbal terms. 
For example, a husband is likely to list the 
request that his wife “act more feminine.” 
“Feminine” is a modifier which is subject 
to varied interpretation and which may mod- 
ify a wide range of behaviors. These molar 
requests must be reduced to specifics which 
include description of the desired behavior, its 
rate, and the context in which it is expected 
to occur. Third, each spouse is likely to pro- 
claim that the other should “know what I 
want—if I have to tell him, then he is not 
as sensitive to me as he should be.” Many 
unhappily married couples share the naive 
expectation that their spouse should be clair- 
voyant, and it is therefore often necessary 
to stress the need that each must communi- 
cate his wishes to the other in order to in- 
crease the probability that they will be 
gratified. The fourth obstacle in this appar- 
ently simple listing of behavior change objec- 
tives is concerned with the punctuation of the 
behavioral chains, It has been observed that 
each spouse is likely to describe each unit of 
his own behavior as sandwiched between two 
negative actions on the part of his spouse 
(Watzlawick, Beavin, & Jackson, 1967, pp. 
54-59). These three element chains (other- 
negative, self-positive, other-negative) obscure 
the true interaction, as the speaker omits 
reference to his stimulation of the other’s 
initial negative actions. 

The third step in the treatment calls for 
transcription of the three wishes of each as 
headings on a Behavior Checklist which is 
posted at some convenient point in the house. 
Each spouse is asked to record the frequency 
with which the other performs the act which 
he desires. While monitoring undoubtedly 
influences the rate of occurrence of each set 
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of responses, this exercise does provide a 
crude base line against which to evaluate 
change. 

The fourth step consists of working out 
a series of exchanges of desired behaviors. 
The typical couple complains of a “lack of 
communication,” which is a euphemism for a 
failure to reinforce each other. On closer 
analysis, this complaint is frequently seen as 
a reference to low-rate conversational and 
sexual behavior. This communication gap is 
overcome in the fourth step by arranging for 
each partner to compensate the other for the 
behaviors which he identified as socially 
reinforcing to him. 

In marriages which have not dissolved 
reciprocity into coercion or withdrawal, a 
simple exchange of behaviors is effective. 
Couples are asked to accelerate desired behav- 
iors on an equal basis. For example, one 
husband complained that his wife failed to 
greet him at the door, that she did not 
straighten the family room in anticipation of 
his return home from work, and that meals 
were rarely ready on time. His wife com- 
plained that he failed to spend sufficient time 
with the children (it was agreed that 30 
minutes before bed was sufficient), failed to 
take her out for an occasional movie, and 
failed to pay attention to meals when they 
were well prepared. Accordingly, each of these 
behaviors was restated as a positive (e.g., 
greet husband at door) and was listed on the 
Behavior Monitoring Form. Each person 
recorded the frequency with which the other 
completed the desired behavior. When these 
behaviors were accelerated at a sufficient rate, 
other goals were added. 

In marriages in which reciprocity appears 
to be essentially absent, a token economy has 
proven useful. Where reciprocity fails, couples 
tend not to trust each other. It is therefore 
important to arrange for some immediate form 
of reinforcement. Tokens are an ideal media 
for such reinforcement because (a) they are 
given immediately, (b) they can be redeemed 
for the specific consequences which the recipi- 
ent deems desirable at that point in time, 
(c) they are concrete and unambiguous, 

(d) the giving and receiving of tokens is 
customarily associated with positive social 
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interchange, and (e) they permit an exchange 
of behaviors which are not contiguous. Token 
systems have been used effectively in a wide 
range of settings (e.g., mental hospitals— 
Atthowe & Krasner, 1968; classrooms—Clark, 
Lachowicz, & Wolf, 1968; and institutions for 
delinquents—Tyler & Brown, 1967), and ex- 
tension of this technique to marital treatment 
appears warranted. 


ILLUSTRATION AND RESULTS 


Four couples have used the token system 
to modify each other’s behavior. Individuals 
ranged in age from 24 to 52 and in education 
from high school diploma to doctorate. The 
couples were married for from 3 to 23 years 
and had a maximum of two children. Each of 
the couples sought treatment as a last-ditch 
effort prior to obtaining a divorce. In each 
instance, the wife listed as her first wish that 
her husband converse with her more fully, or 
at least that he not “close me out of his life 
even when he is at home.” Considerable dis- 
cussion was often necessary to identify what 
intensity level of conversation was positively 
reinforcing to the wife, and this was made 
clear and rehearsed during the treatment ses- 
sions. The wife was then instructed to pur- 
chase a kitchen timer which she could carry 
with her about the house. She was instructed 
to set the timer as soon as her husband en- 
tered and to give him one token when the 
bell rang after each hour in which he con- 
versed at the criterion level. If he failed to 
behave at the criterion level by the end of the 
first 30 minutes of each hour, she had to 
notify him of this and offer constructive sug- 
gestions, cueing him as to how his perform- 
ance could be improved upon. If she failed 
to do this, he had to be given a token even 
if he failed to perform adequately. If he so 
requested, at the half-hour cue time, the 
timer could be reset so that he could earn 4 
token during the next hour (so that he waited 
60 rather than 90 minutes before being 
rewarded), 

The criterion level for conversation is 
naturally a negotiable factor. No one could 
be expected to talk to his wife constantly; 
if for no other reason, his children would 
not allow it. Therefore, conversational tokens 
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Fic. 1, Average daily hours of conversation and weekly rate of sex of four couples— 
before, during, and after operant marital therapy. 


can be earned for a wider range of responses 
Tanging from intense conversation at one 
extreme to the wife’s feeling free to interrupt 
her husband with a question at agreed inter- 
vals at the other extreme. 

While tokens may have some intrinsic rein- 
forcing properties in their own right (in addi- 
tion to being associated with positive social 
responses when they are offered), they become 
more powerful when they function as contin- 
gencies for some other event. With the four 
Couples cited, tokens were redeemable at the 
husband’s request from a menu stressing 
Physical affection. A different menu was con- 
Structed for each couple which took into ac- 
Count the base-line level of sexual activity, 
the desired level of sexual activity, and 


the number of hours available for nonsexual 
(in this instance conversational) interchange. 
Each of these couples had sex less than once 
per week (ranging from once in the year 
prior to treatment to once in the week prior 
to treatment), each desired sex an average 
of three times per week, and each had ap- 
proximately 5 hours together on weeknights 
and 14 hours on weekends, making a total 
of approximately 52-54 hours per week. 
Accordingly, husbands were charged three 
tokens for kissing and “lightly petting” with 
their wives, five token for “heavy petting,” 
and 15 tokens for intercourse. (These be- 
haviors were not rehearsed during treatment 


sessions.) 
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Fic. 2, Marital satisfaction assessment inventory 
scores of four couples—before, at last session, and at 
follow-up of operant marital treatment. 


Tokens earned and spent were recorded on 
the Behavior Checklist, which also pro- 
vided data for continued graphing of inter- 
actional behavior. The performance of each 
of these couples is represented in Figure 1, 
where it will be seen that the rates of con- 
versation and sex increased sharply after the 
start of treatment and continued through 24- 
and 48-week follow-up periods. 

At the start of treatment, at the conclusion 
of regularly scheduled interviews, and at the 
time of follow-up, each spouse was asked to 
complete a brief inventory measuring the 
extent of his own and his perception of his 
spouse’s satisfaction in and commitment to 
the marriage. This inventory was adapted 
from the work of Farber (1957). The results 
are depicted in Figure 2 where it can be seen 
that the rate of reported satisfaction increased 
in association with the reported behavioral 
changes. These changes enabled the couples 
to become more similar to nonclinic families 
on this dimension (Levinger & Breedlove, 
1966, p. 369). 

With each of these couples, all therapeutic 
sessions were held jointly. Sessions were held 
during the first four, the sixth, eighth, and 
tenth weeks, for a total of seven sessions. 
When it is considered that these couples 
were each on the brink of filing for divorce, 
this could be considered relatively inexpensive 
treatment. Follow-up contacts were held by 
Phone or by mail, and all data, including 
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that collected during sessions, were based on 
self-report. 


DISCUSSION 


The therapeutic recommendations of alter- 
native courses of behavior are consistent with 
Goldiamond’s (1966, p. 118) recommendation 
to his patients that: “If he wished his wife 
to behave differently to him, then he should 
provide other stimuli than the ones which 
produced the behaviors he did not like.” Each 
spouse was directed in specific modifications 
of his own behavior in an effort to modify 
the behavioral environment in which his part- 
ner’s behavior occurred. The antecedents for 
changed behavior were probably twofold: 
therapeutic recommendations exercised some 
discriminant stimulus control as did the ex- 
pectation of changes in the behavior of the 
spouse. The therapist suggested that each 
party engage in behaviors which had doubtless 
been requested, cajoled, and demanded by 
each party countless times before. As thera- 
peutic directives, these new requests were 
differentiated from the old in two important 
respects. First, they were clarified. Second, 
they were removed from the context of co- 
ercive demands in which granting the request 
would have been tantamount to reinforcing 
dysfunctional patterns of negative reinforce- 
ment. In addition, the treatment was charac- 
terized to each couple as a “game” in which 
each would be able to modify the general 
rules of the relationship so that positive rather 
than negative strategies would pay off. 

It is impossible to isolate which aspects of 
this complex treatment approach carry the 
weight of the observed change. Behavioral 
rehearsal and the manipulation of social rein- 
forcers were the objectively identified vati- 
ables but other observers would undoubtedly 
stress other factors as well. One technical 
point of particular importance is the fact 
that at no time was an attempt made to 
“fade” (or slowly remove) the behavior modi- 
fication system, replacing it with more natural 
processes of behavioral control (Krasner; 
1965, 1966). Indeed, continued reliance 0" 
the therapeutic techniques as a means of pro- 
gramming and evaluating each other’s behav- 
ior could result in considerable gain with low | 
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cost. In fact, one characteristic of an ideal 
therapeutic technique might be that it is 
transferable to the natural environment 
without change. 

This pattern of treatment is likely to be 
challenged on two counts. Some critics will 
charge that the therapeutic strategies of this 
approach are “trivial” because they are based 
on “superficial” changes in behavior. Two 
counterarguments should be stressed. First, 
the only data which are ayailable to the 
therapist as a scientist are observable behav- 
iors, and it is at the level of behavior that 
changes must be sought (Skinner, 1953). 
Second, it must be stressed that these were 
the changes sought by the clients and were 
therefore in accord with the therapeutic con- 
tract. To have sought other goals would 
have meant ignoring the presenting com- 
plaints. 

In response to the triviality argument, it is 
also relevant to cite the data concerning the 
level of marital satisfaction. However, these 
must be considered to be “soft data” as they 
are based on global self-ratings such as 
“How committed are you to stay in this mar- 
riage?” and “What proportion of time spent 
with your spouse would you consider to be 
‘fully satisfying’?” It is impossible to deter- 
mine exactly what degree of meaningfulness 
can be attributed to answers to such questions 
and they should, at best, be interpreted as 
indications of trend. 
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OF AGGRESSIVE BEHAVIOR TOWARD 
THE MENTALLY ILL: 
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Thirty-six Ss participated individually in an experiment which required them 
to act as teachers whose job was to enable a person whom they believed to be 
an ex-mental patient to learn an arbitrarily correct series of button presses. 
The Ss could only “communicate” with the learner by shocking him when 
he was incorrect and could vary both the intensity and duration of the shock. 
It was found, using mean shock duration as an index of punitiveness, that 
the more emotionally unstable (and thus similar to the learner) an S believed 
himself to be, the less punitive he was toward the learner. The surprising 
utility of the shock duration measure as an index of aggression against a 


stigmatized person is discussed. 


It has been clear for some years that a 
person afflicted by conditions such as mental 
illness or blindness is stigmatized and de- 
graded in our society (Nunnally, 1961; Nun- 
nally & Kittross, 1958). More recently, owing 
partly to the stimulus provided by Goffman’s 
(1963) book, Stigma, a series of experi- 
mental studies was carried out which sought 
to specify some of the variables affecting the 
perception and treatment of stigmatized indi- 
viduals (Farina, Allen, & Saul, 1968; Farina, 
Holland, & Ring, 1966; Farina & Ring, 1965; 
Farina, Sherman, & Allen, 1968). The results 
of these studies support Goffman’s belief that 
stigmata are important determiners of inter- 
personal interactions. However, most of these 
studies have varied (perceived) character- 
istics of the stigmatized individual such as the 
_ severity of the stigma and the degree to which 
f he can be held accountable for it. In the 
Present study, on the other hand, the focus 
was on the other party in the stigmatizing 
Process, the person in interaction with the 
stigmatized individual. Unquestionably there 
are a multitude of characteristics which are 
Telated to the way a person feels and behaves 
toward a blemished other. The characteristic 
which was examined in the present study was 
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PERSONAL ADJUSTMENT AS A DETERMINANT 


psychological similarity. It was systematically 
varied while stigma remained constant. 

There is a dearth of experimentally based 
findings which indicate that perceived simi- 
larity to a stigmatized individual influences 
the way one responds to him, Nevertheless, it 
seems almost self-evident that such a rela- 
tionship exists and there is anecdotal evi- 
dence. Thus Wright (1960; cited by Goffman, 
1963) indicates that under some circum- 
stances the hard-of-hearing person may re- 
spond more harshly than the completely deaf 
to one who is deaf. A striking illustration of 
the way in which one’s similarity to a stig- 
matized group can increase aggressiveness 
toward that group has recently been given by 
Yablonsky (1966). He reports interviewing a 
white New York gang boy who was asked by 
a person he perceived as a Negro, but who 
had a Spanish accent, to join a gang that 
intended to kill all the “niggers.” Clearly the 
dark-complected man did not regard himself 
as a Negro. 

These examples imply a curvilinear relation- 
ship between similarity and punitiveness with 
intermediate similarity leading to greater 
punitiveness. It does not seem implausible 
that a person who is quite uncertain about the 
degree to which he resembles a stigmatized 
person, in contrast to those who know with 
certainty that they do or do not possess the 
stigma, would be the most anxious to estab- 
lish his distance from the stigmatized person. 
This hypothesis implies that persons of un- 
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certain or intermediate similarity to the stig- 
matized individual would behave in the most 
aggressive fashion toward the latter as a 
means of dissociating themselves from them. 
Rubin’s (1967) recent finding that an increase 
in self-acceptance leads to a reduction in 
ethnic prejudice also seems consistent with 
this hypothesis. 

An alternative and more straightforward 
hypothesis is that with increasing similarity 
there would be decreasing punishment admin- 
istered by Ss. This would happen if the closer 
one feels to a stigmatized individual, the 
more one identifies with him and feels com- 
passion for him. Milgram’s (1965) experi- 
ment is relevant since it shows that decreasing 
psychological distance between teacher and 
victim in his obedience experiment results in 
a decrease in punitiveness. 

One purpose of the present experiment was 
to permit a test between these two hypotheses, 
the curvilinear and the negatively monotonic 
one. The authors felt the effect of psycho- 
logical similarity would be enhanced under 
conditions of public disclosure as is suggested 
by the case of the New York Grand Dragon 
of the Ku Klux Klan who committed suicide 
when he learned that his Jewish identity had 
been discovered. Therefore, all of the Ss were 
led to believe that their stigmatizing condition 
(emotional instability) would become a part 
of their official college record. 


MerHop 
Subjects 


Thirty-six male introductory psychology students 
served as Ss in this experiment. Participation in 
psychological experiments was a requirement of the 
course and each S received 2 hours’ credit for this 
experiment. The Ss were assigned randomly in equal 
numbers to one of three experimental conditions. 


Procedure 


First session, When S arrived at the experimental 
laboratory, he was told by E? that the study dealt 
with the evaluation of rehabilitation techniques used 
for mental patients. The experiment, E explained, 
had to do with “what kind of people work best 
with mental patients in the sense of communicating 
with them and in teaching them something.” The E 
went on to outline some (fictional) views of psy- 
chologists concerning whether persons similar to or 
dissimilar from mental patients are most effective in 


3 Thanks are due Kenneth Wallston for conducting 
the experiment. 
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this regard, and indicated that the present experiment 
would attempt to find out. 

It was then explained that the experiment con- 
sisted of two parts which would require that he 
return some 4 or 5 days later. In the second part, 
one person would attempt to teach something to 
another. The teacher would always be a college 
student but the learner would be one of a group 
of former mental patients who had volunteered to 
participate for pay. In order to participate as a 
teacher, however, the student had to fall into one 
of three rather narrow categories of emotional 
stability: very stable, intermediate, or very unstable, 
He was told a Rorschach test would now be given, 
and it would be scored prior to the next session, If 
he fell into one of the three categories, he would serve 
as a teacher in the subsequent session, whereas, if he 
did not, he would merely learn the result of the 
testing and be excused. The Ss were then given a 
chance to withdraw from the experiment and were 
also told that their test results would become a 
part of their official records unless they objected. 
No one withdrew and no one objected. 

The first five cards of the Rorschach test were 
administered to all Ss. If S was assigned to the 
emotionally stable (ES) condition, Z’s manner was 
businesslike and nonevaluative, For those in the 
emotionally unstable (EU) condition, Æ pretended 
to be surprised at some of the answers and, in 
general, tried to give S the impression the responses 
were unusual. The Æ’s behavior toward Ss in the 
emotionally intermediate (EI) condition fell some- 
where between the two extremes just described. The 
purpose of E’s differential behavior was, of course, 
to prepare S to receive the particular psychological 
evaluation that chance rather than his own perform- 
ance had dictated for him. 

Second session. When he returned for his second 
appointment, each S was told he had a score which 
qualified him for the second phase of the study, but 
E was kept ignorant of the specific category to 
minimize bias. Hence, Æ could answer no questions, 
but he could arrange an appointment with one of 
the two psychologists who had scored the test. The 
S was here given a sealed envelope and told to spend 
a little while looking over his evaluation. Those in 
the ES condition found their emotional adjustment 
had been rated “very satisfactory” and ranked in the 
top 5% of college students in emotional stability. 
Those in the EI condition read that there were some 
signs of good and some signs of poor emotional 
stability which we hoped would induce uncertainty 
concerning how emotionally unstable they really 
were. Those in the EU condition received the 
following evaluation which we present in its entirety 
to convey the flavor of these “reports.” 


Mr. 
equivocally that his present emotional adjustment 
is decidedly unsatisfactory and a cause for concer. 
There is clear evidence from his protocols 0 
Serious unconscious factors which, when couple 
with stressful external circumstances, might 


’s test responses show quite un- 
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precipitate a breakdown. It is precisely because 
these difficulties do have an unconscious basis that 
they are doubly dangerous. While Mr. 

js no doubt vaguely aware of some psychological 
difficulties, it is extremely unlikely that he recog- 
nizes just how tenuous his emotional adjustment 
really is. His responses, which were unimaginative 
and constricted, indicated serious difficulties both 
in the expression of aggression and the manage- 
ment of sexual tensions. In short, I should say 
that Mr. is quite an emotionally un- 
stable student who is likely to experience 
difficulties in the future. On the basis of my 
experience in evaluating the emotional stability of 
college students based on the Rorschach, I should 
place Mr. in the bottom 5%. In addi- 
tion, I should recommend that he avail himself 
of the psychiatric services provided by the on- 
campus clinic if he is not already doing so. 


The three evaluations were approximately equal in 
length and were signed by one of the authors, with 
his PhD and clinical psychologist status indicated. 

At this point, the student was told that he and 
the learner would be in different rooms to protect the 
ex-patient’s anonymity. The task entailed teaching a 
pattern of five button presses. He was shown the 
teacher’s apparatus which had three lights labeled 
A, B, and C, as well as a button labeled “Shock” 
which could be set at any of 10 positions. He was 
told the lights corresponded to three buttons on the 
learner’s panel and his task was to teach the learner 
the pattern AABCA by shocking him each time he 
was wrong. He could set the shock from light to 
painful and administer it for as long as he wished 
but only 30 trials would be allowed. In the next 
toom there was a confederate who communicated 
the same 30 sequences of five button presses to all 
Ss, approaching and later departing from the correct 
sequence so that on the last trial he was as far 
as ever from the correct pattern. The confederate 
also recorded for each trial the level of shock used 
and the duration of the shock. This procedure is 
very similar to that employed in earlier reports and 
the apparatus is more fully described there (Farina 
et al, 1966; Farina, Sherman, & Allen, 1968). 

After the thirtieth trial, the student was stopped 
and it was pointed out that the learner had not 
Performed his task very well. He was then asked 
to complete a postexperimental questionnaire de- 
signed to check the effectiveness of the experimental 
manipulation as well as to assess S’s perception of 
the learner and evaluate the latter’s and his own 
Performance. 

Finally, a full debriefing session followed which 
had three objectives: (a) to determine whether and 
to what extent Ss were suspicious about any of the 
deceptions employed in the experiment—nine Ss were 
Suspicious about some aspects of the procedure and 
Were dropped from the study; (b) to explain in 
detail the real purpose of the experiment and to 
justify its several deceptions; and (c) to allow Ss 
to express their feelings and make any comments 
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TABLE 1 


NUMBER OF SUBJECTS INTERPRETING PSYCHOLOGICAL 
EVALUATION ACCORDING TO EXPERIMENTAL 
CONDITION 


Experimental condition 


Interpretation 
ES EI EU 
Emotionally stable 10 2 0 
Somewhat stable i 11 0 
Definitely unstable 0 1 11 


they cared to about the experiment. The Ss were 
encouraged to speak to the authors if they continued 
to be bothered by any aspect of the experiment, The 
Ss were promised a full write-up of the experiment 
and were, once again, before being dismissed, sworn 
to secrecy. 

Since the general procedure of this study as well 
as some of the “psychological” evaluations used 
could be expected to upset some of the Ss, at least 
during the experiment itself (Ring, Wallston, & 
Corey, in press), the present authors felt incumbent 
to determine whether there were any long-term 
negative aftereffects. Data relevant to this issue will 
be presented in the Discussion section. 


RESULTS 
Checks on the Manipulations 


Before examining other data, it is necessary 
to determine if Ss interpret the evaluations 
they received in accordance with the experi- 
mental intention. To do this a question was 
contained in the postexperimental question- 
naire which asked Ss to indicate how emotion- 
ally stable they were according to the evalua- 
tion. The numbers choosing each of the three 
alternatives for each condition are shown in 
Table 1. The table shows quite clearly that 
nearly all Ss accurately characterized the type 
of evaluation they received. Furthermore, the 
students responded emotionally to these eval- 
uations in the expected and desired direction. 
To determine this, they were asked to indi- 
cate how upset they were after reading the 
evaluations by checking the choices: 1. not 
at all; 2. a little; 3. fairly; 4. very much. 
Upset scores were derived by assigning 
numbers ranging from 0 for the first to 
3 for the last choice. The average scores 
for the groups were as follows: ES = .42; 
EI = .83; EU = 1.50. These group scores are 
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TABLE 2 


Mean DURATION OF SHOCK ADMINISTERED FOR 
Turee Brocks or 10 Trrats EACH AND 
For AIL 30 TRIALS 


Block 
Condition All trials 
1 2 3 
ES’ 13:867 | 15.15 13.69 42.70 
EI! 10.43 11.85 10.72 33.00 
EU 8.46" 8.48 8.25 25.19 


significantly different (F = 5.04, df = 2/33, 
p< 025). 

The other condition we wanted to create 
was to have Ss believe a copy of the evalua- 
tion would be sent to their counselor. To 
check on this, a question asked them to 
indicate what Æ had said concerning the re- 
port. Of the 35 Ss who answered this ques- 
tion, all but one indicated clearly that they 
remembered what £ had told them concerning 
the disposition of their test reports. The 
remaining student wrote, “I care very little 
about it.” It may be concluded, therefore, 
that the conditions necessary to test the 
hypothesis were satisfactorily established. 


Treatment of the Stigmatized Person 


As the principal measure of interpersonal 
aggression, mean trial shock duration was 
used, partly because other work using the 
same pseudoshock apparatus (Farina et al., 
1966; Farina, Sherman, & Allen, 1968) indi- 
cates the duration measure is a more sensitive 
index of interpersonal aggression than either 
intensity alone or a multiplicative index based 
on intensity and duration scores, although in 
this experiment, all three measures reveal 
very much the same pattern of condition dif- 
ferences. Other investigators (e.g., Berkowitz, 
1965; Hartmann, 1965), using similar shock 
devices, have also found a duration measure 
to discriminate significantly among experi- 
mental treatments. Additional reasons for the 
exclusive reliance on this type of index, based 
on S’s phenomenology, will be given in the 
Discussion section. 

The mean shock duration data are pre- 
sented in Table 2. The mean scores for each 
block of 10 trials are given in the table as 
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well as the average score for all 30 trials, As 
may be seen, there is a clear experimental 
condition effect with decreased punitiveness 
evident with increasing psychological similar- 
ity to the stigmatized learner. This effect, 
based on raw scores, is only marginally sig- 
nificant (F = 2.79, df = 2/33, 05 < p< .10). 
However, because of the skewness of the 
duration scores, the authors followed Berko- 
witz (1965) and subjected them to a square 
root transformation (Vx + Vx + 1) which re- 
sulted in an F = 3.68, p < .05. Consistent 
with this finding, a Kruskal-Wallis one-way 
analysis of variance (based on ranks) gener- 
ated a significant treatment effect (.02< 
< .03), with the ranking of conditions pre- 
cisely the same as in the parametric analysis 
of variance (i.e, ES > EI > EU). The re- 
sults of all of these analyses, then, support 
the conclusion that in this experiment a 
negatively monotonic relationship exists be- 
tween psychological similarity to a stigma- 
tized person and aggression toward him—the 
more one feels himself to be different from 
a blemished individual, the greater is one’s 
punitiveness toward him. The trend effect was 
not significant indicating no differences among 
the blocks of trials. 


Perception of the Stigmatized Person 


A number of items were included on the 
postexperimental questionnaire which were 
designed to measure possible differences 
among conditions in either self- or other- 
perceptions. For the most part, however, these 
items did not differentiate among conditions. 
The authors did find that the more similar 
an S$ felt to the confederate, the most confi- 
dent he was that the the latter would have 
learned the pattern had there been more trials, 
but the effect was only marginally significant 
(05<p<.10 by an F test; p< .05 by 
a x°). The only other significant difference 
(Ż < .05 by an F test) on a perceptual vari- 
able indicated that, compared to Ss in other 
conditions, EU Ss believed that the learner 
was experiencing less pain from the shocks. 
This is a trivial finding, however, since it was 
precisely these Ss who administered less in- 
tense as well as briefer shocks. And in fact, 
the correlation between estimated average 
shock intensity and actual intensity was .93 
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(p < .001) while the corresponding figure for 
duration was .38 (p < .05). 

The negative result which is difficult to 
understand concerns the perceived similarity 
of the student to the patient. To measure 
this, four items were included in the post- 
experimental questionnaire which asked S in 
various ways to indicate to what extent he 
was psychologically similar to the learner. 
The scores of these four items were summed 
to form a distantiation index and the three 
groups of Ss were essentially identical with 
respect to this index. In view of the success 
of the manipulation, differences among the 
groups should have emerged. Conceivably ego- 
defensive needs led those Ss who thought they 
were emotionally unstable to indicate being 
no more similar to the former patient than 
those who thought they were stable. However, 
we have no data to substantiate this or other 
possible hypotheses. 


Relationships between Perception and 
Treatment of the Stigmatized Person 
The results presented thus far imply that 
psychological similarity has a rather system- 
atic and straightforward effect on one’s behav- 
ior toward a person believed to be an ex- 
mental patient but no consistent relationship 
to one’s perception of him. From this it would 
appear that the treatment and evaluation of 
such a blemished individual do not necessarily 
correspond, a point that has been made before 
(Farina et al., 1966). One bit of evidence, 
however, suggests that under some circum- 
stances there might be a relationship. For 
_Ss in the EI condition, there is a correlation 
of 51 (p< .10) between felt similarity as 
Measured by the distantiation index and dura- 
tion of shock administered, whereas, for the 
other two conditions the correlations are —.09 
and — .14. Thus, in the EI condition, the more 
similar an S feels to the ex-mental patient, the 
more he shocked him. These data are con- 
sistent with the curvilinear hypothesis and 
Taise the possibility that there was something 
Psychodynamically special about the middle 
group. 


DISCUSSION 


Except for the tenuous finding just men- 
tioned which tends to complicate matters a 
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little, the results of this investigation appear 
fairly clear-cut. They lead to the general con- 
clusion that a person’s inclination to inflict 
pain on a stigmatized individual is blunted to 
the degree that the former believes himself 
to be psychologically similar to the latter. 
This generalization, nevertheless, should not 
be accepted too readily for it raises a number 
of issues that can only be discussed but not 
settled here. 

First of all, in what sense can it be as- 
serted that Ss in the EU condition, for ex- 
ample, believe themselves to be similar to the 
ex-mental patient? Their questionnaire re- 
sponses indicate that, compared to other Ss, 
they do not perceive themselves as any more 
similar to the learner. Assuming that these 
responses are not due to a conscious ego- 
defensive maneuver, it would appear that 
the independent variable has behavioral ef- 
fects which occur without any necessary cor- 
responding cognitive representation. The lack 
of cognitive parallels for these behavioral dif- 
ferences is in itself a matter deserving of 
attention and will be dealt with below. 

Another important qualification of the con- 
clusion stated at the outset hinges on the 
dependent variable which differentiated the 
conditions, namely, duration of shock. Why 
should this measure consistently relate to 
treatment of the stigmatized and not intensity 
of shock which is perhaps the more obvious 
variable? The authors’ interpretation rests 
on the assumption that there are strong nor- 
matively based restraints against treating 
former mental patients (and probably many 
classes of stigmatized persons) differently, 
and especially more negatively than “normal” 
persons. At all events, the prescriptions of 
American middle-class society demand that 
we respond to disadvantaged persons as if we 
were not aware of, or if it’s unavoidably no- 
ticeable, as if we did not care about, their 
stigma. We are not, in short, supposed to hold 
it against a man that he has been, for exam- 
ple, a prisoner or a mental patient. At the 
same time, it is undeniable that private atti- 
tudes toward such persons are often incom- 
patible with these normative prescriptions. 

The authors submit, therefore, that we are 
most likely to inhibit normatively inappropri- 
ate responses (e.g, condemnation, ridicule, 
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physical aggression, etc.) toward a disadvan- 
taged person when we can easily label or 
recognize them as such. It would be expected 
then that aggression toward the stigmatized 
would ordinarily assume more covert, indi- 
rect, or subtle form, thus keeping the ag- 
gressing individual often largely unaware of 
both the meaning and motivation of his ac- 
tions—a state of affairs that could serve an 
important guilt-minimization function for the 
person. 

In support of the foregoing argument, the 
data suggest that shock duration is a more 
subtle mode of aggression than shock inten- 
sity. The Ss were asked to estimate both the 
mean shock intensity and shock duration 
that they administered to the learner. When 
these estimates were correlated with the ob- 
jective values of these variables, the relation- 
ship, it will be recalled, was nearly perfect 
(r = .93) for intensity, but only mildly posi- 
tive for duration (r= .38). The difference 
between these two correlations was highly 
significant (z = 5.12, p < .0001). Obviously, 
it was much easier for Ss to monitor shock 
intensity, which they had to select before each 
shock, than shock duration for which they 
had no objective feedback. 

This interpretation, emphasizing the often 
covert character of aggressive acts toward 
the stigmatized, may also help to illumine the 
curious finding that revealed that it was the 
seemingly more sympathetic EI Ss who ad- 
ministered the longer shocks, If this is in fact 
a reliable relationship (and we repeat that it 
is most conjectural and demands replication), 
it points to the psychodynamically appealing 
proposition that among those persons who 
feel most uncertain concerning whether they 
too are stigmatized, those who are verbally 
the most positive in their attitudes toward 
the stigmatized are likely to be the most co- 
vertly punitive in their acts. 

Finally, the view taken here can be used 
to account for the somewhat puzzling finding 
(as in the Farina, Sherman, & Allen, 1968 
study as well as the present one) that signif- 
cant behavioral differences toward stigma- 

-tized persons occur in the absence of con- 
comitant perceptual or attitudinal differences. 
The authors propose that since many of the 
questionnaire items typically used in these 
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studies are patently evaluative in content, Ss 
can be expected ordinarily to respond to them 
in normatively appropriate ways. The authors 
also anticipate that items designed to tap 
evaluative dimensions but in more covert or 
nonobvious ways would be more successful in 
eliciting differences which would parallel be- 
havioral measures. 

It is worth pointing out that the present 
position can easily be subjected to empirical 
disconfirmation. To illustrate one possibility, 
since shock duration is regarded as a sensitive 
indicator of aggression only because varia- 
tions in duration are difficult for S to be 
aware of, any procedure which makes dura- 
tion as a variable more salient or easy to 
monitor (e.g., requiring S to record time of 
shock or feedback from confederate related 
to shock duration), for S ought to attenuate 
or obliterate altogether duration differences 
among treatment conditions similar to those 
used in previous experiments. 

So far the authors have only suggested an 
interpretation which accounts for the differ- 
entiating reliability of the duration measure; 
there has been no attempt, it should be noted, 
to provide an explanation for the findings of 
the present experiment. The question here is: 
Why should similarity lead to decreased puni- 
tiveness? 

The authors would like to argue, as was 
proposed in the introduction, that perceived 
similarity increases the likelihood of identifi- 
cation which, in turn, leads to a reduction in 
aggression. However, there is no direct evi- 
dence to support this interpretation and, as 
noted earlier, it is not even clear that perceived 
similarity did vary among the experimental 
groups as it was expected to. Furthermore, 
there are several plausible alternative expla- 
nations that cannot be ruled out. For exam- 
ple, it may simply be that Ss who receive 4 
Psychological evaluation which deals a blow 
to their self-esteem are less likely to be puni- 
tive toward another person, whether he 
stigmatized or not. Or it may be that when 
one is generally upset or preoccupied with his 
own feelings, one is not as likely to be puni- 
tive. It seems to the present authors that 
without the appropriate control groups whi 
might eliminate these alternative interpreta- 
tions, a high degree of confidence cannot be 
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- placed in the particular explanation that is 
fayored here. In addition to further research 
designed to evaluate the tenability of com- 
peting formulations, data are needed which 
will more directly corroborate the “identifi- 
cation hypothesis” than those available from 
the present study. 

Considered now are some reasons why the 
curvilinear hypothesis was not supported more 
strongly by the data, The post hoc view is 
that the heightened punitiveness toward a 
stigmatized person that this hypothesis pre- 
dicts is most likely to occur when (a) § is 
aware that he has the stigma or thinks or 
fears that he does and (b) he fears that im- 
portant others will learn of his real or sus- 
pected condition. It is crucial to note that this 
revised formulation requires the presence of 
both conditions in order for the effect to be 
observed. The present experiment presumably 
established the first condition, but one is left 
in doubt concerning the second. It is not at 
all clear that Ss thought it was very likely 
that the evaluations would ever really fall 
into the hands of persons whom they would 
wish not to know about their condition or 
just how upset they would be if such infor- 
mation were revealed to others. They were 
not, after all, now or formerly mental patients 
—it had only been implied for some that they 
might be. Whether being told that one is emo- 
tionally unstable and a possible candidate for 
a mental institution can be regarded as a 
stigma (let alone a severe one) is conjectural. 
What is needed to test this hypothesis 
definitively would seem to be an experiment 
_ which varies both an S’s motivation to con- 

ceal the stigma and the probability that it 
will remain concealed. Until such an experi- 
ment is conducted, it would be premature to 
abandon the (now revised) curvilinear hy- 
pothesis. 

One more point unrelated to any of the 
substantive or conceptual issues raised by this 
experiment requires brief mention. It was 
asserted earlier that certain ethical ques- 
tions could be raised concerning the proce- 
dures used in this study. While the following 
results do not in themselves resolve all such 
doubts about this kind of research, they do 
answer some of the more obvious questions 
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dealing with the long-term impact of this 
particular experiment. 

Some weeks after the conclusion of the 
study, open-ended questionnaires, to be filled 
out anonymously, were sent to each S. These 
questionnaires attempted to elicit an S’s 
retrospective account of his feelings during 
the experiment, immediately after it was over, 
as well as his present thoughts and feelings 
about it. Sixteen questionnaires were returned. 
One might think that if any selective factor 
were operative, it would favor those Ss who 
were most upset by Æ returning their ques- 
tionnaires. 

The results, in any case, were overwhelm- 
ingly negative. Only one S expressed any re- 
gret about having been in the experiment and 
that was because he wasn’t certain he had 
been told the truth. Most Ss actually said 
that they had enjoyed participating in the 
experiment and that they hadn’t at all minded 
being deceived. These findings strongly sup- 
port Milgram’s (1964) views about the lack 
of serious negative long-term effects from 
severe deception experiments and are also 
corroborated by the results of a more sys- 
tematic experiment (Ring et al., in press) spe- 
cifically directed to the question of immediate 
and long-term consequences of being an S in 
a Milgram-type obedience experiment. The 
findings from that experiment, too, offer no 
support to the position expressed by some 
(e.g, Baumrind, 1964) and no doubt feared 
by many others that there are likely to be 
worrisome long-term consequences for many 
Ss participating in such experiments, 

The only negative effect inferrable from 
these questionnaire data stems from the state- 
ments of a couple of Ss that they thought 
they might be more suspicious about psycho- 
logical experiments in the future. In view of 
some recent findings about the effects of de- 
ception experiments on subsequent experi- 
mental performance (e.g, Brock & Becker, 
1966; Fillenbaum, 1966), it is not immedi- 
ately obvious that these suspicions about fu- 
ture suspiciousness should in themselves be 
regarded as particularly alarming. 
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EFFECT OF INFORMATION, SUBJECT AGE, 
AND INSTRUCTIONAL SET ON 
PREDICTIVE JUDGMENTS 
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The effects of three variables on the accuracy of predictive judgment were 
examined. The three variables were (a) information, which had three levels— 
no information given the judges about the person whose behavior was to be 
predicted, biographical information given them about the person, and informa- 
tion given them about the person which was considered relevant to the 
behavior to be predicted; (b) subject age, which consisted of two levels— 
child and adult; and (c) instructional set, which consisted of two levels—no 
instructions given the judges, and instructions given them oriented toward 
furnishing a set regarding how to predict. Significant differences were found 
between the two levels of subject age and also among the three levels of 
information. The effect of instruction approached significance. There were some 
additive effects apparent when all three variables were considered at one time. 


Research in the area of accuracy of predic- 
tion has taken numerous forms. Initially 


; 


many of the studies dealt with the character- 
istics of the “good” judge of personality. In- 
telligence, artistic ability, interest patterns, 
sex, background, and social attitudes have 
been found to be related to accuracy of pre- 
dictive judgments (Cline, 1955; Estes, 1938; 
Taft, 1956). 

More relevant to the present study, how- 
ever, is some recent research which directs 
itself not to the characteristics of the judges 
but toward the conditions under which ac- 
curate prediction takes place. These studies 
have examined such conditions as training, 
information of various types concerning the S 
whose behavior was to be predicted, and feed- 


. back concerning accuracy of prediction 


(Blanchard, 1966a, 1966b; Crow, 1957; Kost- 
lan, 1954; Sechrist, Gallimore, & Hersch, 
1967; Sines, 1959). Generally, these studies 
have indicated that while there has been 
some progress in determining factors which 
affect accuracy of interpersonal prediction, 
the results are equivocal and the methodology 
questionable. For example, in a study by 
Fancher (1966), judges’ ability to predict 
increased with feedback, while they did not 


1Now at the Harrisburg State Hospital, Harris- 
burg, Pennsylvania, Requests for reprints should be 
sent to James L. Knestrick, 270-E North Arlington 
Avenue, Harrisburg, Pennsylvania 17109. 


in Sechrist et al.’s (1967) study. In addition, 
for most studies which have been reviewed, 
little attention has been directed to the im- 
portant consideration of Bruner and Tagiuri 
(1954) which involved the selection of ap- 
propriate criteria against which to assess the 
accuracy of prediction. 

The present study was designed to deter- 
mine the effect of three variables on the accu- 
racy with which judges can predict behavior. 
The three variables under examination were 
(a) age difference between the Ss whose be- 
havior was being predicted, (b) instructional 
set, and (c) information about Ss. 

Judges were asked to predict which alterna- 
tive act an S would choose to perform in a 
series of two-choice situations. The criterion 
against which to assess the accuracy of pre- 
diction was unequivocally determinable by 
the fact that S actually carried out one of 
the alternatives. This aspect of the study was 
responsive to the problem of criterion valid- 
ity—the criterion for assessing the accuracy 
of prediction was mot the judgment of an 
expert. 

Age difference between Ss was chosen as a 
variable because research has indicated that 
the characteristics of the S whose behavior is 
being predicted influence the accuracy of the 
predictions (Baker & Block, 1957; Estes, 
1938; Hamlin, 1954). It was assumed that 
concomitant with difference in age were other 
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characteristics which would influence the ac- 
curacy with which the behavior of Ss could 
be predicted. Two age levels were selected, 30 
years apart, in order to maximize the effects 
of this variable. 

Instructional set is used in the study to 
mean giving the judges instructions which 
might enable them to make more accurate 
predictions. It was assumed that people will 
manifest an element of consistency when re- 
sponding to various related environmental 
stimuli. This is especially true when the re- 
sponse requires that a performance actually 
be carried out. Since the behavior which was 
being predicted in the study required actual 
performance by the § whose behavior was 
being predicted, it was assumed that his re- 
sponses would be consistent when responding 
to similar items. Therefore, instructions were 
given the judges which alerted them to this 
assumed consistency. This variable had two 
levels, one of which was the consistency ma- 
nipulation just described, and the other was 
the absence of instructions other than those 
necessary to describe to the judges what was 
expected of them. 

The third variable under investigation was 
information about the S whose behavior was 
being predicted. The information took three 
forms. The first form of information was a 
control level in which the judges received no 
information concerning the § about whom 
predictions were to be made. The second was 
biographical information, consisting of back- 
ground information such as vocation, family 
structure, etc. This form of information was 
used because of research findings indicating 
that this might increase accuracy of predic- 
tive judgment (Kostlan, 1954; Sines, 1959). 
The third form of information was relevant 
information, chosen because of its assumed 
relationship to the behavior which was to be 
predicted, It consisted of some questions and 
answers intended to give the judges an idea of 
the general behavior patterns of the Ss. 

The overall purpose of the study was to 
determine the effects of the three variables al- 
ready discussed on prediction when the be- 
havior to be predicted was behavior which Ss 
must carry through. This requirement of 
performing the chosen alternative was as- 
sumed to influence the choice being made and 
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to affect accuracy in the judges’ predictions 
when in the presence of the three variables, 


MetHop 


Two video tape recordings were produced, each of 
which recorded the behavior of one S when he was 
presented with tasks requiring some physical manip- 
ulation. These acts constituted the behavior to be 
predicted. 

The sequence of events in the experimental situ- 
ation was as follows: Judges entered a room which 
contained a television monitor and a video tape 
recorder. Depending on the experimental conditions 
in effect at the time, one of the two video tapes 
was played to them, under one of three levels of 
information and one of two levels of instruction. On 
the tape the judges saw two people—one was the E 
and the other an S to whom 20 two-choice situa- 
tions were presented. The judges heard and saw the 
E provide the S with an opportunity to carry out 
one of the choices associated with the first task. At 
that point the tape was stopped and the judges were 
asked to predict which task S would perform. After 
the judges’ predictions were recorded, the tape was 
started again and the judges were allowed to ob- 
serve S’s choice. Then the second task was pre- 


sented. This procedure was followed for all 20 two- | 


choice situations. 


Video Tape Construction 


Two 25-minute, professionally directed video tapes 
were produced, one with an 11-year-old boy as the 
S and the other with a 43-year-old male adult. The 
situation which was seen on the tapes by the 
judges was that of two people facing one another 
across a table—E and S. The sequence of events for 
any one item of the 20 which were presented was: 
A card appeared on the monitor giving the number 
of the item to follow, for example, “Item 1”; the 
card faded out and a brief period of darkness en- 
sued; E and S appeared sitting at a table facing one 
another; Æ presented S with two choices (this was 
done both verbally, i.e., “Would you rather do this, 
or this?” and physically, i.e. Æ held up each choice 
so that both S and the judges could see the ma- 
terials clearly); the picture faded to black for the 
judges to record their predictions; the picture te 
appeared to show E asking S$ to choose which al- 
ternative to perform; S’s actual performance was 
shown in a close-up shot; the picture faded. This 
sequence was followed for all 20 items. 


The Tasks 


Santostefano (1957) investigated forced choice 
acts as measures of personality. He felt that there 
was a need for a testing method which 


(1) allows for the free expression of personality 
dimension in situations requiring the subject t0 
cope with people and objects, (2) produces um- 
equivocally identifiable responses, (3) yields mea- 


ae 


PREDICTIVE 


sures the meanings of which are not completely 
dependent on judgment, and (4) is feasible for 
use in ordinary clinic and research setting [p. 14]. 


He empirically determined the personality dimen- 
sions represented by choices in 20 two-choice situa- 
tions by factor analyzing their intercorrelations. As 
a result of the factor analysis, four relatively well- 
defined factors emerged. (See Santostefano, 1957, 
for the factor descriptions and tasks which loaded 
on each factor.) It was the choices in these 20 two- 
choice situations, then, which were used in this ex- 
periment as the behavior to be predicted. 


The Judges 


One hundred and twenty male volunteers from in- 
troductory psychology classes were used as judges. 
They were all males in order to control for re- 
ported sex differences in the ability to predict be- 
havior (Nidorf & Crockett, 1964; Whalen, 1955). 
Judges were randomly assigned in groups of 10 to 
each of 12 experimental conditions. The mean age 
of the group was 18 years 9 months. 


The Variables 


Subject age. An 11-year-old boy was selected as 
one S and a 43-year-old male adult as the other. 
Both were Caucasian. Since similarity between S 
and judge has been found to influence the accuracy 
of predictive judgment, and since similarity was not 
under study here, an attempt was made to control 
for it (Blanchard, 1966a, 1966b; Lundy, 1956; Not- 
cutt & Silva, 1951; Taft, 1966). This was accom- 
plished by determining empirically that adult age 
which the judges felt was as much unlike them as 
was an 11-year-old boy. This was done by having 
57 Ss whose average age was 18 years 7 months 
respond to a simple questionnaire. They were asked 
to indicate on a 10-point scale how much they felt 
like other age levels. The age levels ranged from 5 
to 55 years in steps of 5 years. Number 1 on the 
scale indicated complete similarity and Number 10 
complete lack of similarity. The results of this survey 
showed the mean value for the 10-year age level to 
be 8.42 and age 45 to be that age level above the 
age of the responders which they felt was equally 
unlike themselves as the 10-year level. On this basis 
the ages 11 and 43 were determined to be equally 
unlike the judges. 

The two Ss selected were instructed to respond to 
the tasks in any way they felt like responding. 
There was no attempt by E to influence their be- 
havior in any way. At the time of the video taping 
of their responses, they were unaware of the way in 
which the tape would subsequently be used. This 
Controlled for possible effects this information might 
have had on their responses. The items were pre- 
Sented in the same order, and the verbal instruc- 
tions given by Æ were the same in each case. 

Tt should be pointed out that since there was 
only one S to be predicted for each age group, gen- 
eralizations concerning age as a determinant of ac- 
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curate predictive judgments are limited. That is, 
while age is the apparent difference between the 
two Ss, they may differ along some other dimen- 
sion, unrelated to age, which could also discriminate 
between people of the same age. 

5 Instructions. The effects of two different types of 
instructions on the judges were evaluated. Under 
the assumption that one of the effects of training is 
to create a “set” that people are relatively con- 
sistent in their behavior, the effect of inducing such 
a set in relatively naive judges was evaluated. 

In the first level of instruction, the goal was to 
inform the judges what their task would be, but to 
furnish no procedural information which they might 
use to enhance their prediction. They were given no 
information concerning the tasks and were given 
none concerning how they might use the information 
gained on one task to enhance their prediction on 
subsequent tasks. 

At the second level of instruction, the judges were 
furnished with some procedural clues and were 
informed of the purpose of the tasks. The investi- 
gators felt that if the judges were aware of the fact 
that S’s responses to the tasks might reflect certain 
aspects of his personality, this might have some 
effect on the accuracy of their predictions, It was 
emphasized in these instructions that procedurally, 
consistency might be the best frame of reference 
from which to make their predictions. 

Information. It is reasonable to assume that the 
more information we have about a person the more 
accurate we should be in attempting to anticipate 
or “predict” his behavior. Accuracy of prediction, 
however, does not depend on magnitude of informa- 
tion alone. Other aspects of information such as 
relevance and type also play a role in influencing 
accuracy of prediction. Blanchard (1966a, 1966b) 
found that relevant information tended to enhance 
accuracy of prediction. Both Kostlan (1954) and 
Sines (1959) found background or biological infor- 
mation useful in the prediction process when com- 
pared with other sources of information, for example, 
tests, etc. With these findings in mind, it was de- 
cided to investigate the relative effects of three types 
of information on accuracy of prediction, The be- 
havior to be predicted was that which has been de- 
scribed above. The three types of information used in 
this experiment were as follows: 

Level I: No information. Under this condition the 
judges received no information concerning the per- 
son whose behavior was to be predicted, other than 
his age. They were informed of his age for reasons 
discussed under “Subject age.” 

Level II: Biographical information. Under this 
information condition, the judges received informa- 
tion about the S which could not be considered di- 
rectly related to the behavior to be predicted. Judges 
were given biographical information consisting of 
age, family description, and professional or voca- 
tional data. This information was supplied on a 
sheet of paper attached to the judges’ answer sheets. 
Before the experiment began, Æ read aloud one of 
the two, depending on which tape was to be played. 


694 
TABLE 1 
ACCURACY SCORES UNDER EACH 
EXPERIMENTAL CONDITION 
Information 
Subject | No informa- | Biographical | Relevant 
tion information information 
(ci) (c3) (ca) 
No instructions (b1) 
Child (a1) 11,11,10,10,9 | 11,10,10,10,9 13,10,10,10,10 
18,0,0.6, Fie 1 10,9.8,6, 
Instructions (b3) 
Child 10,10,9,9,9 14,12,10,10,10 | 12,12,11,11,10 
NC = 86 NC'= 98 NC'=99 
No instructions 
Adult (as) | 13,12,12,11,11 | 17,14,13,12,12 | 15,15,15,14,12 
,10,8,8, 12,12,11,11,8 | 12,11,10,9,9 
'C' = 102 C = 122 NC = 122 
Instructions 
Adult 17,15,14,13,11 | 16,15,14,14,13 | 15,15,15,14,14 
ad 12,11,11,10,7 | 13,13,13,13,12 
=111 NC =123 | NC =137 


Note,—NC = number of correct predictions per cell, 


E judges were requested to follow along as it was 
read, 

Level III: Relevant information, As will be recalled 
from the discussion of the tasks, Santostefano (1957) 
found four discriminating factors to be present in the 
responses of individuals to the 20 tasks. On the basis 
of Santostefano’s descriptions, the investigators have 
chosen to label the factors: Factor I—destruction 
aggression versus nondestruction aggression; Factor 
Ti—threat avoidance versus risk taking; Factor III— 
extroversion versus introversion; Factor IV—covert 
aggression versus order and structure of environ- 
ment. The authors felt that by having Ss respond to 
questions which reflected their tendencies relative to 
the above factors and by informing the judges of 
their responses, direct and relevant information con- 
cerning the predictive task would be conveyed. The 
questions for both Ss were the same. These questions 
and answers were intended to communicate to the 
judges a picture of the general behavior Patterns 
of the Ss; that is, they were designed to tap certain 
personality dimensions but were not meant to furnish 
information which was obviously related to the spe- 
cific tasks, If these forced choice acts do discriminate 
between “types” of people in terms of their response 

tendencies, then Ss responding to them should mani- 
fest some degree of consistency in their replies, 
Therefore, information furnished by S, if it reflects 
similar response tendencies in other areas of behav- 
ior, should enhance a judge’s accuracy when he is 
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predicting that S’s responses to these forced choice 
acts. Two weeks elapsed between the time S$ re. 
sponded to the tasks and the time he was asked 
to respond to the questions. This was done to 
minimize the effects of one upon the other. The 
verbatim responses of Ss were used without editing, 
As in Information Level II, the judges were pre- 
sented with these questions and answers prior to the 
experiment. The E read aloud both question and 
answer and had the judges follow along. They were 
then instructed to read them again to themselves, 


Experimental Design and Summary of 
Procedure 


There were three factors under investigation in the 
study, with 2, 2, and 3 levels each, respectively. The 
design was factorial, taking the form of a 2X2X3 
factorial analysis of variance with 10 Ss per cell, 

The same procedure was followed for all 12 experi- 
mental sessions. Judges entered a room in which 
10 chairs had been placed in front of a television 
monitor. One of the two video tapes (representing 
the two levels of the subject age variable) was 
already loaded on an Ampex video tape recorder. 
Pencils and an answer sheet were distributed to 
each S. Depending on under which condition of 
instruction the session was to be conducted, one of 
the two sets of instructions was stapled to the 
answer sheet. Also stapled to the answer sheet was 
one of the levels of information. (If the condition 
called for Level I of information, nothing was 
supplied.) The first page of the handout was the 
instruction page. The Æ read aloud the appropriate 
instructions and requested that the judges follow 
along. After E had assured himself that the instruc- 
tions were understood, the judges were asked to 
turn the page to the information section. After one 
reading aloud by E, the judges were asked to read 
it again to themselves. After all questions had been 
answered, the judges were cautioned to do their 
own work and to refrain from talking during the 
showing of the video tape. The tape was then 
started and the first item appeared. After the S on 
the tape had been presented with the two alterna- 
tives and the picture had faded to black, the tape 
was stopped by E and the judges were asked to 
indicate on their answer sheets which choice they 
felt S would make. There was no time limit set, but 
on the average all judges made their responses 
within 30 seconds. They could at any time refer to 
the instruction sheet or the information sheet. The 
order of presentation of items on the tape was 
randomly determined. 


RESULTS 


The measure of accuracy of prediction was 
the total number of correct predictions made 
by the judges in each of the 20 situations 
Presented. Table 1 reports these accuracy 
scores for each experimental condition. 


PREDICTIVE JUDGMENTS 


Table 2 provides the summary of the analy- 
sis of variance. The comparison between the 
two levels of subject age (Factor A) was 
significant. The F ratio resulting from this 
comparison was 51.19, which is significant at 
a.01 level. This difference was the result 
of greater accuracy of prediction when the 
S being observed was the adult S. 

The comparison between the two levels of 
instruction (Factor B) was not significant. 
The F ratio was 3,09; an F ratio of 3.93 is 
required to obtain significance at the .05 level. 
The direction of the obtained difference, how- 
ever, indicated greater accuracy of prediction 
under the condition of “instruction” as op- 
posed to the condition of “no instruction.” 

The effect of different types of information 
(Factor C) was significant at @.01, with an 
F value of 6.93. Because of the significant F 
ratio, the Newman-Keuls test was conducted 
to determine the magnitude of the differences 
between the treatments taken two at a time. 
Table 3 shows the results. With the level of 
significance set at .05, biographical informa- 
tion was found to be significantly effective in 
increasing accuracy of prediction when com- 
pared with the no-information condition. The 
relevant information condition was the most 
significant condition when compared with the 
no-information condition. A comparison be- 
tween biographical information and relevant 
information was found not to be significant. 
However, inspection reveals a greater number 
of correct predictions under the relevant 
information condition. 


TABLE 2 F 
SUMMARY TABLE FOR ANALYSIS OF VARIANCE 


Source df |! SS MS F 
Subject (A) 1 | 232.42 | 232.42 | 51.19* 
Instruction (B) 1 14.02 | 14,02 | 3.09% 
Information (C) 2 62.93 | 31.46 | 6.93* 
AXB 1 66 66 
AXC 2 7.10 3.55 
BXC 2 19 39 
AXBXC 2 | 738] 3.69 

Within (error) 108 | 490.30 | 4.54 
Total 119 | 815.60 


x393 required for significance at .05. 
*ps <01. ss 
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TABLE 3 


Marerx or DIFFERENCES BETWEEN TREATMENTS: 
Resurts or Newman-Krurs TEST 


Ordered treatment totals A | B | G 
Noinformation (A) = 384 — 45* 70* 
Biographical information (B) = 429 — 25 


Relevant information (C) = 454 


Note.—Differences necessary for significance at ‚05 between 
A and B, A and C, and B and C are 37,85, 45.40, and 37.85, 
respectively, 

*Significantata = .05, 


Discussion 
Subject Age 


It has been suggested by previous research 
in this area that perhaps the characteristics 
of the person being judged or about whom 
predictions are being made may influence the 
accuracy of the judgments or predictions 
(Baker & Block, 1957; Hamlin, 1954). As 
shown in Table 2, very significant differences 
were found between the two levels of subject 
age employed in the study. The judges were 
a great deal more accurate when predicting 
the behavior of the adult than when predict- 
ing the behavior of the child. One possible 
explanation for these results is that the 
judges were more accurate when predicting 
the behavior of the adult because they felt 
more similar to the adult. However, a control 
for similarity was employed in the study. 
It is felt that this difference can be more 
adequately explained as a function of subject 
characteristics which are concomitant with 
age. In an attempt to determine what these 
characteristics may be, each S’s responses to 
the 20 items were examined. Certain expect- 
ancies exist relative to S’s responses to these 
items. These expectancies are determined by 
the clustering of certain of the items on dif- 
ferent factors. As mentioned before, four dis- 
criminating factors were believed to be re- 
flected in these clusters. For example, on 
Factor I, five of the two-choice situations 
were found to cluster together. The cluster- 
ing was determined by the fact that people 
who chose Alternative A on one item would 
also tend to choose a particular alternative 
on some other item. Thus, when five items 
were found to cluster or “load” on Factor I, 
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TABLE 4 
SUBJECTS’ Responses To Irems LoApED on FACTOR I 
Responses 
Situation 
Child | Adult 
1. a, Breaking large light bulb. b b 
b. Breaking small light bulb. 
2. a. Breaking light bulb. b b 
b. Watering plant. 
3. a. Driving in spike. a b 
b. Pulling out spike. 
4. a. Striking sheet of glass. a b 
b. Refusing to strike glass. 
5. a, Examining with hand interior of 
box. b b 
b. Looking at concealed picture. 
3 5 


Note.—A person choosing the *ʻa” alternative of the first 
situation should also choose the “a” alternative of the other 
situations;similarly for '‘b,” 


this meant that there was a tendency for 
people responding to these items to respond 
in a “consistent” manner to them. For ex- 
ample, if one of the Ss chose to break the 
light bulb when faced with the choice of 
breaking a light bulb or watering a potted 
plant we would expect him also to choose to 
drive in a spike when faced with the choice 
of driving in a spike or pulling out a spike. 
The responses of the Ss used in this experi- 
ment were examined in order to determine 
whether their responses manifested this ele- 
ment of consistency, Table 4 shows the items 
that clustered together and each S’s responses 
to those items for Factor I. It is apparent 
that the adult S was more consistent in his 
responses to those five situations than the 
child. If we were to assign a value of 1 to 


TABLE 5 


ConsisTENcy OF SUBJECT RESPONSES 
FOR THE Four Factors 


Subject Nibe ot 

Factor | |————_—__-_} Number o 
situati 

reat akin ituations 
I 5 3 5 
Ir 3 2 4 
III 1 1 2 
IV 4 3 5 
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each consistent response, the adult’s score for 
Factor I would be 5 and the child’s score 
would be 3. Using this method of tabulation 
on all factors, the results are shown in 
Table 5. 

The investigators feel that these differences 
in consistency for the most part account for 
the difference between the two Ss in terms of 
how accurately their behavior was predicted, 
It may be that the younger S has not yet 
established enduring patterns of responding 
when dealing with environmental stimuli as 
has the adult. It may also be argued that the 
type of tasks used in the study do not tap 
the areas of the child’s behavior which are 
characterized by consistency. If, for example, 
the two-choice items had had a play versus 
work quality, the child might have responded 
consistently and his behavior would then 
have been more accurately predicted. This 
suggests that in the prediction of behavior, 
S’s age in combination with the type of 
behavior being predicted may play a vital 
role in the accuracy with which predictive 
judgments are made, These findings support 
those of Hamlin (1954), Baker and Block 
(1957), and Estes (1938). 

A word of caution is also in order at this 
point. Because practical considerations limited 
the number of video tapes which were made, 
we have only one S for each of the two age 
groups. Any discussion must take this fact 
into consideration. A number of replications 
using different Ss are required before one 
can say unequivocally that the subject age is 
that which determines consistency or that 
it is age which determines the accuracy of 
predictive judgments. 


TABLE 6 


Tora NuMBER or CORRECT PREDICTIONS 
For EACH SUBJECT UNDER Two 
LEVELS or ĪNSTRUCTION 


Level 
A Overall 
Subject F] MET 
instruction | 2»struction 
Child 267 283 16 
Adult 346 371 25 


{ 


PREDICTIVE JUDGMENTS 


Instructions 


The rationale for most training programs 
in clinical psychology, psychiatry, social work, 
and related disciplines is that somehow a 
person can be taught the skill which is re- 
quired to perform his professional function. 
Tn all cases, one of these required skills is the 
ability to make accurate judgments about 
people. It is assumed that people who receive 
the appropriate training are better able to 
make these judgments than those who receive 
no training. These considerations led to in- 
cluding instructional set as a variable to be 
assessed in the present experiment. 

As shown by the results, the differences 
between these two levels of instruction failed 
to reach significance. They were, however, in 
the direction of more accurate prediction 
under the “instruction” level, with an F value 
of 3.09, An inspection of the effect of instruc- 
tion for each S taken independently indicates 
that this manipulation may have been more 
effective when the adult S’s responses were 
being predicted. This is to be expected, since 
it was shown in the previous discussion that 
the adult was more consistent than the child. 
However, when this manipulation is consid- 
ered independently for the child, we find a 
trend on the part of the judges to be more 
accurate in their predictions when receiving 
instructions. 

Considering the facts that the overall dif- 
ferences approached significance and that 
where the S responded in a consistent manner 
there was greater accuracy, we are hesitant 
to conclude that instructional set had no ef- 
fect. It appears more likely that the judges 
did indeed respond to the instructions and 
were able to make more accurate predictions 
under this condition; however, the degree 
of inconsistency manifested by the child S 
tended to decrease the overall effects. Since 
the child was not completely inconsistent in 
his responses, there was some increase in 
accuracy in predicting his behavior when 
instructions were given. Obviously, if there 
had been a significant effect as a function of 
the particular S whose behavior was being 
Predicted under a particular level of instruc- 
tion, there would have been a significant inter- 
Action between these two variables. As can 
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be seen in Table 3, this interaction was not 
significant. 

These results suggest that perhaps the 
rationale behind training programs such as 
those mentioned above is valid and the ability 
to make accurate predictive judgments can 
be taught or communicated to the novice 
under the appropriate conditions, that is, 
correct instructions—instructions which are 
easily communicated and which are relevant 
to the behavior to be predicted. 


Information 


The third variable under investigation was 
information. The results of the analysis of 
variance show that the effect of the three 
levels of information was significant. The 
differences were in the direction of increased 
accuracy of prediction with different amounts 
and kinds of information. Table 3 shows the 
total number of correct predictions under 
each level of information and the test of 
ordered means conducted to determine where 
the differences occur. 

The statistically significant difference be- 
tween Levels I and II would suggest that 
accurate background information concerning 
S tends to enhance accuracy of prediction, 
whether or not it is related directly to the 
behavior to be predicted. The writers feel 
that the information furnished under Level IT 
was used by the judges to create a stereotype 
and their predictions were based on what 
they felt a person like that would do when 
asked to respond to the 20 tasks. These 
results tend to support those of Kostlan 
(1954) and Sines (1959). 

Level III of information (relevant informa- 
tion) appeared to be the most effective treat- 
ment. When compared with Level I, there 
was a statistically significant increase in ac- 
curacy under this condition. It also appears 
to be superior to Level II, but not at a sta- 
tistically significant level. Here, the judges 
apparently used S’s statements about his 
feelings and his behavior to form a hypothesis 
concerning how he would respond to the 20 
items. These hypotheses were more often 
correct than not, as evidenced by the results. 

The effect of information on accuracy of 
prediction suggests that information sufficient 
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to create a stereotype can be effective in 
enhancing predictive accuracy. Also, it ap- 
pears that information which is directly 
related to the behavior to be predicted is 
the most effective kind of information. 


Combined Effects 


When all three variables are considered, 
there appear to be additive effects. For ex- 
ample, the cell or treatment combination in 
which the smallest number of correct predic- 
tions was made was child S (a;), no instruction 
(bı), no information (cı). The treatment 
combination in which the greatest number of 
correct predictions was made was adult S 
(ag), instructions (be), relevant information 
(c3). This result indicates that while the vari- 
ables have an effect on accuracy when con- 
sidered alone, there appears to be an “ideal” 
combination of the three conditions, 

The conclusion can therefore be drawn that 
when behavior is being predicted, the accu- 
racy of the prediction will depend to a great 
extent on (a) the age of the person whose 
behavior is to be predicted—it must be con- 
sidered in light of the kind of behavior which 
is being predicted; (b) the presence of an 
accurate instructional set concerning how to 
predict; and (c) the amount of relevant 
information available to the predictor about 
the person whose behavior is to be predicted. 
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RORSCHACH PREDICTION OF SUCCESS IN CLINICAL TRAINING: 
A SECOND LOOK + 


RAE CARLSON ? 
California State College, Fullerton 


This study reports the development of a Rorschach Index based on an ego- 
psychological conceptualization of an “optimal personality picture” of the 
successful trainee in clinical psychology. Rorschach Index predictions for 155 
trainees of the Veterans Administration assessment project were compared 
with dichotomous predictions from the Miller Analogies Test (MAT) and the 
Strong Vocational Interest Blank (SVIB). The Rorschach Index predicted 
success and failure and identified failures for “personality reasons” more 
effectively than the MAT or the SVIB. Implications for use of an “ideal 
type” approach in prediction are discussed. 


A major and controversial finding emerging 
from the Clinical Psychology Assessment 
Project (Kelly & Fiske, 1951) was the failure 
of projective tests to predict differential 
success among clinical trainees. The present 
study reports previously unpublished data on 
the prediction of progress in clinical training 
with a Rorschach Index based on ego- 
psychological constructs. 

The scope, methods, and findings of the 
assessment project (fully described in Kelly 
& Fiske, 1951) are so well known as to 
tequire only a brief summary here. Graduate 
students admitted to the Veterans Adminis- 
tration (VA) training program in clinical 
Psychology in the late 1940’s were studied in 
a 5-year assessment program with the primary 
Purpose of “the evaluation of a variety of 
Procedures as predictors of later success in 
graduate training and professional functioning 
in clinical psychology [p. 193].” A wide vari- 
ety of objective and projective tests, subjec- 
tive and quantitative methods were used in 


1Preparation of this manuscript was aided by a 
United States Public Health Service Special Fellow- 
ship administered by Educational Testing Service, 
tinceton, New Jersey. Research reported in this 
Study was supported by the Veterans Administration 
Under Contract VAm-22556 with the University of 
ichigan. The writer is grateful to E. Lowell Kelly 
and to Ruth Bishop Heiser for their support and en- 
Couragement in the conduct of the study, and to 
illiam Ward and Roy Shore for their comments 
on an earlier draft of this paper. 

* Requests for reprints should be sent to Rae 
tlson, who is now at 4323 Rosedale Avenue, Be- 
thesda, Maryland 20014. 


making prediction ratings. While assessment 
staff members were encouraged to use their 
preferred personality theories and constructs 
in formulating evaluations of Ss, they were 
required to express their evaluations on a 
standard set of 8-point rating scales. Criterion 
measures, including objective tests and annual 
ratings by university and installation super- 
visors, were also expressed in terms of stan- 
dard 8-point scales. Results of the study indi- 
cated that only intellectual aspects of success 
in clinical training (as contrasted with clinical 
and social skills) could be predicted reliably; 
only the Miller Analogies Test and the Strong 
Vocational Interest Blank had any predictive 
value. Rorschach predictions were particularly 
fallible, showing no significant relationship 
to any criterion measures, 

The present investigation, undertaken as a 
subproject of the VA assessment project, 
made two major departures from the orienta- 
tion of the parent study: (a) in contrast to 
the eclectic, empirical approach of the larger 
study, predictions of success or failure were 
based on an explicit, theoretically derived 
formulation of an “ideal type”; (b) qualita- 
tive patterns, based on dichotomized vari- 
ables, were used for both prediction and out- 
come measures, in contrast to the quantitative, 
dimensional approach of the larger study. 

When the prediction situation is seen from 
the perspective of ego psychology, the im- 
mediately relevant assessment problem is that 
of identifying candidates whose career com- 
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mitment is based on mature ego-identity.* 
One looks for candidates who give evidence 
of (a) general maturity, differentiation, and 
ego integration, and (b) consonance between 
personal qualities and the demands of the 
career training situation so that the profes- 
sional task will demand and use potentiali- 
ties, and can become a basis of continuing 
ego-integration. An “optimal personality pic- 
ture” of the successful trainee (described 
below), based on a conceptual analysis of 
the requirements of clinical training, guided 
the development of the Rorschach Index. 
This basic rationale further assumes that 
(a) evidences of strengths and adaptive re- 
sources are more important than evidences of 
weakness, conflict, or defensiveness, and (b) 
given the presence of the criterial personality 
pattern, great diversity in other personality 
characteristics may be tolerated—while, con- 
versely, an S’s possession of many “positive,” 
valued characteristics is irrelevant if the basic 
“optimal personality picture” is lacking. 

The departure from a dimensional treat- 
ment of prediction and outcome measures was 
dictated by the view of selection-prediction 
questions as intrinsically dichotomous deci- 
sions about individuals, One asks: “Is this 
person a good bet?” “Will he make it or not?” 
The alternative kind of question asked in the 
VA assessment study (“How well does a po- 
sition on Scale A predict a position on 
Scale B?”), although supported by a more 
advanced measurement technology, is seen as 
fundamentally instrumental for making deci- 
sions of a quantitative kind.* 

According to this rationale, a personality 
measure specifically designed to assess a 
candidate's “fit” to the hypothetical optimal 
personality picture should be more clearly 
related to success in clinical training than 
measures lacking a conceptual basis in the 
criterial personality pattern. Thus, the hy- 
pothesis to be tested is that the Rorschach 
Index developed in this investigation (a) pre- 

*A succinct definition of Erikson’s 
of ego-ideoliy. 4 Ollero Wh Degen cae Ac 
(1966, p. 15): “The identity concept, in short, 
concentrates on the fusion of these elements (identi- 


fications, capacities, opportunities, and ideals 
viable self-definition.” aes 


“An eloquent statement of a similar rationale has 
been presented by Tyler (1959). 
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dicts success or failure in clinical training, and 
(b) discriminates successful and unsuccessful 
trainees more frequently than the “best 
predictors” of the VA assessment project— 
the Miller Analogies Test and the Strong 
Vocational Interest Blank. 


METHOD 
Development of the Rorschach Index 


The procedure used in the development of the 
Rorschach Index included the folowing steps: 
formulation of an “optimal personality picture” 
which should characterize the successful trainee, 
translation of this schema into the language of the 
Rorschach, and development of an objective Ror- 
schach Index on the basis of empirical study of a 
small number of cases. 

Three sets of assumptions guided the formulation 
of an optimal personality picture: (a) “Base-rate” 
assumptions—Since all of the trainees were college 
graduates who sought professional training in clinical 
psychology and had been screened and accepted by 
the Veterans Administration and by participating 
universities, it was assumed that the group would 
be adequate—and relatively homogeneous—with re- 
spect to intellectual ability and freedom from psycho- 
pathology, Therefore, many personality character- 
istics of clinical importance (e.g., adequacy of reality 
testing) could be considered peripheral to this pre- 
diction problem. (b) Ego integration—Qualities of 
psychological differentiation and integration con- 
sidered criterial for mature ego-identity status in- 
cluded evidence of inner resources and stability, 
mature expression of affect, and effective and flexible 
controls, with a minimum of constriction or rigidity. 
(c) Situational requirements—Success in clinical 
training demands a specific set of personality charac- 
teristics. The program is first of all an academic 
graduate program dealing with “psychological” con- 
tent at an abstract level; satisfaction and success 
in such a program should require intellectual ability 
used in goal-directed striving, and an introversive 
orientation. The development of effectiveness and 
responsibility in clinical skills further demands 
sensitivity and maturity in interpersonal relation- 
ships and an adequate and flexible system of controls. 

In translating this optimal personality picture into 
the language of the Rorschach,’ emphasis was placed 
on “structural” characteristics of the psychogram, in 
contrast to content, and on the organization of 
psychological processes, reflected in ratios, as opposed 
to absolute indicators. Traditional indexes of conflict, 
anxiety, and defensiveness were explicitly deempha- 
sized. The “ideal candidate,” then, should present 4 
Rorschach psychogram in which human movement 
Tesponses (representing inner resources, introversive 
orientation, the cognitive aspect of goal-directed 


® Cf. Klopfer, Ainsworth, Klopfer, and Holt (1954) 


for scoring and interpretation of Rorschach responses 
as used in this study. 
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striving) are high; form-color and form-texture re- 
sponses (representing mature expression of affect and 
dependency) predominate over relatively immature 
color-form and texture-form responses; and human 
movement and form-color responses (representing 
psychological differentiation and integrative controls) 
adequately balance form responses. 

Development of an explicit, objective Rorschach 
Index of this optimal personality picture proceeded 
as follows: Four case records of trainees (including 
the Rorschach protocol and other “raw” assessment 
data, but excluding any interpretations, prediction, 
or criterion ratings) were examined blindly® by 
the investigator, who predicted success or failure and 
attempted a provisional statement of explicit Ror- 
schach criteria on the basis of the pattern described 
above. Since these blind predictions proved accurate 
in discriminating the two PhDs and two dismissals 
in this group, two additional cases were submitted 
for blind predictions and refinement of Rorschach 
criteria. Again, these cases were successsfully pre- 
dicted, and a final statement of Rorschach criteria 
was developed to separate the successful and un- 
successful candidates in the total sample of six 
trainees. The resulting Rorschach Index consisted 
of five characteristics (based on Klopfer scoring? 
of main responses only). All five of the following 
characteristics must be present in a Rorschach record 
for a prediction of “success” in clinical training: 


M > sum C 

M+FCSCF+C 

FCSCF+C 

F% < 50% 

At least one FC and one Fc response 


Reliability of scoring of the Rorschach Index was 
established by submitting a random sample of 16 
Rorschach records from the total VA sample to an 
experienced clinician for scoring and prediction on 
the basis of the Rorschach Index. Agreement between 
predictions of the independent judge and the 
investigator was obtained in all cases. 


Subjects 


The Ss included 155 male graduate students en- 
tolled in the VA training program in clinical psy- 
chology who had participated in the 1946 and 1947 
assessment. programs. This sample included all 
trainees for whom Rorschach protocols and relevant 
criterion data were available. Ss, assessment pro- 
cedures, and criterion measures are described in 
Kelly & Fiske, 1951). 

— 


°E was aware that the first group of four cases 
included two successful and two unsuccessful 
trainees; predictions for the final set of two cases 
Were entirely blind. 

"Since the Rorschach Index was developed prior 
to the appearance of Klopfer’s revised system 
(Klopfer et al, 1954), it does not include such 
Current refinements in scoring or interpretation as 
orm-level rating or Klopfer’s formulation of ego 
Psychology and Rorschach hypotheses. 
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Procedure 


Rorschach protocols of all trainees were scored in 
terms of the Rorschach Index and predictions of 
success or failure made on this basis. In a few cases 
where original Rorschach records had been scored 
by Beck or Hertz systems, the investigator rescored 
records according to the Klopfer system in order to 
apply the Rorschach Index. 

Dichotomous predictions of success or failure were 
made from the Miller Analogies Test (MAT) and 
the Strong Vocational Interest Blank, VA Clinical 
Psychologist Key (SVIB), as follows: A cutting 
score of 69, empirically determined to give the best 
prediction, was established for the MAT. SVIB 
scores had been previously coded on a standard 
8-point scale; predictions of success were made for 
Ss receiving a coded score of five or above. 

Predictions of success or failure in clinical training 
were evaluated for two criterion groups: Group I 
(PhD vs. dismissal) included the 68 trainees who 
had either received the doctorate in clinical psychol- 
ogy or had been dismissed from the program by 
the spring of 1950. This group included trainees 
who had entered the program at various levels 
and who had participated in the 1946 or 1947 assess- 
ment programs. Group II (successful vs. unsuccess- 
ful) consisted of 105 first-year trainees from the 128 
Ss assessed in the summer of 1947. The Ss were 
eliminated if they had previous experience with the 
Rorschach (m=8) or relevant criterion data were 
incomplete (# = 15). The Ss were classified as suc- 
cessful if they had received the doctorate in clinical 
psychology or received a 1950 median university 
rating of five or above on the three component skills 
of diagnosis, therapy, and research. Unsuccessful Ss 
were defined as those who had resigned or had been 
dismissed from the program or had received a 1950 
median university rating of less than five on 
competence in diagnosis, therapy, and research. 

Predictions of success or failure were evaluated 
with chi-square or binomial tests. Since all hypothe- 
ses involved directional predictions, one-tailed tests 
were used. 


RESULTS AND DISCUSSION 


Table 1 summarizes Rorschach Index and 
MAT predictions for Group I. The data 
indicate that the Rorschach Index predicts 
success in clinical training, identifying early 
PhD’s as well as the MAT, and identifying 
early failures significantly better than the 
MAT (binomial probability < .02). 

Is the Rorschach Index sensitive to rele- 
vant personality characteristics “missed” by 
the MAT? One source of evidence on this 
point is available in the analysis of the 34 
dismissals. According to the basic rationale, 
the Rorschach Index should be relatively 
more effective in identifying candidates whose 


702 RAE CARLSON 
TABLE 1 
Correct PREDICTIONS or Success OR FAILURE For Group I 
PhD Dismissal Total 
predicted correctly predicted correctly predicted correctly 
Predictor N 
N % N % N % 
Rorschach 
Index 34 28 82 34 28 82 68 56 82 29,9** 

MAT 33 27 82 34 21 62 67 48 72 10.1% 

* 

#5 S001 


failures had been categorized by universities 
as due to “personality reasons” in contrast 
to “academic reasons.” Predictions from the 
Rorschach Index and the MAT for the two 
types of dismissals are summarized in Table 2. 
While the majority of dismissals of each 
type were correctly identified by both the 
Rorschach Index and the MAT, only Ror- 
schach Index predictions of “personality” 
failures attained statistical reliability. 
Results for the more homogeneous sample 
of first-year trainees in Group II are sum- 
marized in Table 3, Since MAT and SVIB 
scores were not available for all Ss, compari- 
sons are based on percentages of correct 
predictions of success or failure, Table 3 indi- 
cates that the Rorschach Index gave a signifi- 
cantly higher proportion of correct predic- 
tions of failure than the SVIB (binomial 
probability .005) or the MAT (p < .005), 
while the SVIB and the MAT were more 
accurate in predicting success (binomial prób- 
abilities of p < .08 and p < .06, respectively). 
One of the most striking aspects of the 
results is the sharp reduction in the accuracy 
of prediction from either the Rorschach Index 


TABLE 2 


Prepicrions or FAILURE ror SUBJECTS 
Later Dissrssep 


è Includes pen ** and ‘combi 
sate ‘personality’ and ‘combination of personality and 


or the MAT as one moves from Group I to 
Group II (binomial probabilities are .007 and 
.02, respectively), despite the fact that the 
two criterion groups contain some of the 
same individuals, While we have argued for 
the clear-cut PhD-dismissal criterion (which 
in this context means early PhD vs. early 
dismissal) as the more relevant measure of 
success or failure, it is also true that Group I 
represents greater heterogeneity in terms of 
age, experience, and ability. Therefore, the 
issues of heterogeneity-of-sample and concep- 
tual-relevance-of-criteria need to be examined 
separately. Fortunately, it is possible to do 
so by comparing results of the present investi- 
gation with previously reported findings from 
the VA assessment project on Rorschach In- 
dex predictions against three sets of criterion 
data for the same homogeneous sample of 
trainees represented in Group II of this study. 
Relevant comparisons are summarized in 
Table 4. 

Table 4 indicates clearly that the Ror- 
schach Index predictions for the same Ss are 
more accurate as the stringency and differen- 
tiation of criterion ‘measures increase. ‘“Pro- 
fessional Promise,” the criterion most success- 
fully predicted by the Rorschach Index, is 
based on a higher cutting score and uses the 
rater’s judgment of relevant weighting of per- 
formance on component skills for the indi- 
vidual case. “Successful versus Unsuccessful” 
(Group II of the present study) is based 
on a cutting score of five—the midpoint of 
the scale—and employs an arbitrary equal 
weighting of three component skills. “Pass- 

Fail,” as a criterion, assumes that trainees 
who have left the program (for whatever 
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TABLE 3 
Correct PREDICTIONS OF Success or FAILURE ror Group II 
Successful Unsuccessful Total 
predicted correctly predicted correctly predicted correctly 
Predictor N xe 
N % N % N % 
Rorschach | 
Index | 40 23 58 65 46 71 105 69 66 8.24** 
MAT 40 27 68 57 31 54 97 58 59 4.54* 
SVIB 40 24 60 63 34 54 103 58 56 2.46 
M Sor 


reason) are “failures” and that candidates 
still in training after 3 years (regardless 
of their level of competence) are likely to 
receive the doctorate, and are, therefore, 
“successful,” 

Clearly, the Rorschach Index is less gen- 
erous in predicting success than the MAT or 
the SVIB; predictive errors tend to be in the 
direction of “false negatives.” This is not sur- 
prising, of course, since the Rorschach Index 
was designed to identify only the most prom- 
ising candidates. However, base rates for 
prediction of success vary with the maturity 
and heterogeneity of the sample. Where suc- 
cess was predicted for 50% of the trainees in 
Group I, the base rate is significantly reduced 
to 40% among the younger, more homogene- 
ous trainees of Group II (binomial proba- 
bility .05), Predictive accuracy similarly 
decreases (from 82% to 66%; binomial 
probability .007) when the maturity-hetero- 
geneity of the sample and stringency of 
criteria are decreased simultaneously. 

These considerations suggest that the Ror- 
schach Index demands a degree of maturity, 
differentiation, and experience-in-living which 
may not be readily found in beginning gradu- 
ate students, (It should be noted that even 
the first-year trainees of Group II included 
many veterans with some years of wartime 
experience in addition to undergraduate train- 
ing.) While the apparent sensitivity of the 
Rorschach Index to something like “maturity 
of character development” may be counted as 
evidence of its construct validity, this con- 
sideration could limit its usefulness in the 
selection of beginning graduate students. 

Obviously the major strength of the 


present Rorschach Index—its derivation from 
a custom-made conceptualization of the de- 
mands of clinical training—becomes a barrier 
to its application in other prediction situa- 
tions. One could not expect this measure to 
serve as a general assessment technique, nor 
to identify promising candidates in other pro- 
fessional fields with different patterns of per- 
sonality requirements. Moreover, since the 
rationale of the present approach requires the 
conceptualization of intrinsic situational de- 
mands prior to development of predictive 
instruments, it is quite possible that this 
Rorschach Index may no longer predict 
success in current clinical training programs. 
The striking shifts in the Zeitgeist and tech- 
nology of clinical psychology over the past 
20 years may well have altered the pattern 
of demands and satisfactions to be found in 
clinical training and the characteristics of 
those who seek such training. 

However, the general approach developed 
in this investigation may hold promise for a 
variety of assessment and prediction prob- 
lems. The basic approach—that of formu- 
lating an “optimal personality picture” as a 


TABLE 4 


RORSCHACH INDEX PREDICTIONS ror Group II 
on THREE CRITERION MEASURES 


Basis for prediction of m 


Criterion success: PhD or 


Professional promises | Rated overall clinical 394+ 
competence > 6 
Successful- Median rating on three .28* 
Unsuccessful component skills > 5 
Pass-Fail% Still in training in third year |.13 


s For details, see Kelly and Fiske (1951, pp. 191-192). 
*p <.01, 
** p <.001. 
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basis for prediction—is independent of the 
ego-psychological constructs or the Rorschach 
test used here. The present findings, however, 
offer encouragement for the use of ego- 
psychological constructs—even at this rela- 
tively superficial level—as a source of rele- 
vant questions and for the Rorschach as an 
instrument for answering these questions. 
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VERBAL AND OVERT-BEHAVIORAL ASSESSMENT 
OF A SPECIFIC FEAR* 


ANTHONY F. FAZIO? 


University of Wisconsin, Milwaukee 


Using college students (NW = 70), self-ratings of fear on a modified Fear Survey 
Schedule and overt approach-avoidance responses to a large insect were 
investigated. Professed fears of insects were negatively related (p < .001) to 
approach responses toward them. Sex of the Æ and whether or not E handled 
the insect were not associated with significant differences in approach behavior. 
Behavior which might be described as at least mildly phobic occurred in 27.1% 
of the Ss. Implications of the diagnostic utility of the questionnaire and the 


behavioral test were discussed. 


Directly or indirectly the principal criterion 
used to evaluate the efficacy of psychotherapy 
has been the verbal report (e.g., the patient 
claims he is or is not feeling better). The 
utility of behavior modification techniques 
can be assessed, however, directly from objec- 
tive changes in overt behavior (Ullmann & 
Krasner, 1965, p. 28). Phobic behaviors are 
readily interpreted in behavioral terms as 
avoidance responses made in the presence of 
identifiable stimuli. Typically, in controlled 
investigations of behavior modification, pa- 
tients or Ss are interviewed and confronted 
with an anxiety producing situation in order 
to assess (e.g., Lazarus, 1961) or verify (e.g., 
Hogan & Kirchner, 1967) their professed 
fears, Nevertheless, Ss accepted for inter- 
viewing are often preselected on the basis 
of self-evaluative measures. Even though 
questionnaires and self-rating scales produce 
quantitative data quickly and easily, their 
relationships to actual overt behavior require 
Investigating public speaking 
fears, Paul (1966, p. 129) reported a signifi- 
cant correlation (r=.41, p<.001) between a 
self-report measure of confidence and behav- 
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havioral measure, they did not report the degree of 
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ioral checklist ratings by judges. Lang (1968), 
however, has commented on the generally 
nonsignificant correlations between fear-survey 
questionnaires and overt avoidance behaviors 
(eg., for fear of snakes, r= .26, p < .10). 

The first objective of the present study was 
to evaluate the relationship between professed 
or denied fears of a specific stimulus object 
(cockroaches) and overt approach-avoidance 
responses toward that object. Questionnaire 
items dealing with stimuli similar to and 
different from the phobic object were also 
examined. 

Of the many situational characteristics 
which may effect S’s behavior, two were ex- 
amined in the present study: sex of the Z and 
whether or not Æ handled the insect. Bandura, 
Grusec, and Menlove (1967) have reported 
avoidance responses to be significantly modi- 
fied by the modeling behavior of a peer. If 
behavioral tests are to be used as diagnostic 
tools, the extent to which therapeutic treat- 
ments are confounded with testing-situation 
influences needs to be assessed. 

The third objective was to obtain norma- 
tive data on avoidance responses of college 
students toward a relatively large insect. 


METHOD 


Fifty-six female and 14 male introductory psy- 
chology students voluntarily participated in the 
“interview, and survey.” The insect, a harmless 
variety of cockroach, about 2ł inches long and 
1 inch wide (Gromphadorhina portentosa), was kept 
in a 14 X 84 X 10 inch empty glass aquarium with a 
wire screen cover. The container was placed on a 
table behind a partition. 

The Fear Survey Schedule (FSS-III; Wolpe & 
Lang, 1964) was used as a vehicle for obtaining 
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TABLE 1 
Propuct-MoMENT CORRELATIONS BETWEEN 


QUESTIONNAIRE ITEM OR ITEM COMPOSITES 
AND RATED Overt Test BEHAVIOR 


Items r 

1. Cockroaches —.37* 
2, Large ugly insects —.47%* 
3. Harmless spiders — .46%* 
4. Worms —.16 
5. Flying insects —.49** 
6. Crawling insects —.49%* 
Composite insect score —.41** 

7. Speaking in public +.19 

8. Entering a room where other people are 

already seated —.16 

9. People in authority +.19 
10, Being watched working +.19 
11, Looking foolish +.22 
Composite social situation score +.16 


verbal responses about the particular stimulus object. 
The schedule contains 72 items (eg, cats, being 
alone, blood) to be rated from “Not At All” to 
“Very Much” in 5 steps, on the degree to which 
each stimulus situation provoked fear or was experi- 
enced as unpleasant. The higher the score, the 
greater the degree of fear inferred, Three additional 
items were added to the survey: Cockroaches, Large 
ugly insects, and Harmless spiders. 

For the overt-behavioral test, a 7-step rating scale 
was devised which allowed E to identify S’s re- 
sponses relatively objectively: 


1. Refused to move to a position from which the 
insect could be seen. 

2. Refused to stand close enough to the table 
to be able to touch the container, 

3. Refused to remove the screen cover, 

4. Tip of fingers failed to pass upper rim of 
container, 


5. Fingers did not touch surface or appendages of 
insect, 


6. Failed to grasp insect in firm hold } 
for insect to be picked up, rae eh st 


7. Insect lifted so that all parts were not touching 
any surface of container, 


Procedure 


The male and the female Æ assigned Ss in order 
of appearance to experimental conditions in which 
the Æ handled (H) or did not handle (H) the insect. 


The E sat adjacent to S at a secretarial desk from 
which the insect could neither be seen nor heard. 


For both H and H Ss, Æ said: 


This survey concerns fears and/or unpleasant 
feelings experienced by college students, caused by 
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things, places, animals, etc. Extensive research of 
such fears is presently being conducted at a 
variety of universities. Such fears are neither 
uncommon among college students nor shameful, 
To begin with, we would like you to fill out this 
standardized questionnaire. When you have com- 
pleted it, we will continue. 


They then completed the modified FSS-III. Not 
scoring their responses, E continued: 


In an attempt to assess the extent to which 
students might be afraid of handling certain ani- 
mals, we have selected one situation listed in the 
survey you have just completed. We are offering 
all students the opportunity to handle a com- 
pletely harmless, but large ugly insect. You are 
completely free to refuse to do so, if you wish, 
You will in no way be forced to do anything 
against your wishes. In a glass container on a 
table to the right of this partition is a relatively 
large insect, which occasionally makes a hissing 
sound, but which nevertheless is completely harm- 
less. 


In an effort to control the degree to which Ss 
would use Æ’s behavior as a cue, the instructions 
explicitly informed the H Ss that E was going to 
handle the insect, and informed the H Ss that Æ was 
not allowed to handle it. Except for these state- 
ments the instructions were as similar as possible. 


For H Ss, E said: 


First, we would like you simply to look at it; 
second, touch it, after I have touched it. Finally, 
pick it up off the base of the container, after I 
have picked it up, You are, of course, free to 
refuse any of these requests, 


For H Ss, E said: 


First, we would like you simply to look at it; 
second, touch it. Finally pick it up off the base 
of the container. You are, of course, free to refuse 
any of these requests, 


For H Ss, E said: 


T will touch the insect, then you may touch the 
insect, It really cannot hurt you. I will pick the 


pect up, then you may do so. It is quite harm- 
less, 


For H Ss, E said: 


Because my behavior may influence yours, I am 
not allowed to touch the insect. It really cannot 
hurt you. You may pick up the insect now. It is 
quite harmless, 


The E recorded all of S’s comments and scored 
S's approach-avoidance behavior. The test was ended 
when S picked up the insect, or S refused to con- 
tinue, or after 5 minutes had elapsed. 
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RESULTS 


Questionnaire scores for separate items and 
specific thematic combinations of them were 
correlated with rated overt test behavior for 
all 70 Ss (Table 1). The inserted items and 
scores for two additional items dealing with 
buglike objects were found to be significantly 
related to test behavior. These items were 
selected a priori on the basis of their simi- 
larity to the test object. In each case, the 
more S claimed to fear the insect, the less he 
or she approached and/or handled it. Verbal 
measures of fear were found to possess dis- 
criminant validity (Campbell & Fiske, 1959) 
by comparing the correlation for the com- 
posite of the insect items with a correlation 
for a composite of noninsect items. Five items 
dealing with a theme of feeling conspicuous 
before a group of observing persons were se- 
lected prior to examining the data. The theo- 
tetically irrelevant items dealing with social 
situations and, thus, their composite were not 
significantly correlated with the specific be- 
havioral test involving a large roach (Table 
1), 

The predictive utility of selecting Ss for 
interview on the basis of different degrees of 
professed fear was also evaluated. If only 
those indicating “Very Much” fear of, for 
example, “Crawling insects,” were selected, 
10% of the total sample (WV = 70) would 
have been interviewed. However, this subsam- 
ple contained only 15.8% of those in the total 
sample who behaved like phobics—a poor 
yield. If Ss indicating at least “A fair amount” 
of fear had been selected, 50% of the sample 
Would have been interviewed. However, this 
Procedure would have netted 89.5% of the 
Ss rated phobic-like on the basis of the be- 
havioral test. 

Except for one male S reporting “Much” 
fear of “Harmless spiders,” and one report- 
ing “A fair amount” for “Large ugly insects,” 
none of the other male Ss indicated more 
than “A little” fear of any of the insect items, 
even though two male Ss refused to place 
their hands in the container. These findings 
are consistent with the report of Manosevitz 
and Lanyon (1965) concerning females ex- 
Pressing significantly greater anxiety in gen- 
eral than males. 
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TABLE 2 
DISTRIBUTION OF OVERT-BEHAVIORAL RATINGS 


Scale steps 


1 2 3 4 5 | 6 | 7 


Not |Refused| Fingers| Not 


Would to re- Not 
suk] eac| ove [norpan] goh, | iito | Ett 
at insect| tainer | Sereen | “rim | ingect | imsect 


cover 


N=1 | N=1 | N=5 | N=12| N=1 | N=20| N=30 


Note.—Total N = 70, 


The data on situational effects were ana- 
lyzed only for female Ss because there were 
few male Ss (N = 14) and because their 
verbal and overt-behavioral scores reflected 
significantly less anxiety (p < .01) and avoid- 
ance ( < .05) than the scores of female Ss. 
A 2 X 2 analysis of variance revealed no sig- 
nificant differences in mean rated test behav- 
ior associated with sex of E (F = 1.51, df = 
1/52, p > .10), handling of the insect (F = 
.70, df = 1/52, p > .20), or interaction be- 
tween them (F = 1.56, df = 1/52, p > .10). 
For all 56 female Ss, the mean and standard 
deviation of the overt-behavioral ratings were 
5.54 and 1.55, respectively, ` 

In spite of the gruesome appearance of the 
insect, 71.4% of the Ss touched it when asked 
to, 42.8% picked it up, and only 2.8% re- 
fused to even stand near the container. From 
the distribution of behavioral scores (Table 
2) it might be the case that the behavior of 
placing one’s hand in the container divides- 
Ss into groups possibly described as phobic 
versus nonphobic. However, this dichotomy 
may also have been an artifact of the diff- 
culty in scoring this particular behavior (i.e. 
E had to judge if S’s hand came close to but 
did not touch the insect). 


Discussion 


Although verbal reports to relevant items 
correlated significantly with overt behaviors, 
at best only 25% of the behavioral variance 
was accounted for by this measure. Wicker 
(1970), in a review of research on attitude- 
behavior consistency, reports that attitude-be- 
havior correlation coefficients rarely exceed’ 
.30, and often are near 0,0. He suggests that. 
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if the stimuli in the verbal situation were more 
similar to those in the overt response situation, 
these correlations could be larger. For ex- 
ample, Ss could be asked to describe what 
they would do if they were asked to handle 
the insect, being tested on an independent 
occasion. 

A second approach toward improving the 
magnitude of these correlations is suggested 
by the work of Endler and his colleagues 
(Endler & Hunt, 1966; Endler, Hunt, & 
Rosenstein, 1962). In general, they have ob- 
tained support for the claim that self-report 
inventories containing a variety of stimulus 
situations and response alternatives are as- 
sociated with more valid personality descrip- 
tions. More specific self-descriptions, in turn, 
might allow for attitude-behavior correlations 
of larger magnitudes. 

One aspect of the present investigation 
concerned possible reactive error effects as- 
sociated with Æ (cf. Webb, Campbell, 
Schwartz, & Sechrest, 1966, pp. 12-34). At 
the beginning of the interview, Ss were told 
they were participating in a survey concern- 
ing their fears, and were asked to describe 
(i.e., rate) them, After completing the modi- 
fied FSS-III, they were informed that a test 
was arranged to verify at least one of their 
claims, This procedure can be interpreted to 
have produced substantial reactive effects, 
similar to response sets, in the direction of 
behavioral consistency. Thus, while the cor- 
relations obtained here were statistically sig- 
nificant, their relative magnitudes may de- 
crease when other procedures are used, 

The specific situational variations manipu- 
lated had no significant effect on S’s behav- 
ior, while individual differences in professed 
fear of insects accounted for a greater pro- 
portion of the behavioral differences. These 
findings suggest that the behavioral test might 
be more sensitive to differences in individuals 
and previous treatments than it is to ex- 
traneous characteristics of the situation, thus 
qualifying as a diagnostic device. However, 
modeling effects of Æ handling the insect can’ 
not be ruled out when repeated tests are 
employed. In a review of research on imita- 
tion, Flanders (1968) has classified modeling 

practices such as handling the phobic object 
as instances of vicarious reinforcement, and 
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Bandura et al. (1967) have presented datą 
suggesting that repeated exposures to a posi- 
tive model significantly affect avoidance re- 
sponses of children toward dogs. College stu- 
dents were used in the present study and were 
exposed only once to the modeling influences 
of E. Geer and Turteltaub (1967), using fe- 
male undergraduates and fear of snakes, ex- 
amined the modeling effects of a confederate 
who behaved “quite frightened” in one ex- 
perimental condition, and displayed “very 
little fear” in another condition. High-fear Ss 
observing the low-fear confederate were re- 
ported to have moved closer to the snake on a 
postobservation test than high-fear Ss ob- 
serving the high-fear confederate. All low-fear 
Ss, defined by touching the snake on the first 
behavior test, touched it on the second. Al- 
though behavioral tests are generally intended 
to assess rather than modify behavior, investi- 
gations using nontreated but repeatedly tested 
control groups will require checks on the 
effects of modeling. 

The present sample did not contain a suf- 
ficient number of male Ss to definitively test 
possible interaction effects between sex of Æ 
and sex of S. One important characteristic of 
research involving phobic-like behaviors in 
nonclinical populations is the tendency for 
male Ss to perform, or at least attempt, al- 
most every task requested. Efforts to increase 
the range of behavioral measures (i.e., remove 
the ceiling effect) have involved items of 
behavior which require S$ to handle the object 
for longer time periods and/or in some par- 
ticularly vulnerable manner (e.g., next to one’s 
face) (cf. Bandura, 1968), 

Of the Ss tested, 27.1% behaved in a man- 
ner which might be described as mildly phobic 
(ie., they refused to place their hand in the 
container). Lang and Lazovik (1963) report 
that about 3% of a college sample are to some 
extent snake phobic. Whereas clinical de- 
scriptions (e.g., phobic) do not allow data to 
be compared objectively, operational defini- 
tions involving overt behaviors are important 
precisely because they allow such compari- 
sons to be made both within and between in- 
vestigations. Thus the data in Table 2 may 
be compared with data on, for example, feat 
ve T (e.g, Hogan & Kirchner, 1967, P- 
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RELATIONS BETWEEN PERSONALITY AND SMOKING 


BEHAVIOR IN PREADULT SUBJECTS* 


GENE M. SMITH? 


Department of Anesthesia, Massachusetts General Hospital 
Harvard Medical School 


The present study of 562 high school and junior high school students yielded 
results which cross-validate and extend the generality of earlier work with 
1,462 adults. In both studies, smoking status was assigned on the basis of self- 
report information, and personality scores were derived from peer ratings, 
With the preadults, as with the adults, smokers scored significantly (p < .001) 
lower on measures of “Agreeableness,” and “Strength of Character,” and scored 
significantly higher on measures of “Extraversion.” In addition, the smokers 
(in both studies) scored significantly higher than the nonsmokers on the 
variables “crude,” “happy-go-lucky,” and “frank.” Among preadults, multiple 


discriminant analyses permitted smoking status to be assigned with accuracy 
ranging from 65% to 79%. Although most information concerning the psycho- 
dynamics of smoking has been obtained from studies of adults, the present 
study supports the use of such information in developing antismoking educa- 


tional campaigns aimed at preadults. 


Tn an earlier study of 1,462 college and 
nursing students, Smith (1967a) found highly 
significant differences between smokers and” 
nonsmokers regarding personality character- 
istics, measured both with self-report ques- 
tionnaires and with the peer rating method. 
Multiple discriminant analyses performed al- 
lowing five measures of personality, derived 
from peer ratings, to enter the prediction 
equation permitted smokers to be discrimi- 
nated from nonsmokers with 68% accuracy, 
and permitted heavy smokers (20 or more 
cigarettes a day) to be discriminated from 
nonsmokers with 76% accuracy, This is dis- 
tinctly greater accuracy than that obtained in 
any earlier study. In terms of sample sepa- 
ration, an accuracy level of 68% is obtained 
when the means for the two samples are ap- 
proximately 1.0 standard deviations apart. 
Similarly, 76% accuracy is obtained when the 


1 This work was supported in part by Grant TH-3 
from the American Cancer Society, in part by Pub- 
lic Health Service Grant GM-15904, and in part by 
a grant from the Council for Tobacco Research- 
US.A, William G, Cochran, Professor of Statistics, 
Harvard University, and Frederick Mosteller, Pro- 
fae of oe Statistics, Harvard Univer- 

Y, served as consultants and = 
fa moia their advice is grate- 

2 Requests for reprints should be sent to Gene M. 
Smith, t of Anesthesia, Harvard Medi- 
cal School, ae rnietts General Hospital, Boston, 


two samples are approximately 1.5 standard) 


deviations apart. The relationship between 
sample separation and accuracy of classifica- 
tion is discussed in detail elsewhere (Smith, 
1967a). 

The present investigation tests the gen- 
erality of the earlier adult results by replicat- 
ing the study in a sample of 562 preadults, 
The literature contains 31 primary reports of 
empirical studies of the relations between 
personality and smoking behavior,? but only 
two of these studies used preadult Ss (Salber 
& Rochman, 1964; Stewart & Livson, 1966). 
Thus, the present investigation of 562 high 
school and junior high school students adds to 
this small literature. 

The questions of whether, and how, smokers 
differ from nonsmokers regarding personality 
is of psychological interest for two reasons. 
First, the association between smoking and 
disease, although now widely accepted, is not 
uniformly interpreted. For example, Berkson 
(1958) and Fisher (1959), both statisticians, 
Suggest that such genotypic factors as consti- 
tution and temperament might predispose 
some persons to become smokers and also 
predispose them to develop lung cancer. 
Whether or not that position has merit, the 
differential morbidity rates of smokers and 


®The 31 references may be obtained by writing 
the author. 
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nonsmokers will probably become more un- 
derstandable as constitutional and psycho- 
social differences between these groups are 
further specified. Second, little is currently 
known about the motivational factors under- 
lying decisions to become a smoker, or not; 
to continue, or quit; to reduce, or not. This 
is of considerable practical importance be- 
cause the effectiveness of antismoking educa- 
tional campaigns will probably improve mono- 
tonically with the accumulation of dependable 
and comprehensive information concerning 
the psychodynamics of smoking. 


MertHop 
Measurement of Smoking Behavior 


Students who reported that they currently smoke 
seven or more cigarettes per week were classified as 
smokers; those reporting less frequent current smok- 
ing (or none) were classified as nonsmokers. With 
this definition (Definition 1), 27% of the high 
school students and 16% of the junior high school 
students were classified as smokers. Because fre- 
quency of smoking among the junior high school 
students was low by this definition, the nonsmokers 
were further divided into those who reported that 
they had smoked 20 or more cigarettes in their 
lifetime and those who reported that they had not. 
Definition 2, which included as smokers those who 
reported having smoked 20 or more cigarettes in 
their lifetime, resulted in 38% of the junior high 
school students being classified as smokers. For 
junior high school students, smoker-nonsmoker com- 
Parisons were made in terms of both definitions. 
Better discrimination was obtained with Definition 
2 than with Definition 1. With Definition 2, 42 of 
the 84 biserial correlation coefficients (between per- 
sonality and smoking status) were significant at or 
beyond the .05 level. With Definition 1, 34 of the 84 
coefficients were significant. The junior high school 
data reported in Tables 1 and 2 are based on Defi- 
nition 2, 


Measurement of Personality 


Each student was asked to examine each of 42 
bipolar personality traits (Cattell, 1957) and iden- 
tify, for each trait, the five members of his class who 
are most like the left pole and the five who are 
most like its opposite on the right, (Class size 
ranged from 20 to 35). Selections on the left were 
considered positive nominations and those on the 
tight were considered negative. The positive and 
negative nominations a ratee received were scored 
+1 and —1, respectively. Failure to be nominated 
Was scored zero. Thus, in a peer group of 25, a 
tatee’s score on any given trait could range from 
+24 to —24, Further details concerning the method 
of deriving personality scores from peer ratings, the 
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safeguards employed to reduce bias, and the theo- 
retical rationale underlying the use of peer ratings 
are given by Smith (1967b). 


RESULTS 


A factor analysis (principal components ro- 
tated to an oblique solution using the bi- 
quartimin criterion) of the entire sample of 
peer rating data (Grades 8-12) revealed four 
factors which agree well with results obtained 
in earlier studies of nursing students and col- 
lege students (Smith, 1967b). The relations 
between smoking and personality are shown 
in Table 1 for each of the 42 peer rating 
variables separately; but to facilitate sum- 
marization, those variables are grouped by 
factor. The variables loading positively on 
each factor are listed separately from those 
loading negatively. 

The values shown in Table 1 are biserial 
correlation coefficients. Smokers were coded 
+1 and nonsmokers were coded 0; hence, 
plus correlation coefficients indicate that 
smokers have more of the trait than nonsmok- 
ers and negative coefficients indicate the op- 
posite, 

The results in Table 1 are mutually sup- 
portive within factors and highly consistent 
across the four samples. Each of the six varia- 
bles loading positively on Factor I (Agree- 
ableness) correlates negatively with smoking 
status in each of the four samples. These 
correlations indicate that smokers score lower 
on “Agreeableness” than do nonsmokers. Each 
of the five variables loading negatively on 
Factor I correlates positively with smoking 
status in each sample. These correlations in- 
dicate that smokers score higher than non- 
smokers on “Disagreeableness.” Stated dif- 
ferently, 44 correlations in the table relate 
smoking status to “Agreeableness” and all 44 
indicate that smokers score lower on “Agree- 
ableness” than do nonsmokers. Of those 44 
coefficients, 38 are significant at or beyond 
the .05 level. 

The results for Factor II (Extraversion) 
are also internally consistent over variables 
and over samples. Of the 23 (out of a possible 
40) correlations which reach the .05 level of 
significance, 22 indicate that smokers score 
higher on “Extraversion” than do nonsmok- 
ers. 
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TABLE 1 


BISERIAL CORRELATIONS BETWEEN PERSONALITY TRAITS AND SMOKING STATUS 
(SMOKER vs. NonsMOKER) IN Four SAMPLES OF STUDENTS 


T: Agreeableness 

Positive variables 
Adaptable 
Tender 
Self-effacing 
‘Trusting 
Good-natured 
Considerate 

Negative variables 
Jealous 
Assertive 
Attention-seeking 


Demanding 
Hides mistakes 

II: Extrayersion 
Positive variables 


‘Thoughtful, pensive 
TIT: Strength of Character 
Positive variables 
Conscientious 
Self-reliant 


Nozni signs (+) indicate 


higher scores for 


sp = 


High school students Junior high students 


Females Males 


Females Males 
(N =198) (N =188) (N =96) (N =80) 
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For Factor III (Strength of Character), 
33 of the 52 correlations were significant and 
all 33 indicate that smokers score lower on 
“Strength of Character” than do nonsmokers. 

The results for Factor IV (Emotionality) 
are suggestive but inconclusive. Only four of 
the 16 correlations are significant. Three of 
those indicate higher emotionality among 
smokers than among nonsmokers, but one in- 
dicates the opposite. 

Table 1 also shows (see bottom) results 
for four complex variables (happy-go-lucky, 
frank, imaginative, and fastidious) which 
load heavily on more than one of the four 
factors. Where significant, the correlations 
involving these complex variables indicate 
that relative to nonsmokers, smokers score 
higher on “happy-go-lucky,” “frank,” and 
“imaginative.” 

The main difference between the factor 
analytic results of the present study and the 
earlier results based on adults is that the 
earlier fifth factor, “Refinement,” did not 
appear in the present analysis. One of its two 
component variables, “fastidious,” was found 
here to be a complex variable which loaded 
on three of the four factors identified; the 
other, “crude,” was found to load on the 
factor called “Strength of Character.” In 
the present study, as in the earlier one, smok- 
ers scored significantly higher than nonsmok- 
ers on the variable “crude” (see Table 2). 

Multiple discriminant analyses, using the 
step-up procedure,* were performed to deter- 
mine the degree to which smokers could be 
distinguished from nonsmokers on the basis 
of personality scores. To avoid capitalizing on 
What might be called “idiosyncratic, sample- 
bound relations” between personality and 
smoking status, a prediction equation was 
developed empirically, using half of the high 
school sample (NV = 193), and this equation 


‘The first variable to enter the prediction equa- 
tion was the one (out of 42) which had the highest 
Correlation with the criterion, The next to enter was 
the one having the highest correlation with the cri- 
terion after the effect of the first predictor variable 
Was partialed out. At the next and at each succeed- 
ing step the entering variable was the one having 
the highest correlation with the criterion when the 
influence of the multivariate prediction battery ob- 
tained at the preceding step of the analysis was 
Partialed out (Rao, 1952). 
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TABLE 2 


MEDIAN BISERIAL CORRELATIONS BETWEEN 
PERSONALITY TRAITS AND SMOKING STATUS 
(SMOKER vs NoNSMOKER) IN FIVE 
ADULT AND Four PREADULT 
SAMPLES 


Adults 
(N = 1029) 


Preadults 


Factor (N= 562) 


I: Agreeableness 
Positive variables 
Adaptable 
Tender 
Self-effacing 
Trusting 
-natured 
Considerate 
Negative variables 
ealous 
Assertive 
Attention-seeking 
Demanding 
Hides mistakes 
Il; Extraversion 
Positive variables 
Gregarious 


Ga: 
Talkative 
Conventional 


10%" 


Happy 

Interest in opposite sex 
Negative variables 

Reserved 

Quiet 


Shy 
Thoughtful, pensive 
IIL: Strength of Character 
Positive variables 
Conscientious 
Self-reliant 
Responsible 
Orderly 
Socially mature 
Resourceful 
Inquisitive 
Obedient 
Negative variables 
uittin, 
Langui 
Obstructive 
Crude 
Prone to daydream 
IV: Emotionality 
Positive variables 


Bmorionsl ee ah 
ense x s 
Easily upset =,1 100 =, 1500 
Negative variable 
'olerant of stress — 09%" =.09" 
comnie mariabiet 100% iare 
lappy-go-luc! he g 
Frank. 4 A T 18e anne 
Imaginative Ta 09%" 
Fastidious 06" 02 


Note.—Plus signs (+) indicate higher scores for smokers 
than for nonsmokers; minus signs (—) indicate the opposite, In 
the study of 1462 adults, 433 contributed self-report personality 
data but no peer rating data. 

*p = 05. 
Gd 
boast 


was then tested in two cross-validation sam- 
ples—the second half of the high school sam- 
ple (N = 193) and the sample of 176 stu- 
dents from junior high school. With five of 
the 42 peer variables in the equation, accu- 
racy of prediction of smoking status was 77% 
in the first high school sample, 65% in the 
second, and 67% in the junior high school 
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sample. When the two cross-validation sam- 
ples were combined, accuracy of classification 
of smoking status was 66%. In these analyses 
males and females were combined to keep the 
sample sizes relatively large. In earlier analy- 
ses, where males and females were analyzed 
separately, the accuracy of discrimination for 
high school females was 71% and that for 
high school males was 69%. The correspond- 
ing figures for females and males in the junior 
high school samples were 79% and 68%, 
respectively. (The analyses separating the 
sexes were not cross-validated.) In both 
univariate and multivariate results, the as- 
sociation between personality and smoking 
status was greater for junior high school fe- 
males than for junior high school males.° 
The data of the present study of preadults 
were collected with procedures similar to 
those used earlier in a study of adults. Table 
2 reports the results of the two studies. Two 
biserial correlation coefficients are given for 
each of the 42 personality variables. The first 
member of each pair is the median coefficient 
for five samples of adults studied earlier 
(Smith, 1967a). The second is the median of 
the four coefficients obtained in the present 
study of preadults. The results of the two 
studies are strikingly similar. Of the 42 pairs 
of correlations, 39 have the same signs and 
34 of these are statistically significant in both 
studies. Furthermore, the magnitude of the 
correlation between smoking status and score 
on a particular personality variable in one 
study is highly predictive of the magnitude of 
the correlation for that variable in the other 
study. 


COMMENTS AND CONCLUSIONS 


Study of the relations between personality 
and smoking status in 562 high school and 


5 Multivariate discrimination for junior high school 
males was definitely poorer than that for junior high 
school females—but less so than one might have 
expected from examining the univariate results. In 
this regard it should be noted that the smoker/ 
nonsmoker split for junior high school males was 
38/42, whereas that for females was 28/68. In addi- 
tion, the intercorrelation among the 42 peer variables 
was higher for females than for males. Both of these 
differences could favor multivariate discrimination 
for the males, Because of these and other complexi- 
ties, multivariate results do sometimes depart from 
expectations based on univariate results, 
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junior high school students yielded results 
which are strikingly similar to those found 
earlier with adults. In both studies the smok- 
ers scored higher than the nonsmokers on 
various measures of “Extraversion,” and 
scored lower on various measures of “Agree- 
ableness,” and “Strength of Character.” In 
addition, smokers (in both studies) scored 
higher than nonsmokers on the variables 
“crude,” “happy-go-lucky,” and “frank.” 
Thus the present results, obtained with pre- 
adults, cross-validate and extend the general- 
ity of the earlier results, obtained with adults. 

The results of both studies were (a) highly 
significant, statistically, (b) internally con- 
sistent within factors, and (c) consistent 
across samples. The uniformly high level of 
statistical significance shown in Table 2 
(Point a just mentioned) is due in part to the 
large sample sizes. Although a correlation of 
-11 accounts for only 1% of the criterion 
variance, it is, nevertheless, significant at the 
001 level with a sample of 1,029. The con- 
sistency within factors (Point b) is not sur- 
prising, since variables comprising each factor 
are highly intercorrelated; however, the ex- 
traordinary consistency across samples (Point 
c) was surprising; and because of this, we 
checked the analytic procedures more exten- 
sively than usual, but found no evidence of 
error. 

Multiple discriminant analyses permitted 
smokers and nonsmokers to be discriminated 
with accuracy ranging from 68% for junior 
high school males, to 79% for junior high 
school females. Cross-validation analyses 
yielded 66% accuracy of classification of 
smoking status. In the present study of pre- 
adults, as in the earlier study of adults, use 
of peer rating measures of personality, in 
association with multivariate analyses, pro- 
vided distinctly greater discriminative power 
than that reported in any previous study of 
the relationship between personality and 
smoking status. 

Tf one believes that the effectiveness of 
antismoking educational campaigns will in- 
crease as more is learned concerning the 
Psychodynamics of smoking, then it becomes 
important to determine whether information 
now available from studies of adults can be 
used to guide campaigns directed at preadults 


~~ ee 
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PERSONALITY AND SMOKING BEHAVIOR 


—a population which has been used only 
rarely in studies of the psychodynamics of 
smoking. While the present investigations 
have dealt with only one aspect of this com- 
plex matter, the extraordinary similarity be- 
tween the results for adults and preadults does 
provide some justification for using informa- 
tion obtained in studies of adults to guide 
development of antismoking educational cam- 
paigns directed at younger people. 
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EFFECTS OF CUES AND EXAMINER INFLUENCE ON 
TWO WECHSLER SUBTESTS * 


JEROME M. SATTLER? 
San Diego State College 


Two experiments studied the effects of cues on the Wechsler Intelligence 
Scale for Children and the Wechsler-Bellevue Scale. In the first experiment 
170 adolescent Ss received the Block Design subtest, and results showed that 
one cue per item did not significantly affect performance on either the initial 
or repeated administration. In the second experiment 146 adolescent Ss received 
the Block Design and Picture Arrangement subtests, and results showed that a 
series of cues per item significantly raised scores on both the initial and re- 
peated administrations of both subtests. Test-form order significantly affected 
scores, while subtest order and Es did not. 


The administration, scoring, and interpreta- 
tion of intelligence tests are affected by many 
variables such as test content, procedures for 
administration and scoring, Æ competence, 
and the interpersonal relationship between the 
E and the S (Sattler & Theye, 1967). The 
present investigation studied the effect of a 
number of these variables on test performance. 
The primary objective was to determine how 
systematic alterations in test procedures affect 
performance on two Wechsler subtests, Picture 
Arrangement (PA) and Block Design (BD). 
These two subtests were selected because they 
especially lend themselves to the development 
of a series of specific, graduated cues which 
can be systematically administered. The issue 
of cue effects arises in part from recent sug- 
gestions made by Allison, Blatt, and Zimet 
(1968) that systematic cues be provided on 
the BD as part of the standard administrative 
procedure, Additional objectives were to eval- 
uate the relationship between the Wechsler 
Intelligence Scale for Children (WISC) and 
Wechsler-Bellevue (W-B) Scale Form I for 
the PA and BD subtests, to examine E dif- 
ferences, and to study the effects of test form 
and subtest order on performance. 


The following hypotheses were advanced: 


1 This study was supported by Research Projects 
OE 5-10-429 and OEG-4-7-078057-0402 from the 
United States Department of Health, Education, and 
Welfare, Office of Education. The writer is grateful 
to Edward A. Jacobson and to Leonard L. Tozier for 
a helpful suggestions. 

equests for reprints should be sent to ‘erome 
M. Sattler, who is now at Miami i De- 
partment of Psychology, Oxford, Ohio 45056. 


(a) Violation of standard procedures by ad- 
ministering cues during the first administration 
of the subtest(s) has no effect on test per- 
formance during either the first administra- 
tion or a subsequent administration of a com- 
parable form of the subtest(s) given im- 
mediately after the first subtest(s). (b) There 
is no relationship between cues and test per- 
formance. (c) The scores obtained by different 
Es do not differ significantly. Two experiments 
are reported which evaluated the hypotheses. 
In the first experiment only the BD subtest 
was used, while in the second experiment the 
BD and PA subtests were used. 


EXPERIMENT I 
Method 


Subjects. The Ss were eighth- and ninth-grade 
students attending a junior high school in Southern 
California. The experiment was designed to evaluate 
the performance of students with average intellectual 
ability who had no severe emotional or learning 
difficulties. To satisfy these two criteria, Ss were 
selected if they had School and College Ability 
Test (SCAT) scores within .60 of a standard devia- 
tion from the mean of their class, if they were not 
attending classes for the educationally or culturally 
handicapped, or if they did not have a learning 
disability. Also eliminated from the original sample 
were students who were foreign born or who had 
been previously administered the WISC or W-B. 
Fifty-four Ss initially selected had a perfect per- 
formance, and they were subsequently eliminated 
from the sample because the effects of cues could not 
be evaluated for them. The sample for the statistical 
analyses consisted of 71 male and 99 female ado- 
lescents. Their mean years of age was 14.29 (SD 
= .65 year). 

Examiners. Three graduate students, two male 
and one female, attending San Diego State College 


716 


CUES AND EXAMINER INFLUENCE 


served as the Es. The three Es were psychology 
majors who had completed graduate work in 
intelligence testing and who were working toward 
their master’s degree, When the experiment was 
initiated only one E served in the study, but it 
soon became evident that additional Zs would be 
needed. After the first Æ had tested approximately 
one-quarter of the Ss, the other two Es were em- 
ployed. Since the first Æ participated during the 
entire study, he tested several more Ss than the other 
two Es, 

Recording forms with the names of the Ss were 
arranged in a random order, and the Es obtained 
the names of Ss by taking approximately five or six 
record forms for each day’s testing from the top 
of a pile containing all of the record forms. While 
this procedure was not designed to ensure random 

| assignment of the Ss to the Es, it eliminated any 
selection of Ss by the Es, Each E tested both ex- 
perimental and control Ss. E1 tested 20 control and 

68 experimental Ss, E2 tested 6 control and 33 ex- 

perimental Ss, and E3 tested 7 control and 36 ex- 

perimental Ss. 3 

Procedure. Five groups were used to evaluate the 
effects of BD cues: four experimental groups each 
received a different cue, and a fifth served as the 

= control group. Each cue consisted of constructing a 
specific part of the design in front of S. The four 
cues administered to the respective groups were as 
follows: showing § how to arrange the (a) first 
tow, (b) last row, (c) first column, and (d) last 
column, Cues were administered immediately follow- 
^ ing failure of the design. 

The Ss were assigned separately by sex to one 
of the five groups, Because the W-B does not take 
age into consideration when raw scores are con- 
verted to standard scores, the sample was divided 
into 3-month age intervals corresponding to the 
3-month intervals on the WISC conversion tables. 
Using a table of random permutations of nine, each 
Swithin a 3-month age interval group was randomly 
assigned to one of the five groups. The order of pre- 
sentation of WISC and W-B was counter-balanced 
by assigning odd-numbered Ss to the W-B-WISC 
order and even-numbered Ss to the WISC-W-B 
order, 

Each S received two administrations of the subtest 
with one subtest immediately following the other. 
During the first administration, the experimental 
groups received the cues, while no cues were ad- 
ministered to the control group. During the second 
administration, the alternate form of the test was 
administered, and no cues were presented to any 
of the groups. 

Wechsler’s (1944, 1949) discontinuance criteria 
Were used during both the first and second ad- 
ministrations. An item was considered failed when 
S could not complete it correctly on the first ad- 
ministration. Passing the item after the cue was 
administered was still considered a failure for dis- 
continuance purposes. The time limits stated in the 
manual were used after the cue was administered. 

The Ss were tested individually in rooms provided 
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by the school. The Æ introduced himself to S by 

saying: 
Hello, I am Mr. [Miss] . I am going to 
give you a short test in order to compare your 
present results with previous test scores. Your 
score today, however, will not be recorded on 
your school records and will not affect your 
grades in any way. We would like you to do the 
best you can. Do you have any questions? 


After the testing was completed, Æ said: 


Thank you for cooperating. I’d like to remind 
you that your score does not affect your grades, 
and it isn’t reported on your school records. We 
are interested only in finding out more information 
about our tests. We are testing many students at 
the school, and you were selected by a method 
something like having your name pulled out of a 
hat. 


The following instructions were read to an S 
when a cue was administered: 


I’m going to put together some of the blocks. I 
will make the bottom row [or top row, left 
column, right column, depending on the group to 
which S was assigned]. Now you go ahead and 
finish it. Look at the picture and make one just 
like it. Tell me when you are finished. 


Results è 


Cues. The effectiveness of the four BD cues 
was evaluated by a three-factor analysis of 
variance design with repeated measures on one 
factor. The two independent factors were 
group and sex of S, and administration was 
the repeated factor. A summary of the analysis 
of variance appears in Table 1. The only 
significant factor was the administration: 
Ss achieved significantly higher scores on the 
second administration (X = 10.81) than on 
the first administration (Y = 9.26). Thus, the 
four cues did not differ among themselves or 
from the control group in affecting Ss’ per- 
formance on either the initial or second ad- 
ministration, and sex differences were not 
present. 

Examiner variable. The E variable was 
evaluated in two ways, and both used analysis 
of variance designs. Nonsignificant differences 
were found in the scores the Zs obtained and 
in the number of cues they administered (X 
number of cues = 1.85). The number of cues 
administered in the two test-form orders and 

3 All analyses of variance in both Experiments I 


and II used the unweighted means analysis procedure 
because of unequal numbers, 
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TABLE 1 


ANALYSIS OF VARIANCE OF BD SCORES FOR 
Groups, SEX, AND ADMINISTRATIONS 
IN Experiment I 


Source df MS F 
Between Ss 
Groups (A) 4 11.00 1.04 
Sex (B) 1 29.04 2.75 
AXB 4 8.90 84 
Error (between) 160 10.56 1,00 
Within Ss 
Administrations (C) 1 199.90 63.55* 
AXC 4 1,89 60 
BXC 1 1.54 49 
AXBXC 4 2.99 95 
Error (within) 160 3.14 1,00 
*p <,001, 


in the four experimental groups were also 
found by analysis of variance procedures to 
be not significantly different. 

Order, age, ability, and test-form variables. 
Analyses of variance were performed to deter- 
mine whether the order of administering the 
test forms affected scores and whether the 
ages and ability level of the Ss differed in the 
five groups. The results in each case were not 
significant for these variables. The randomiza- 
tion procedures were thus effective in produc- 
ing similar ages and ability levels in the 
groups. A Scheffé test, used to evaluate 
whether the WISC and W-B total means 
differed, was not significant (t= .70, df= 
160, p > .05); therefore, the two test forms 
produced similar scores. 


EXPERIMENT IT 


Experiment IT was designed to measure 
the effects of a series of cues rather than only 
one cue, and to evaluate two subtests, BD 
and PA. Because the first column cue was 
found to be the most effective in Experiment I 
(although not significantly), it was used asa 


basis for establishing the BD cues in Experi- 
ment II. 


Method 


and ninth grades, 
Experiment I for sel 
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minimal emotional and learning difficulties were also 
used in Experiment II. None of the Ss had par. 
ticipated in Experiment I. There were 49 Ss, initially | 
selected, who had perfect performance on either the 
BD or PA, and they were subsequently eliminated 
from the sample because for these Ss, the effects 
of cues could not be evaluated. The sample for the 
statistical analyses consisted of 71 male and 15 
female adolescents. Their mean years of age was 
13.25 (SD = .67 year). 

Examiners. Four graduate students, three male 
and one female, attending San Diego State College, 
served as the Es. Three of the four Zs participated 
in Experiment I. The fourth ÆE’s experience was 
similar to that of the other three. The first three 
Es began testing Ss at the same time, and about 
halfway through Experiment II, the fourth E was 
employed, The Es obtained the names of their Ss in 
exactly the same manner as described in Experiment 
I. El tested 17 control and 23 experimental Ss, E2 
tested 12 control and 12 experimental Ss, E3 tested 
18 control and 23 experimental Ss, and E4 tested 22 
contro] and 19 experimental Ss. 

Procedure. The Ss received two administrations 
of both the BD and PA subtests. During the first 
administration, the experimental group received cues, 
while the control group was administered the two 
subtests in the standard manner, During the second 
administration, which immediately followed the 
first, the alternate form of the Wechsler test was 
administered, and cues were not administered to 
either of the groups. Four counterbalanced orders 
were used to administer the substests: (a) BD-WISC, 
PA-WISC, BD-W-B, PA-W-B; (b) PA-W-B, BD- 
W-B, PA-WISC, BD-WISC; (c) BD-W-B, PA- 
W-B, BD-WISC, PA-WISC; (d) PA-WISC, BD- 
WISC, PA-W-B, BD-W-B. 

When the experimental Ss failed to complete items 
correctly, they received a series of specific, graduated 
cues designed to facilitate successful completion of 
the items immediately after failing an item. Three 
cues were designed for each subtest. The BD cues 
were as follows: Cue 1—The S received 50% ad- 
ditional time beyond the completion time stated in 
the manual. Cue 2—The E arranged the first column 
of blocks. Cue 3—The E arranged the first column 
Plus an additional number of blocks. The number 
of blocks arranged in Cue 3 was as follows: three of 
the four blocks for a 4-block design (first column 
plus first block of second column), first two columns 
for a 9-block design, and first three columns for 4 
16-block design. 

The PA cues were as follows: Cue 1—The S re 
ceived 50% additional time beyond the completion 
time stated in the manual. Cue 2—The E arranged 4 
specified number of cards. When the series containe 
three or four cards, E arranged the first card. 
When the series contained five cards, E arrange 
the first and second cards, and when the series con- 
tained six cards, E arranged the first, second, an 
third cards. Cue 3—The E arranged a greater number 
of cards than in Cue 2, with the exception of the 
series containing three cards. For a three-card 
series, the number of cards arranged by E was the 


_ same as in Cue 2, because if two cards were arranged 
in a three-card series, only one card would remain 
for S to arrange. When the series contained four 
cards, E arranged the first and second cards. When 
_ the series contained five cards, E arranged the first, 
second, and third cards. When the series contained 
_ six cards, E arranged the first, second, third, and 
fourth cards, A specific order for the remaining 
cards was used based on the order stated in the 
= manual unless the order would have automatically 
resulted in a correct solution. In these cases a dif- 
ferent order was used for the remaining cards. 

If S successfully completed the item after the first 
cue, E presented the next item. If S was unable to 
complete the item after the first cue, the second 

_ cue was administered. If S was still unable to com- 
plete the item correctly, the third cue was ad- 
ministered. As in Experiment I, Wechsler’s (1944, 
1949) discontinuance criteria and time limits (ex- 
_ cept for the first cue) were used during both the first 
and second administrations. Testing conditions were 
similar to those described in Experiment I. 


Results 

Cues. The subtest scores were evaluated by. 
“a four-factor analysis of variance design using 
group and sex as the two independent factors, 
‘and administration and subtest as the two 
" tepeated factors. The results of the analysis 
_ of variance, presented in Table 2, show that 
‘three significant Fs were found which in- 
dicated the following: (a) the experimental 
group (X=10.10) achieved significantly 
higher overall scores than the control group 
(X = 9.30), thus the cues improved perform- 
ance; (b) second administration scores (X = 
10,43) were higher than first administration 
Scores ( = 9,00); and (c) BD scores (X = 
10.04) were higher than PA scores (X= 
9.40). Because none of the interactions was 
Significant, the results also indicate that the 
experimental group achieved significantly 
higher scores than the control group on both 
the first and second administrations, and that 
both the experimental and control groups ob- 
ined significantly higher scores on the second 
administration of both subtests. 

_ Examiner variable. The E variable was 
evaluated in three ways, and all used analysis 
variance designs. Nonsignificant differences 
Were found among Zs in the scores they ob- 
fained on both subtests, and in the number 
cues (X = 3.11) they administered in the 
tious orders of administration. However, 
When the number of items on which cues were 
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TABLE 2 


ANALysIs OF VARIANCE OF BD Anp PA Scores 
FOR Groups, SEX, SUBTESTS, AND ADMIN- 
ISTRATIONS IN EXPERIMENT IT 


Source af MS F 
Between Ss 

Groups (A) 1 92.03 6.40* 
Sex (B) 1 17.31 1.20 
AXB 1 22.25 1.54 

Error (between) 142 14,37 

Within Ss 

Subtests (C) 1 53.83 7.86** 
AXC 1 11.48 1.67 
BXC 1 24,98 3.64 
AXBXC 1 2.32 34 
CX Ss 142 6.84 
Administrations 

(D) 1 284.26 62.01*** 
AXD 1 3.44 75 
BXD 1 04 00 
AXBXD 1 13 02 
D X Ss 142 4.58 
CxD 1 7.83 1,92 
AXCxD if 03 .00 
BXCXD 1 2.34 Eyi 
AXBXCXD 1 1.96 48 
CXDX Ss 142 4.08 

* 

#5 St 
we b <.001, 


administered was used as the dependent vari- 
able, three significant Fs appeared: the E 
factor (F = 2.98, df = 3/52, p < .05), order 
factor (F = 3.82, df = 3/52, p < .05; WISC 
BD first X = 2.06, W-B PA first Ẹ = 1.70, 
W-B BD first X = 2.39, WISC PA first X 
= 1.85), and the Æ x Order interaction (F = 
2.14, df = 9/52, p < .05). The four E means 
within each order were tested by use of the 
Newman-Keuls procedure. The results in- 
dicated that there were no significant differ- 
ences among the four Es’ means when the 
WISC BD or W-B PA was administered first. 
However, when the W-B BD was administered 
first, H4’s mean (X = 3.50) was significantly 
higher than the means of each of the three 
other Zs. The means for E1 (X = 2.08), E2 
(X= 2.00), and E3 (X = 2.00) were not 
significantly different from one another. When 
the WISC PA was administered first, the 
mean of El (X=1.00) was significantly 
lower than the means of the other three Es 
(X= 2.00, X=2.16, and X=2.22, re- 
spectively). The results indicated that one of 
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the four Zs gave cues on more items than the 
other three Zs in one order, and another Æ 
gave cues’ on fewer items than the other three 
Es in another order, 

Order Variable. The effect of order of ad- 
ministration was evaluated by a three-factor 
analysis of variance design. The group by 
order of administration was the independent 
factor (eight levels), and administration and 
subtest were the repeated factors. The group 
by order factor was not significant (F = 1.30, 
dj = 7/138, p > .05), while the administra- 
tion factor (F = 8.22, df = 1/138, p < .01), 
subtest factor (F = 80.91, df = 1/138, p< 
001), and the Group by Order x Subtest in- 
teraction (F = 6.50, df = 7/138, p < .001) 
were significant, 

The significant interaction indicates that 
significant differences between the BD and 
PA subtests are dependent on the administra- 
tion order, Individual mean comparisons, 
using the total means for each subtest (first 
plus second administrations), made between 
the BD and PA subtests in each of the eight 
group by orders of administration revealed 
that significantly higher BD than PA scores 
were obtained in both the control and experi- 
mental groups on the two orders having the 
W-B first, and that the experimental group 
obtained significantly higher BD than PA 
scores on one order which had the WISC ad- 
ministered first. The individual mean compari- 
son results suggest that in the control group 
there is a greater disparity between the BD 
and PA subtest scores when the W-B is ad- 
ministered during the initial administration 
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Age and ability variables. Analyses of va 
ance were performed to determine whether 
ages and ability level of the Ss differed in 4 
groups. The results in each case were not 
nificant. 


Discussion 


Performance on either the initial or sub. 
sequent administration of the BD was ne 
affected in Experiment I when Ss were give 
single cues consisting of constructing pa 
the design (first or last columns or first 
last rows) on a mean number of 1.85 de: 
However, when more extensive BD cues wen 
given in Experiment II by providing cues 6 
a mean number of 2.00 designs and providin 
a mean number of 3.11 cues (i.e., giving mor 
time, constructing first column, and construch 
ing first and part or all of the second and, 
third columns), Ss scored higher on the fi 
and second administrations of the BD th 
those Ss not receiving cues on the first 
ministration. In Experiment II, PA perform 
ance was also significantly improved on b 
the first and second administrations when | 
series of cues was administered. These Ti 
sults have implications for the examinati¢ 
procedure. ’ 

The results indicate that alterations 
standard procedures, when minor, are 
likely to affect BD performance; ho 
when extensive, BD performance is aff 
The E should therefore not introduce ex 
sive testing-the-limit procedures during 
examination; rather, he should wait until 
examination has been completed before $ 
procedures are attempted. Allison et aly 
(1968) suggestion that graded help be intro 
duced during the administration of the BY 
subtest is clearly untenable because the pre 
ent results show that help procedures can 
nificantly raise performance. The suggestio 
made by Wechsler (1958) that stories be 
tained on the Wechsler Adult Intellig 
Scale PA items immediately after each 
is completed must also be questioned. 
Procedure introduces an additional variable 
which may affect S’s performance. By tellin 
a story after he has completed his arran 
ment, S has time to evaluate further his a 
rangement. He may see that his arrangeme 


CUES AND EXAMINER INFLUENCE 


was incorrect, and thereby possibly obtain 
insight which may be useful to him in solving 
the next arrangements. The results of the 
present study do not provide an answer to 
whether test scores are altered when stories 
are asked. However, this latter modification 
of test procedure should be investigated for 
its possible effects on test performance, 

Because the data were collected on two 
subtests from average ability students with 
minimal learning and emotional difficulties, it 
is difficult to generalize the findings to other 
subtests or groups. Other subtests and groups 
therefore should be studied in future research 
concerned with the effects of cues on test per- 
formance. 

Other results point to the following: First, 
the Æ variable does not necessarily affect the 
reliability of the intelligence test score, and 
the sex of the Æ is not necessarily a significant 
variable either, Second, WISC and W-B BD 
and PA subtests produced mean scores which 
were not significantly different. Third, the sub- 
test order of administration and the test- 
form order of administration are very com- 
plex variables which at times may affect S’s 
or E’s performance. Departures from the 
standard subtest order of administration must 
therefore be carefully evaluated for their pos- 
sible effects on performance. Much remains 
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to be learned about how and why different 
administrative orders affect test performance. 

Approximately 25% of those tested were 
eliminated from the analyses because they 
passed all of the items on either the BD or 
PA subtests, Thus, these subtests did not pro- 
vide adequate ceilings with a population of 
average ability adolescent students. It is rec- 
ommended that future studies using the WISC 
or W-B which focus on the effects of modify- 
ing standard procedures use either younger 
age groups or develop additional items to ex- 
pand the range of difficulty of the subtests. 
The ceilings of other tests or subtests may 
also be limited, and it is recommended that 
tests used in similar investigations be evalu- 
ated for their range of difficulty. 
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PERSONALITY CORRELATES OF ACCURATE EMPATHY 
IN A COLLEGE COMPANION PROGRAM * 


GEORGE J. VESPRANI? 
University of Cincinnati 


A cross-validation of research by Bergin and Solomon on personality test cor- 
relates of empathy was attempted using 33 untrained female college students 
as quasi-therapists. Their interactions with hospital patients were recorded, 
sampled, and rated using the Truax Accurate Empathy Scale, Accurate empathy 
(AE) ratings were correlated with student personality measures from the 
Edwards Personal Preference Schedule and the Minnesota Multiphasic Per- 
sonality Inventory (MMPI). The results partially support data reported by 
Bergin and Solomon in that Depression and Psychasthenia scales of the MMPI 
were found to be negatively and significantly correlated with AE ratings of 
students, In addition, this study provides data about changes in levels of AE 
over time and methodological pitfalls in sampling this variable. 


In recent research on those variables that 
presumably affect the course of psychotherapy, 
empathy has been shown to be an important 
and relevant one. Truax (1963), as well as 
other researchers, has shown that when high 
levels of empathy are communicated to pa- 
tients, the likelihood of patient improvement 
is increased. In fact, Truax (1963) and 
Bergin (1963) have shown that a low level 
of empathy on the part of the therapist may 
actually lead to patient deterioration. It seems 
imperative, therefore, that all therapeutic en- 
deavors, in an attempt to maximize patient 
progress, provide a high level of this therapist 
variable, Certainly, this has relevance for all 
of the established and accepted forms of 
therapeutic encounters from usual forms of 
psychotherapy to pastoral counseling. 

For some time, the mental health profes- 
sions have been suffering a critical shortage 
in manpower. The professions of psychiatry 
and clinical psychology are producing practi- 
tioners at an ever-increasing rate, yet the de- 
mand continues to outstrip the supply. The 
mental health manpower needs of the country 
have been well documented by Albee (1959) 
and Schofield (1964). 
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This high demand for therapeutic services 
and the short supply of practitioners has re- 
sulted in the use of as yet untapped resources 
to help fill the gap between supply and de- 
mand. This in turn has led to the establish- 
ment of subprofessional groups to carry out | 
and implement some of the mental health 
practices which the short supply of profes- | 
sionals presently does not permit. Such pro- | 
grams have crystallized around the develop- 
ment of the mental health technician, for ex- 
ample, the work of Holzberg (1963, 1965) 
and others in the training and use of college 
students in the mental hospital and that of 
Rioch, Elkes, Flint, Usdansky, Newman, and 
Silber (1963) in the training of mature but in- 
experienced housewives to perform as thera- 
pists. Truax and Carkhuff (1967, Ch. 7) have 
recently reported on a program for the train- 
ing of lay hospital personnel as therapists. 

This approach raises some serious questions 
about the type of people who seek these op- 
portunities and, in addition, some concern — 
about the levels of empathy that these sub- 
professionals and novices can provide. In gen- 
eral, one may ask, “What personality charac- 
teristics are necessary for a successful helping 
relationship?” This question has, in part, been 
investigated in the research of Bergin and 
Solomon (1963) on some of the test correlates 
of empathy. They attempted to compare the 
level of empathy of 18 postinternship students 
in clinical and counseling psychology (using 
the Truax Accurate Empathy Scale) with 
Minnesota Multiphasic Personality Inventory 
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(MMPI) and Edwards Personal Preference 
Schedule (EPPS) profiles. The results they 
obtained, though based on a relatively small 
sample, are provocative and partially support 
other findings with regard to the positive re- 
lationship of empathy and adjustment. On the 
MMPI, for example, they found the Depres- 
sion (D) and Psychasthenia (Pt) scales to be 
negatively and significantly correlated with 
accurate empathy (AE). Other negative, 
though not significant, correlations were re- 
ported for those scales indicative of personal- 
ity disturbance. The EPPS data indicated 
that Consistency (Con), Order (Ord), and 
Intraception (Int) scales are all negatively 
and significantly related to empathy, while 
Dominance (Dom) and Change (Chg) scales 
correlated positively and significantly with 
AE. 

Perhaps the most significant finding of this 
study was the reported negative relationship 
between measures of subjective discomfort 
(Pt and D scales of the MMPI) and AE. 
This suggests that relevant standards for the 
selection of clinical psychologists or therapists 
in general should include some measure of 
these significant personal attributes. 

This paper reports an attempt to cross- 
validate and extend the previous work of 
Bergin and Solomon (1963) using a group of 
untrained college students in a companion- 
like program instead of graduate students in 
clinical and counseling psychology. Such a 
program, based on the work of Holzberg 
(1963, 1965), uses untrained college students 
who are to act in a quasi-therapeutic relation- 
ship with mental patients. In addition, this 
study attempted both to evaluate the initial 
level of AE and to explore longitudinally the 
development of AE over a course of contacts 
between patients and college companions. 

On the basis of the existing research data 
of Bergin and Solomon, it was hypothesized 
that companions with high scores on the D 
and Pt scales of the MMPI would manifest 
a low level of AE in their relationship with 
these patients. A second hypothesis was that 
those companions with high scores on the Dom 
and Chg scales of the EPPS would manifest 
high levels of AE. The final hypothesis was 
that a longitudinal evaluation would show an 
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increase in the level of AE as a result of 
lessened anxiety on the part of the companions 
and their greater familiarity with the task. 


METHOD 


To accomplish these goals, students in abnormal 
psychology classes at the University of Cincinnati 
were administered the EPPS and the D and Pt 
scales of the MMPI. Thirty-three female volunteers 
were then solicited from these classes to participate 
as companions to female patients in a local mental 
hospital. These companions ranged in age from 20-25 
years with a median age of 21 years. The previous 
work of Bergin and Solomon (1963) suggested a 
negligible relationship between grade average, intel- 
ligence (as reflected by the verbal score on the Grad- 
uate Record Examination), and AE. For this reason, 
these data were not gathered or used as selection 
criteria. The decision to use only female Ss was 
spurred by the fact that previous studies in this area 
have used mixed populations and have not con- 
trolled for this variable. As companions, these stu- 
dents acted in a quasi-therapeutic relationship with 
a patient arbitrarily assigned to them for a series of 
five 1-hour sessions over a 3-week span of time. Pa- 
tients were selected solely on the basis of their 
ability to communicate at least at a minimal level 
as judged by the ward psychiatrist. These patients 
ranged in age from 17-68 years, with the median 
age of 36 years. Fourteen of these patients had a 
schizophrenic diagnosis, usually undifferentiated type, 
five others were considered psychotic depressives, 
three had a diagnosis of alcohol or drug intoxication, 
and the remainder had a variety of neurotic condi- 
tions usually with anxiety or depression as a main 
focus. The hospital in which they were patients is a 
receiving hospital and usually keeps its patients for 
90 days or less, 


Training of Companions 


Most of the students participating as companions 
had no prior experience as interviewers, It was felt, 
therefore, that they should have some acquaintance 
with the tape recorder, the materials, and the setting 
prior to their actual interview experience, To ac- 
complish this, the companions were brought together 
at the hospital for a group meeting. A written and 
verbal description of their task was presented to 
them, including the general instructional set to “Get 
to know the patient as well as possible.” They were 
told to be good listeners as well as interviewers. 
Their role was to be “companionable, to ask ques- 
tion’ which invite the patient’s participation, and to 
comment on what the patient says.” In addition to 
the verbal and written description of their task, they 
were given a vicarious learning experience. This con- 
sisted of listening to several 10-minute taped samples 
of interactions between students and patients, fol- 
lowed by an opportunity to role play both the pa- 
tient and companion roles, After this brief training, 
they were assigned to their patients. 
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TABLE 1 


Mrans AND STANDARD DEVIATIONS OF 
MMPI anp EPPS Variantes 


a Note-—T scores for EPPS data and raw scores for MMPI 
jata, 
TH = 12 for MMPI 


Measurement oj Accurate Empathy 


These sessions with patients were tape recorded 
and each recording was arbitrarily divided into four 
segments of equal length, Two, 2-minute verbal sam- 
ples were then randomly 


during the session and, in addition, provide the op- 
portunity to on change in the level of AE 
within each should this occur, To obtain a 


samples. 
Although the typewritten samples contained s 
versation of both patient and companion. this isois 


ile 
if 
ih 
ih 


understanding in a language attuned Sh 
current felags {p. 2). Involved in the proces 
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to work with this scale, They, too, were females of 
similar age and educational background to the com- 
panions and had a limited psychological background, 
They were trained through the use of trial samples 
before proceeding to the actual data, Once these AE 
ratings were obtained, they were correlated with the 
above-mentioned test variables and compared to the 
Bergin and Solomon data. 


RESULTS 


The three raters were able to achieve a re- 
liability of .60 for the total data as calculated 
by use of the intraclass correlation formula 
described by Ebel (1951). This is a statisti- 
cally significant level of reliability, identical 
to that previously reported by Kiesler, Ma- 
thieu, and Klein (1966), who used a similar 
population of raters and essentially identical 
training procedures. 

Table 1 presents the means and standard 
deviations for MMPI and EPPS variables. 
The means and standard deviations of EPPS 
variables have been converted to T scores and, 
as such, the data from the present study and 
the Bergin and Solomon study can be directly 


TABLE 2 
CoxgetaTions or Summen AE Scores with EPPS 
AND MMPI VARIABLES IN THE PRESENT STUDY 
AND CORRELATIONS FOR THE BERGIN AND 
Soromon Srupy 


Bergin and 

Variable ire ah Solomon 
EPPS 

Con — 131 

Aft —.081 

Def 027 

Ach 046 

Ord .079 

Exh 070 

Aut 149 

Int 162 

Suc =.350** 

Dom 198 

Aba —.013 

Nur AIS. 

Chg —.340%* 

End 224 

Het —.252 

Agg —.085 
MMPI 

D —.336" —4Al 

Pi —.288* —.54° 
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TABLE 3 
Means AND STANDARD Deviations OF AE RATINGS 


Bergin and 
Total Solomon 
Time 1 3 5 (1+3+5) 
M SD M SD M SD M SD M SD 
Early 2.90 .80 2,90 86 2.91 80 2.90 83 
Late 2.93 84 2.78 81 2.69 82 2.80 83 
Total 2.91 84 2.84 84 2.80 82 2.85 83 2.51 1.01 


compared. MMPI means and standard devia- 
tions are also reported in Table 1, but as raw 
‘scores without K correction.? The K correc- 
tion was not available because the entire 
MMPI was not administered. 

Table 2 presents correlational data for AE 
scores and test variables. Only the first hy- 
pothesis with regard to MMPI variables was 
verified, as the D and Pt scales of the MMPI 
were shown to have significant negative cor- 
relations with AE. A correlation of —.336 was 
obtained for D and —.288 for Pt, both of 
which are significant at the .05 level. The 
second hypothesis, which predicted that Dom 
and Chg scales of the EPPS would be sig- 
nificantly and positively correlated with AE, 
was not confirmed, Although Dom was posi- 
tively related, it failed to reach significance, 
and Chg, contrary to the prediction, was sig- 
nificantly but negatively correlated with AE 
(—.34, p < .05). 

Inspection of Table 3 indicates that AE did 
not increase as a function of the number of 
Sessions as predicted in the third hypothesis. 
Instead, AE scores obtained from samples 
taken early (second segment) in the session 
temained remarkably stable over the series of 
five sessions. This is not true, however, of the 
AE scores obtained late (fourth segment) in 
the sessions, The late samples generally tended 
to yield a noticeably lower level of AE than 
early samples within each session, and, in ad- 
dition, this tendency increased as a function 
Of sessions. In effect, the decrease in level of 
AE between early and late samples is greater 
in Session III than in Session I and greater 


* Bergin kindly made the raw score data for MMPI 
Scales available so that comparisons could be made. 


in Session V than Session III. This change is 
graphically represented in Figure 1. 

An analysis of variance indicated that this 
early-late differential in AE scores failed to 
reach statistical significance. The differences 
in total AE scores across sessions as well as 
the Sessions X Sample Location interaction 
were likewise not significant. An attempt was 
made to further investigate this tendency for 
late samples to decrease as a function of ses- 
sions by means of a trend analysis as described 
by Edwards (1964). The goal here was to see 
if this decline over sessions actually defined a 
negative slope as significantly different from 
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Fic. 1. Mean AE scores for early, late, and 
combined samples by sessions. 
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a horizontal line. This analysis revealed a 
trend which approached but failed to reach 
significance at the .05 level. The obtained F 
ratio was 3.86 and an F ratio of 3.90 is needed 
for significance at the .05 level. 


DISCUSSION 


The data with regard to the MMPI vari- 
ables suggest that the admission of neurotic 
symptomatology, as indicated by high scores 
on D and Pt, counterindicates high levels of 
AE. This was previously reported by Bergin 
and Solomon (1963) and has now been cross- 
validated using a much different population 
in which training level was markedly different 
and participants were restricted to females. 
This finding supports the common sense as- 
sumption that poorly adjusted people have 
trouble being empathic although the present 
data do not suggest why this is the case. 

This study has implications for companion- 
like programs in that it indicates naive female 
Ss can provide a level of AE at least as high 
as that of advanced graduate psychology stu- 
dents as reported by Bergin and Solomon (see 
Table 3).* Companions in this study attained 
a degree of AE approaching Level 3 on the 
Truax Accurate Empathy Scale.” The question 
of whether or not this is an adequate level 
for notable improvement of the patient relates 
to the relative usefulness of companion pro- 
grams. If one considers the level of AE offered 
by graduate psychology students as adequate 
to initial progress in therapy, then one must 
also consider a similar level in companions as 
acceptable. If this is not the case, that is, if it 
is felt that a level of AE approaching Level 3 
on the Accurate Empathy Scale is not suf- 
ficient to produce any patient gains in the 
therapeutic or quasi-therapeutic context, then 


4A one-to-one comparison is difficult if not im- 
possible in that Bergin and Solomon, for the purposes 
ot their ee rey the AE scale to 10 points 

inserting an additi i 
Da F on a ional point between Empathy 

5 Truax (1961) defines Level 3 as: 

Therapist responds accurately to client’s more ex- 

posed feelings. He also displays concern for the 

deeper, more hidden feelings, which he seems to 
sense must be present, though he does not under- 
stand their nature. The therapist seems to assume 
the Presence of deep feelings, although he does not 
sense their meaning to this particular patient [p. 8]. 
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we need to take a look at the training of 
graduate students who major in clinical areas, 
It is indeed surprising that naive Ss can mani- 
fest a level of AE of similar magnitude to 
those trained to be therapeutic practitioners, 
This is even more surprising in that most 
graduate programs provide the student with 
extensive therapeutic experience, although 
without a specific emphasis on empathic un- 
derstanding per se. Perhaps it is felt that with 
professional growth and experience, as well as 
good psychological adjustment on the part of 
the practitioner, one’s empathic abilities are 
sharpened and do not require specific training. 
Certainly the level of AE is higher in profes- 
sionals and it can be assumed that if this 
level were not a function of training or selec- 
tion, it must have grown out of experience. 
Just as one might expect graduate students 
to improve in level of AE offered to a patient 
as a function of experience, if for no other 
reason than the relatively higher level of AE 
offered by professionals, one might expect an 
advancement in AE scores of companions 
with further experience. The degree of im- 
provement and the need for specific training 
to aid this process is a suitable subject for 
future research. 

No formal evaluation was undertaken to 
assess the experiential aspects of the study 
from the companion’s viewpoint. Most com- 
mented that it was a valuable and interesting 
experience and that they would continue in 
such a role if the opportunity were available. 
This was similar to the patients’ verbalized 
comments that the experience was helpful. 
The patients filled out a five-item rating scale 
characterizing their experience. This resulted 
in a rating set in which the extreme positive 
rating was chosen most often and may be a 
reflection of what they felt E expected as 
opposed to an accurate representation of their 
investment. This aspect certainly needs fur- 
ther investigation. 

The second major finding of this study re- 
lates to the methodological problems regard- 
ing the sampling of this variable. It has been 
thought that AE was a stable therapist vari- 
able. Indeed, if one samples from the early-to- 
middle portions of the therapeutic interaction, 
one obtains a reasonably stable measure of 
AE. As demonstrated by the present study, 
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however, sampling in the last quarter results 
in a consistently lower measured level of AE. 
Not only does the level of AE decrease within 
the session itself, as a function of time, but 
the decrease accelerates with each succeeding 
session. This trend has not been reported be- 
fore because other researchers have chosen to 
sample the early segments of the therapeutic 
interaction. The only previous report of in- 
stability in this therapist variable comes from 
Kiesler et al. (1966), who reported on longi- 
tudinal sampling from the last half of the 
first 15 sessions of the therapeutic interaction. 
He reported considerable variability in the 
measured level of AE in the first seven or 
eight sessions and a stabilization at about the 
twelfth session. 

The meaning of this trend is open to specu- 
lation at present, but its implication is that 
sampling of this variable from therapy inter- 
actions should take cognizance of what ap- 
pears to be a systematic change in AE. 
Certainly, this stability, or lack of it, has 
relevance when one attempts to interpret the 
results of such studies. This suggests (a) that 
studies might be done taking more as well as 
standardized samples from each session to 
ascertain other fluctuations or trends within 
the session, that is, sample from the same 
location for all sessions sampled; and (b) 
a further exploration of longitudinal trends 
in AE over the full range of therapeutic con- 
tact is needed. Most significant of all is the 
suggestion that care must be taken in inter- 
preting measurement results with regard to 
sample location. 

In summary, the cross-validation of the 
Bergin and Solomon study, which this study 
attempted, produced some rather interesting 
results. Perhaps the most significant finding 
of this study is the confirmation of the hy- 
pothesis regarding the relationship of the D 
and Pt scales of the MMPI to AE. The data 
Suggest that neurotic self-descriptions, as indi- 
cated by high scores on the D and Pt scales 
of the MMPI, counterindicate high levels of 
AE. This study did not cross-validate the 
findings of Bergin and Solomon with regard 
to EPPS variables, suggesting only a super- 
ficial and variable relationship to AE. Finally, 
the second major finding of this study is dem- 
onstrated in the tendency for the level of AE 
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to decline between the second segment sample 
and the fourth segment sample within each 
session. In addition, this decrease in level of 
AE becomes more marked as a function of in- 
creasing sessions. This trend seems to imply 
that sampling of this variable from therapy 
interactions should take cognizance of what 
appears to be a systematic change in the level 
of AE over time. 
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The Missouri Children’s Behavior Checklist is a set of descriptions of children’s 
behavior that may be rated by a child’s parent. This checklist is based on 
the literature dealing with observers’ reports of children’s behavior in a 
number of different settings and consists of items that have been shown to 
cluster together into six relatively independent dimensions or scales (Aggres- 
sion, Inhibition, Activity Level, Sleep Disturbance, Somatization, and Soci- 
ability). The scales can be rated with reasonable reliability, and the scale 
scores, based on ratings by mothers, discriminate at least two groups of 
clinically different children. The dimensions of the Missouri Children’s Be- 
havior Checklist appear to be sufficiently independent, reliable, and dis- 
criminating to warrant further use as one approach to the quantitative 


evaluation of the behavior of children. 


This report deals with the construction of a 
checklist (Missouri Children’s Behavior 
Checklist) for the assessment of several rele- 
vant dimensions of children’s behavior. The 
primary reason for developing this checklist 
was to provide a method for identifying groups 
of children, each of which would be at the 
extreme of one of several clinically or theo- 
retically significant dimensions of children’s 
behavior. These checklist-identified groups, 
quantifiably different in described behavior, 
were meant to serve as criterion groups in an 
ongoing program of research designed to con- 
struct and validate an objective, nonverbal, 
personality test for children (Sines, Pauker, 
& Sines, 1966). A by-product of the con- 
struction of the checklist has been the de- 
velopment of the several checklist scales 
which, apart from their use in the definition of 
criterion groups, have shown promise in them- 
pre of providing useful research and clinical 

ata. 

The literature dealing with empirical stud- 
ies of the behavior of children shows marked 
consistency in pointing up a number of di- 
mensions that appear to describe the behavior 
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of children even when those dimensions were 
derived from rather different populations of 
children and somewhat different methodologies 
were used (Dreger, Reid, Lewis, Overlade, 
Rich, Taffel, Miller, & Fleming, 1964; Jenkins 
& Hewitt, 1944; Ross, Lacey, & Parton, 1965; 
Spivack & Levine, 1963—1964). On the basis 
of this literature, it seems clear that within 
broad limits, professionals as well as parents 
and other nonprofessionals use highly similar 
general language, concepts, and dimensions of 
behavior to describe children’s behavior 
(Walker, 1967). 

Views concerning the number of dimensions 
that are necessary and sufficient to reflect 
behavioral differences between children are 
characterized by less agreement between in- 
vestigators; some feel that a small number 
of dimensions can account for the valid vari- 
ance in reports of children’s behavior (e.g., 
Peterson, 1965) and some have located and 
tentatively labeled a relatively large number 
of factors or clusters of specific behaviors 
(e.g., Dreger et al., 1964; Spivack & Levine, 
1963-1964). There are equally important 
related questions, of course, such as whether 
behavioral or personality dimensions, however 
identified and in whatever number, will allow 
us to improve our prediction of children’s be- 
havior and whether they will allow us to dis- 
criminate between different etiologies of vari- 
ous patterns of children’s behaviors. 
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As the first step in a larger study of these 
latter questions, the authors examined the 
literature for behavior descriptions and 
clusters of descriptive statements that ap- 
peared to relate to behavior dimensions of 
particular clinical value. Previous investigators 
generally made use of parents’ reports of their 
children’s problem behavior (Dreger et al., 
1964) or statements of referral problems or 
case records in child guidance clinics or 
other institutions (Jenkins & Glickman, 1946; 
Peterson, 1961; Spivack & Levine, 1963- 
1964). Ross et al. (1965) recognized this 
bias toward pathology and included a set of 
“good adjustment” items in a “pro-social 
behavior” checklist scale. 

In each of the numerous studies that have 
been reported (Dreger et al., 1964; Fish & 
Shapiro, 1964; Patterson, 1964; Peterson, 
1961; Quay & Quay, 1965; Ross et al., 1965; 
Spivack & Levine, 1963-1964; Walker, 1967), 
children were rated by parents, teachers, in- 
stitutional supervisors or clinicians either as 
showing or as not exhibiting or not possessing 
each of the behaviors or characteristics 
selected for study. The rating data were 
typically subjected to some sort of clustering 
procedure, usually factor analysis, in order 
to identify those behavior items that tend to 
occur together. 

In many of the studies previously listed, 
three consistent and relatively independent 
sets or clusters of behavioral characteristics 
have been noted. These three factors or 
clusters have been variously labeled, but may 
reasonably be referred to as “Aggression,” 
“Inhibition,” and “Activity Level.” 

In addition to these three frequently noted 
factors, several additional clusters of behaviors 
have been described, primarily in the work 
of Dreger et al. (1964). Those of greatest 
clinical interest here are “Sleep Disturbance,” 
“Somatization,” and “Sociability.” 

The behavior checklist reported in this 
paper was derived from several of the check- 
lists and behavioral descriptions which ap- 
peared in the literature referred to above. 


METHOD 


The final form of the Missouri Children’s Be- 
havior Checklist, consisting of 70 statements, was 
distilled statistically from 95 behavior-descriptive 
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statements which made up an original form of the 
checklist, The original 95 statements were selected 
from the existing literature to cover six dimensions 
of behavior: aggression, inhibition, activity level, 
sleep disturbance, somatization, and sociability. No 
item contributes to more than one dimension. The 
individual items were selected from the published 
reports if they were listed with factor loadings of 
30 or greater, if they were employed by the in- 
vestigator in his narrative description of the be- 
havior dimension, or if they appeared in the same 
cluster in more than one study. 

Data on the original checklist were collected in 
15 child psychiatry, pediatric, and mental health 
clinics in the United States and Canada.* Most of 
the children, 404 boys and 250 girls, ages 5 through 
16 years, were seen for psychological evaluation, 
The mother of each of these children was asked 
to indicate on the checklist (by circling a “Yes” or 
a “No”) whether or not her child had shown the 
behavior described in each of the 95 statements 
during the previous 6 months, 

The analyses reported below deal with the check- 
lists of the 404 boys, Similar analyses will be made 
for girls when considerably more data have been 
collected. 


RESULTS 
Construction of the Final Checklist Form 


A score was derived for each of the six be- 
havior dimensions by assigning one point to 
the dimension for each of its items to which 
the mother responded in the direction of the 
published factor loading. Point-biserial cor- 
relations were calculated between each of the 
95 original items and the total score on each 
of the six dimensions or factors. An item 
was retained in the checklist and assigned to 
a dimension if the point-biserial correlation 
between the item and the total dimension 
score was at least .30, and if the square of 
the point-biserial correlation was at least 
twice as large as the square of the correlation 
between that item and the total score on 
any of the remaining five factors. Several 
items were thus eliminated. The checklist 
items and the direction of scoring are listed 
in Table 1. 


3 The authors wish to express their sincere apprecia- 
tion to the following psychologists who furnished 
data on the Missouri Children’s Behavior Checklist 
in connection with this study: Loretta K. Cass, 
Charles S. Cleeland, Bruce R. Cushna, James W. D. 
Hartman, Hyman Hops, Jon Krapf, Luciano L’Abate, 
Donald S. Leventhal, David Markert, Eileen T. 
Nickerson, Elizabeth C. Pennick, and Gertrude J. 
Williams. 
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TABLE 1 
FINAL CHECKLIST ĪTEMS AND SCORED RESPONSES 
A-1 Says as for instance, “I’ll get even,” “You won’t | H-4 Over-talkative, chatters, keeps talking or inter- 
get away with that,” “TIl show him,” expresses rupting conversation Y ) 
desire for revenge (Y) H-5 Sings or hums continually (to the expressed 
A-2 Fights (Y) annoyance of others) (Y) 
A-3 Says “Others are to blame” for own actions (Y) | H-6 Becomes more active or more talkative in groups, 
A-4 Selfish (Y) becomes noisier and more excited than usual 
A-5 Unscrupulously takes advantage of others (Y) when he is in a group (Y) 
A-6 Hurts animals (Y) H-7 Speaks rapidly, words “come tumbling out 
A-7 Makes statements contrary to fact (lying, telling fast” (Y) 
untruths) (Y) H-8 Becomes “jittery,” building up tension, becomes 
A-8 Screams, bangs objects when denied something, wound up (Y) 
has temper tantrums (Y) H-9 Is said to be distractable, turns away quickly 
A-9 Pulls other children’s hair, punches, steps on from what he is doing when something else 
toes, etc., annoys children (Y) moves, when someone speaks, or other sounds are 
A-10 Steals (Y) made (Y) 
A-11 Destroys or defaces property (Y) H-10 Falls, cuts, bruises, injures self, has many acci- 
A-12 Plays with matches (Y) dents (Y) 
A-13 Hurts other children (pinches, hits, kicks or | Sl-1 Cries out in sleep (Y) 
other destructive acts) (Y) Sl-2 Talks in sleep (Y) 
A-14 Teases other children (Y) Sl-3 Complains of bad dreams (Y) 
A-15 Hits smaller children, “pickson” weaker or | Sl-4 Tosses and turns in sleep, rolls, gets up often at 
smaller children (Y) night, etc. (poor or restless sleeping) (Y) 
A-16 Screams more than others (Y) SI-5 Talks about or complains of nightmares about 
A-17 Threatens to kill someone (Y) past serious events (divorce, automobile acci- 
A-18 Swears or curses (uses “Hell,” “God damn” or dent, fire, loss of loved one, or other “crisis” 
other four-letter words) (Y) events) (Y) 
A-19 Does not answer when spoken to, pouts, looks | SI-6 Walks in sleep (Y) 
mean or sullen (Y) SI-7 Sleeps well, awakes very few times at night (a 
I-1 Prefers to be with children younger than him- “good sleeper”) (N) 
self (Y) SI-8 Has difficulty going to sleep (Y) 
I-2 Is apathetic or underactive (Y) Sl-9 Irregular bed time (Y) 
T-3 Does not try new situations, “hangs back,” is | Ps-1 Becomes so upset by changes in routine such as 
considered by others as fearful or shy (Y) changing residences or schools or when expecting 
T-4 Does not perform before group, refuses to speak visitors that the child may vomit or report 
before class when requested, does not volunteer bodily aches, headaches, stomach aches, or feel- 
to speak or act before group or class (Y) ings of nausea (Y) 
I-5 Withdraws, remains quiet, does not talk back | Ps-2 Worries a great deal, is said to be a worrier, ex- 
when others shove, hit, accuse, or criticize him presses worry or concern about bad grades, 
(does not “stand up for self”) (Y) health, etc. (Y) 
1-6 Speaks with weak voice, in a monotone, voice | Ps-3 Clings to mother (stays close to mother, hangs 
trails off” at ends of sentences, or speaks in a onto dress or hand) (Y) 
weak, high-pitched voice (Y) Ps-4 Complains of pains in hi 
I-7 Stays largely in room or house (Y) Ps-5 p tee. aoe? a: ot ®) N 
I-8 Says “I’m tired,” “I want to fest tete. (others s-5 Complains of pains in limbs or back (muscle 
say that he tires easily or rests often) (Y) eh and pains) (Y) 
1-9 Cries easily (Y) Ps-6 Vomits when things “do not go his way,” when 
I-10 Is sensitive (Y) he shows signs of anger (red face, raised voice, 
I-11 Has few close friendships (Y) etc.), when he says he is worried, or when he feels 
I-12 Is seclusive, prefers to be by himself (yy sad or is emotionally upset (Y) 
I-13 Does not participate in group activities, stays in Ps-7 Cries at separation from mother (on going to 
background (said to be retiring) (Y) school, camp, etc.) (Y) 
ie a shy or timid (¥) Ps-8 Has physical complaints (Y) 
-1 1 Moves constantly, “gets k into everything,” | So-1 Expresses or shows concern over the misfortunes 
Swarms all over,” is overactive (Y) of others (e. ats should ks questions 
H-2 Jumps from one activity to next, di 3 Hi 8o Pi Shoulder, asks qi 2 
asks (others sa? he haz t, does not finish about troubles, says “you feel unhappy, don 
v 'y he has a short attention span) ae ne (Y) 
H3 j AAN 0-4 Expresses appreciation for others’ acts (Y) 
ey falls easily, throws clumsily, is awk- | So-3 Expresses delight over the happiness of others 
(eg., claps hands, says “that’s good!””) (Y) 
Sociability, > 7 Aseression, 1 = Inhibition, H = Activity Level, SI =Sleep Disturbance, 


Ps = Somatization, and So = 
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TABLE 1— (Continued) 


So-4 
So-5 
So-6 


Expresses delight in beauty (Y) 

Discusses own problems with others (Y) - 

Says “I’m sorry,” “Won't you forgive me?” 
more than others do (expresses great remorse, 
apologizes repeatedly, cries after hurting or tell- 
ing untruths or destroying property) (Y) 


So-7 Talks easily with adults, initiates activities 


or conversation with adults other than parents 
(¥) 

Asks to be held or hugged, climbs into lap, etc. 
(seeks physical expressions of affection) (Y) 
Sought out by others, others say they like him; 
among first selected for teams, etc. (Y) 

So-10 Requests praise or approval (Y) 


So-8 


So-9 


Each new dimension was rescored, and the 
point-biserial correlation was calculated be- 
tween each item and each of the total scores.* 

The means and standard deviations of 


4Tables giving (a) the point-biserial correlations 
between each checklist item and total scores on 
each behavior dimension and (b) the percentage of 
mother-father agreement on each checklist item have 
been deposited with the National Auxiliary Publica- 
tions Service. Order Document No. 00660 from the 
National Auxiliary Publication Service of the Amer- 
ican Society for Information Science c/o CCM 
Information Sciences Inc., 909 Third Avenue, New 
York, New York 10022. Remit in advance $3.00 for 
photocopies or $1.00 for microfiche and make checks 
payable to: Research and Microfilm Publications, Inc. 


scores on each of these six clusters or behavior 
dimensions were calculated for the total group 
and for the boys at each age from 5 to 16 
years inclusive. These values are presented 
in Table 2. 

Dimensions of behavior such as these may 
be reasonably expected to meet at least three 
requirements if they are to warrant further 
study: (a) the specific behavior items and 
the factor scores should be reasonably reliable, 
(b) the dimensions should be relatively in- 
dependent, and (c) the several dimensions 
should be differentially present and measur- 
able in clinically different groups of children. 


TABLE 2 


MEANS AND STANDARD DEVIATIONS FOR CHECKLIST DIMENSIONS 
av Eactt AGE AND FOR ToraL Group oF Boys 


Age 
Total 
5 6 7 8 9 10:| etd 12 | 13 | 14 | 15 | 16 
Dimension 
N 
8 26 | 47 | 57 | 69 | 48 | 38 | 41 | 25 | 24 | 14 7 | 404 
Aggression 
x 3.12 | 5.38 | 5.53 | 6.05 | 5.58 | 6.40 | 6.60 | 5.32 | 6.84 | 4.79 | 5.78 | 2.57 | 5.74 
SD 3.22 | 3.96 | 4.09 | 4.39 | 3.75 | 5.62 | 4.62 | 4.37 | 4.86 | 4.99 | 3.94 | 1.13 | 4.43 
Inhibition 
x 2.62 | 3.15 | 4.26 | 3.79 | 3.77 | 4.02 | 4.79 | 4.58 | 5.28 | 4.58 | 5.50 | 3.00 | 4.16 
SD 2.13 | 2.22 | 2.93 | 2.66 | 2.47 | 2.42 | 3.53 | 3.01 | 3.91 | 2.75 | 3.30 | 2.24 | 2.85 
Activity Level 
x 2.88 | 4.42 | 4.83 | 4.53 | 4.65 | 4.71 | 4.53 | 4.49 | 3.44 | 3.54 | 2.43 | 1.71 | 4.31 
SD 1.56 | 2.74 | 2.92 | 2.64 | 2.50 | 2.57 | 2.77 | 2.81 | 2.08 | 2.30 | 2.10 | 1.80 | 2.64 
Sleep Disturbance 
x 1.25 | 1.85 | 2.47 | 1.67 | 1.70 | 2.19 | 2.45 | 2.83 | 2.16 | 1.79 | 1.78 | 2.43 | 2.08 
SD 1.28 | 2.13 | 2.61 | 1.98 | 2.12 | 1.98 | 2.24 | 2.05 | 2.10 | 1.67 | 1.80 | 2.15 | 2.12 
Somatization 
1.00 | 1.00 | 1.49 | 1.04 | 1.09 | 1.98 | 1.29 | 1.58 | 1.88 | 1.25 | 1.64 | .71 | 1.37 
SD 1.07 | 1.41 | 1.68 | 1.31 | 1.44 | 1.88 | 1.39 | 1.72 | 1.51 | 1.33 | 1.69 | .76 | 1.55 
Sociability 
x 6.00 | 4.88 | 5.08 | 5.10 | 4.84 | 4.88 | 3.60 | 4.78 | 3.92 | 3.88 | 3.50 | 4.00 | 4.64 
SD 1.93 | 2.55 | 2.12 | 2.64 | 2.41 | 2.40 | 2.21 | 2.36 | 2.29 | 2.21 | 2.93 | 1.15 | 2.41 
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TABLE 3 
Opp-EveN RELIABILITY OF SCORES ON 
CHECKLIST DIMENSIONS 
Spearman- 
Dimension Pearson r Brown 

correction 
Aggression 76 86 
Inhibition 62 -16 
Activity Level 69 82 
Sleep Disturbance 62 ih 
Somatization 152 -68 
Sociability -50 67 


The published research has dealt with these 
issues in several respects, and data relating 
to each of these questions for the final form 
of the Missouri Children’s Behavior Checklist 
are reported below. 

Reliability 

Internal consistency. Scores on odd-even 
halves of each of the six behavior dimensions 
were correlated for evidence of one type of 
reliability. 

Interjudge agreement. The authors were 
able to collect behavior checklists from both 
parents of 47 children. The agreement be- 
tween mothers’ and fathers’ ratings on in- 
dividual items ranged from 53.2% to 93.6%. 
The average agreement for all the items in- 
cluded in each dimension or scale of the 


checklist ranged from 68.9% to 93.2% (see 
Footnote 4). 


Interdimension Correlations 


The raw total scores on each of the di- 
mensions were intercorrelated using the data 


TABLE 4 


INTERCORRELATIONS BETWEEN TOTAL Scores 
on CHECKLIST Dimensions 


Dimension 1 | 2 | 3 | 4 | 5 | 6 
1. Aggression 19% oe 
2. Inhibition ? OF aie A E 
3. Activity Level ige yee -1e 
4. Sleep Distur- q $ A 
ance 404 
5. Somatization — j ott 
6. Sociability ce 
* 
b <05, 
**b <.01. 
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from the 404 boys. The intercorrelation 
coefficients are presented in Table 4. 


Between-Group Differences 


In order to determine whether scores on 
these several behavior dimensions were con- 
gruent with generally accepted clinical ex- 
perience, the means and standard deviations 
of checklist scale scores for 24 boys seen in 
a university child psychiatry clinic were com- 
pared with checklist scores for a group of 24 
nonreferred boys who had been tested ex- 
tensively for another study of “normal” 
children, 

The ages of the boys in each of these 
groups ranged from 6 through 14 years with a 
mean of 9.88 years and a standard deviation 
of 2.21 years. The IQ scores ranged from 
80 to 134 in the nonclinic group, with a mean 
of 104.04 and a standard deviation of 11.87. 
The IQ scores in the clinic group ranged from 
80 to 137, with a mean of 103.67 and a stan- 
dard deviation of 11.95. 

The distributions of scores on each of the 
checklist factors are presented in Table 5, 


TABLE 5 


Comparisons OF CHECKLIST SCORES FoR Norma (N) 
AND Cuinic (C) Boys MATCHED FOR AGE AND IQ 


Dimension Group| X | SD| t b 
N 3.17 | 2.55 
Aggression 4.40 | .002 
G 8.67 | 4.85 
N | 2,04 | 2.80 
Inhibition 2.11 | .05 
Cc 3.96 | 3.14 
È N 2.46 | 2.02 
Activity Level 3.74 | .002 
(3 5.38 | 2.72 
N 79 | 1.77 
Sleep Disturbance 1.64 | ns 
C 1.88 | 2.21 
à N 75 | .99 
Somatization 1.76 | ns 
(> 1.29 | 1.37 
thors N | 5.88 | 2.11 
Sociability 4.09 | .002 
Cc 3.54 | 2.00 


f 
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along with the mean factor or scale scores. 
There are significant differences between these 
two groups of boys on checklist scales of 
Aggression, Inhibition, Activity Level, and 
Sociability. 


Discussion 


The point-biserial correlations between each 
item and each total scale score indicated that 
each of the checklist items relates relatively 
exclusively to only one of the six dimensions 
of behavior. The dimensions themselves are 
independent for all practical purposes; al- 
though a number of the scale intercorrelations 
(Table 4) are significant beyond the .01 
level, the largest correlation (.43 between 
Aggression and Activity Level) indicates that 
the two dimensions in question share only 
about 18% of their variances. 

Reliability is at a reasonable level for a 
behavior rating scale. The odd-even item cor- 
relations (Table 3) indicate an adequate 
degree of internal consistency. In view of the 
probable fact that fathers and mothers gen- 
erally see different samples of their children’s 
behavior, a high degree of parent agreement 
on the checklist is not crucial for reliability 
considerations. Nonetheless, the data showed 
moderately high agreement between parents 
and were consistent with the degree of agree- 
ment reported by other investigators (Dreger 
et al., 1964; Peterson, 1961; Ross et al., 
1965; Quay & Quay, 1965). Although no data 
on interscorer reliability is reported here, it is 
the authors’ experience that high scoring reli- 
ability can be regularly achieved. 

The distributions of checklist scores in the 
clinic and nonreferred groups of boys and the 
significant differences between the two groups 
in mean scores on Aggression, Inhibition, 
Activity Level, and Sociability (Table 5) 
support the notion that several of the clinically 
obvious differences between these two groups 
are reflected in their checklist results. It is 
possible, of course, that such differences in 
checklist scores may result from factors such 
as parental attitudes rather than (or in addi- 
tion to) intrinsic behavioral differences be- 
tween the children. Had no differences ap- 
peared, however, we would have much less 
Confidence in the concurrent validity of the 
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Missouri Children’s Behavior Checklist so 
far as distinguishing among children is con- 
cerned. (Research relating parent variables 
to checklist scores is in progress.) 

The lack of statistically significant differ- 
ences between the two groups of boys on the 
Sleep Disturbance and the Somatization scales 
may be due to one or both of two major 
factors. First, the very skewed score dis- 
tributions on both of these scales reduce the 
statistical power of parametric analysis. Sec- 
ond, it is quite likely that children whose 
primary complaints involve disturbances of 
sleep pattern or numerous somatic symptoms 
are not referred to a child psychiatry clinic, 
and this selective referral eliminates such 
children from our psychiatric sample. 

Since the distributions of scores on Sleep 
Disturbance and Somatization for the two 
groups of boys do not justify the use of 
parametric statistics, the Kolmogorov-Smirnov 
two-sample test was also applied and differ- 
ences again were found to be insignificant. 
Thus, the dimensions of Sleep Disturbance 
and Somatization either are not capable of 
making useful distinctions among children or 
the selective referral issue may be highly 
relevant. Significant differences on these two 
scales between unselected boys and boys seen 
in a pediatric clinic would support the selec- 
tive referral hypothesis. 

As has been noted (Dawes, 1962; Sines, 
1964), the validity and certainly the utility of 
an assessment device cannot be established or 
evaluated by demonstrating differences on the 
diagnostic instrument between clinically dif- 
ferent groups of Ss. Even so, the existence of 
such differences on the checklist is gratifying. 
If, however, it is to be useful as anything 
more than a descriptive measure, research 
must be done using the strategy recently re- 
ported by Kulik, Stein, and Sarbin (1968), 
in which the behavioral and demographic cor- 
relates of patterns of checklist scores were 
determined. Whether patterns of scores on the 
several dimensions reported here will be 
valuable as predictors of extrachecklist at- 
tributes remains to be explored. 

The information now available does sug- 
gest, however, that the use of this checklist 
by mothers does provide technically accept- 
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able, specific information concerning several 
clinically relevant dimensions of their chil- 
dren’s behavior. 
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It was hypothesized that participants in an in-hospital work-therapy pro- 
gram require a shorter period of both intensive and total psychiatric treat- 
ment and fewer readmissions to all psychiatric facilities than do nonparticipants, 
The Ss were Fort Logan Mental Health Center patients randomly assigned to 
the participant and nonparticipant groups at admission. Contrary to the hy- 
potheses, the results indicated that participants had significantly longer stays in 
both intensive treatment and total hospitalization than did nonparticipants. 
Although there was no difference between the groups with respect to readmis- 
sion to all psychiatric facilities, participants had significantly more readmissions 
to Fort Logan. It was suggested that work-therapy programs be established 
in the community in order to avoid contributing to the hospital dependency 


of participants. 


Proponents of work therapy have suggested 
that meaningful paid work, deliberately pro- 
vided for the therapeutic benefit of the psy- 
chiatric patient and integrated with the 
other therapeutic activities, may speed the 
recovery of acute cases and return long-term 
patients to usefulness. By communicating 
more realistic expectations of the require- 
ments and satisfactions of a job, work therapy 
lessens the fear of leaving the security of the 
institution and strengthens the patients in the 
community. 

The growing interest in and prevalence of 
work-therapy programs has been well doc- 
umented with enthusiastic speculation about 
the potential for rehabilitation of psychiatric 
patients with various symptomatologies and 
levels of dysfunction (Denber, 1960; Jones, 
1960). Nevertheless, while some efforts have 
been directed toward evaluations of the 
changes in attitudes, behaviors, and outcomes 


1This study was done in conjunction with a 
larger project supported in part by Grant MH 
0137-04 from the National Institute of Mental 
Health, Some of the data were also supplied by the 
Fort Logan Record System, supported in part by 
National Institute of Mental Health Grant 5-R11- 
MH-00931-06, The authors wish to express their 
appreciation to Joanne Fults for her assistance in 
the manuscript preparation. 

2 Requests for reprints should be sent to Margaret 
S. Barbee, Fort Logan Mental Health Center, 3520 
West Oxford Center, Denver, Colorado 80236. 


which obtain in patients as a result of par- 
ticipation in work-therapy programs (Hart- 
lage & Hartlage, 1967; Pile & Swanson, 1968; 
Thompson, 1961), there is little indication of 
basic empirical research into these programs’ 
efficiency in expediting the return of par- 
ticipants to their communities and reducing 
their rehospitalization. 

This study was undertaken in order to 
demonstrate the relationship of work-therapy 
participation to treatment-outcome variables 
of Fort Logan Mental Health Center patients. 
Specifically, it was hypothesized that work- 
therapy participation would reduce (a) the 
length of intensive institutional treatment, 
(b) the length of total hospital care, (c) the 
rate of readmission to any psychiatric in- 
stitution, and (d) the rate of readmission to 
Fort Logan. 

The Fort Logan work-therapy program 
operates independently in a small industrial 
workshop on the hospital grounds. The pro- 
gram staff members not only direct patient 
groups in their work, but they also contract 
with local industries for the light assembling 
and packaging work to be done in the shop. 
All materials and supplies are provided by 
the employer firms, who also deliver the 
raw materials and pick up the finished pro- 
ducts. Groups of patients work for 2-hour 
periods, generally two or three times per week, 
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and are paid wages from the fees charged to 
the employers. The rate of pay is based on 
the group piecework system, that is, each 
individual shares equally in the earnings of his 
work, regardless of his contribution to its 
success. 


METHOD 


The research sample was randomly drawn from 
patients admitted to Fort Logan’s Adult Psychiatric 
Division and randomly divided into two groups, 
those who were assigned to the work-therapy pro- 
gram (participants) and those who were not selected 
to attend the workshop (nonparticipants). The 
sampling procedure stipulated that all Ss must have 
remained in the hospital for a minimum of 20 days 
and that the participants must have attended the 
workshop for at least 10 days, while the nonpar- 
ticipants must not have attended the workshop at all. 
The Ss who failed to meet these criteria were 
deleted from the sample, yielding 110 acceptable 
participant Ss and 149 nonparticipant Ss at the 
initial phase of the research. Because this refinement 
and subsequent sample attrition might have 
destroyed the randomization of the two groups, 
statistical tests were performed to establish whether 
or not there were significant demographic and/or 
psychiatric differences between participants and non- 
participants. The groups were not found to differ 
significantly in terms of age, sex, level of education, 
marital status, diagnosis, or severity of diagnosis, 
either initially or at any of the three follow-up 
phases, Follow-up interviews were conducted at 3, 
12, and 24 months after the patient’s first significant 
upward transfer. A significant upward transfer is 
contingent on the decision of the respective patients’ 
therapeutic team, which excludes the workshop 
staff, and results in a move from a more intensive 
treatment modality to a less intensive modality, 
for example, from inpatient status to outpatient 
status or from day care to a halfway house. 

Two years after discharge constituted the time 
period for determining whether readmission occurred 
for an S. Sample attrition during this period reduced 
the participant N to 73 and the nonparticipant V 
to 84 for the readmission analyses. Speculation con- 
cerning the nonparticipants’ greater attrition rate. 
that is, a loss of 44% from in-hospital to readmission 


TABLE 1 


CONTINGENCY TABLE FOR LENGTH OF INTENSIVE 
TREATMENT MEDIAN TEST 


Fewer than | More than 
Group 103 days | 103 days | Total 
Participants 42 68 
Nonparticipants 87 62 T 
Total 129 130 259 


Note—x? = 9.54, df = 1. 
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TABLE 2 


CONTINGENCY TABLE FOR LENGTH OF TOTAL 
HOSPITALIZATION MEDIAN TEST 


Fewer than | More than 


Group 166 days | 166days | Total 
Participants 41 69 110 
Nonparticipants 89 60 149 

Total 130 129 259 


Note.—x? = 11.89, df = 1. 


analyses, as compared to the participants’ 34% 
attrition, is presented at a later point in this paper. 

It should be noted that the follow-up interviews 
were primarily used as the data sources for the 
above-mentioned control analyses and for the col- 
lection of information about readmissions to facili- 
ties other than Fort Logan. These interviews, how- 
ever, were not the data sources used in the testing of 
the remaining three hypotheses. Data concerning 
the length of initial intensive care stay, total length 
of hospital treatment, and readmission to Fort Logan 
after discharge were obtained from the Fort Logan 
Record System. 


` 


RESULTS 


As the distributions of total days of inten- 
sive treatments and total length of stay were 
extremely skewed toward the upper range of 
the distribution, it was necessary to use the 
median test for the first two hypotheses. The 
results of the analysis involving length of in- 
tensive treatment, shown in Table 1, yielded 
a median of 103 days for the combined par- 
ticipant and nonparticipant groups. Thirty- 
eight percent of the participants and 58% 
of the nonparticipants had fewer than the 
median number of days of intensive care, 
indicating that work therapy participants re- 
quire more intensive treatment than do non- 
participants. This difference is statistically 
significant, but contrary to the hypothesis. 

For the hypothesis involving total length 
of stay, the results, shown in Table 2, yielded 
a similar significant reversal from the predic- 
tion, as 37% of the participants and 60% 
of the nonparticipants were treated in some 
capacity for less than the median 166 days- 

The third hypothesis compared participants 
and nonparticipants on the basis of number 
of readmissions to any psychiatric facility 
after discharge from Fort Logan. Fifty-two 
percent of the participants and 40% of the 
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nonparticipants were readmitted. This dif- 
ference is not significant. Table 3 summarizes 
the chi-square distribution. 

When readmissions specifically to Fort 
Logan are evaluated, the difference is statis- 
tically significant, as 46% and 23% of the 
participants and nonparticipants, respectively, 
were readmitted to Fort Logan. Table 4 gives 
these results. This finding is also contrary to 
the hypothesis. 


Discussion 


The results of the above analyses failed to 
support any of the hypotheses and, in fact, 
strongly suggest that rather than facilitating 
the return of the patient to the community, 
work therapy may have inhibited the transi- 
tion. Briefly summarizing the statistically 
significant findings, it was learned that pa- 
tients who participated in work therapy both 
remained in intensive treatment and main- 
tained their associations with the hospital 
through outpatient and low-intensity services 
for a significantly longer period of time than 
did patients who were nonparticipants. Par- 
ticipants also returned to Fort Logan in sig- 
nificantly greater numbers than did nonpar- 
ticipants during the 24 months of the follow- 
up period, while the differential readmission 
tate to all psychiatric facilities was not sig- 
nificant. The final two results strongly sug- 
gest that work-therapy participation fosters 
institutional dependency, but the absence of 
complete data on possible readmissions to 
other facilities makes this conclusion some- 
what tentative. However, if one follows this 
conclusion, it is possible that the nonpartici- 
pants may have obtained postdischarge care 
from other psychiatric facilities from which 


TABLE 3 
CONTINGENCY TABLE FOR TOTAL READMISSIONS 


Not 


Group Readmitted ted Total 
Participants 38 35 73 
Nonparticipants 34 50 84 

Total 72 85 157 


Note.—x? = 1.66, df = 1. 
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TABLE 4 
CONTINGENCY TABLE FOR READMISSIONS 
TO Fort LOGAN 
; Not 

Group Readmitted | sadmitted Total 
Participants 30 35 65 
Nonparticipants 19 50 69 
Total 49 85 134 


Note.—x? = 4.23, df = 1. 


data retrieval is less efficient, thus accounting 
for the greater data attrition in this group. 

The interpretation of these results re- 
quires a consideration of Fort Logan’s goal 
for the work-therapy program. This program 
was instituted specifically to eliminate the trend 
toward continued institutionalization which 
is frequently seen in psychiatric patients who 
are hospitalized. However, closer examination 
of the Fort Logan program reveals some ele- 
ments which may have created rather than 
lessened attachment to hospital life. First, 
for many patients, a workshop job not only 
offered the most regular and well-paid em- 
ployment which they had ever obtained, but 
it also met their vocational needs very ade- 
quately. Second, the shop supervisors were 
supportive rather than demanding, the hours 
were short, and most behavior problems were 
well tolerated, in marked contrast to a com- 
petitive employment situation. Third, free 
socializing among patients was encouraged, 
and these new friends may have tended to re- 
place the influence of family and friends in 
the community. In summary, the provision of 
inhospital vocational satisfactions, support 
from supervisors for low-level functioning, 
and social contacts exclusively among other 
patients would not seem to be a positive 
influence in the return of the patient to his 
community, where memories of past social 
and vocational failures are many . 
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Prior research with the “A-B” scale has shown intriguing similarities between 
practicing therapists and undergraduates varying in A-B status. On that basis, 
12 scales from the Omnibus Personality Inventory and the four component 
scores of the American College Testing Program battery were related to A-B 
status of undergraduates. A stepwise discriminant analysis showed that the 
present set of measures significantly discriminated male As from ABs (middles) 
and Bs, and that the differences between As and Bs involved largely the joint 
effects of masculinity-femininity, verbal aptitude, and natural science aptitude 
measures. As and Bs also appeared to differ in a way consistent with previous 
research on “cognitive style.” Correlational and factor-analytic data indicated, 
however, that the A-B variable cannot be regarded as linearly related to those 
measures which discriminated As from Bs. A quasi-typological interpretation of 
the A-B variable was proposed and methodological implications were drawn. 


The manner in which therapists respond to 
items concerned with manual, technical, or 
mechanical activities on the A-B scale, a short 
scale derived from the male form of the Strong 
Vocational Interest Blank (SVIB), apparently 
relates to differential psychotherapeutic results 
with certain nosological groups. A series of 
studies by Whitehorn and Betz (reviewed by 
Betz, 1967) has shown that therapists who 
disavow such interests (“As”) have higher 
improvement rates with schizophrenics than 
those who endorse these interests (“Bs”). 
The B-type therapists, conversely, achieve 
significantly better outcomes than As with 
neurotic outpatients (McNair, Callahan, & 
Lorr, 1962). 

In recent years, no clinical studies com- 
parable in scope to the early studies of 
Whitehorn and Betz and McNair and as- 
sociates have been conducted. However, re- 
search with the A-B variable has been brought 
into the laboratory, and a series of therapy 
analogue studies (reviewed by Carson, 1967) 
has been conducted. Interestingly, most of 
these studies have yielded results consistent 
with the original studies in that, for as yet 
poorly understood reasons, untrained male Ss, 

1 The authors are deeply indebted to Harriet Rose, 
University of Kentucky Counseling and Testing 
Service, and Thomas Greenland, for their coopera- 
tion in the administration of the test battery. 

2 Requests for reprints should be sent to James 
E. Dublin, who is now at St. Albans Psychiatric 
Hospital, Radford, Virginia 24141. 


when asked to interact “helpfully” with simu- 
lated patients (e.g., tape-recorded schizoid 
and neurotic communications), show perform- 
ance differences consistent with those of prac- 
ticing therapists. In a variety of analogue 
situations, A-type Ss have responded “better” 
than Bs to schizoid, disaffiliative, extrapunitive 
“others,” while Bs have responded more posi- 
tively than As to neurotic, trusting, intro- 
punitive “others” (Berzins & Seidman, 1968; 
Carson, Harden, & Shows, 1964; Sandler, 
1965). Such similarities between therapists 
and untrained Ss strongly suggest that the 
A-B scale measures a number of basic di- 
mensions of personality, 

The empirical derivation of the scale 
(initially 23 items of the male form of the 
SVIB discriminated As from Bs) renders its 
conceptualization difficult, and to date the 
search for correlates of the A-B variable has 
yielded suggestive but limited information. 
Whitehorn and Betz initially observed differ- 
ences in “clinical style” between As and Bs, 
and speculated that the interest patterns sug- 
gested by scale items may have a direct bear- 
ing on therapists’ attitudes toward and be- 
havior with patients (Betz, 1967). McNair 
et al. (1962) favored an explanation in terms 
of socioeconomic and/or social class differ- 
ences between their samples and those of 
Whitehorn-Betz as these might interact with 
therapists’ interest patterns. More recently, 
Lorr and McNair (1966) observed that since 
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A-B scale items correlated significantly with 
the masculinity-femininity (MF) scale of the 
SVIB, sex-role identification variables might 
underlie the differential performances of As 
and Bs, the former seemingly being less 
masculine than the latter. Still more recently, 
Campbell, Stevens, Uhlenhuth, & Johansson 
(1968) fitted A and B doctors into a hierarchy 
of occupations according to their mean score on 
a new Strong-type A-B scale. They found that 
the A doctor, author-journalist, lawyer, artist, 
and librarian, in that order, represented scores 
at one extreme (the A end of their standard 
scale, the A doctor being most extreme), while 
(again from most to least extreme) carpenter, 
pilot, veterinarian, math-science teacher, B 
doctor, and engineer were at the other extreme. 
Asserting that their data “clearly support 
the original Whitehorn-Betz interpretation,” 
Campbell et al. also presented evidence that 
with respect to A-B scale scores, medical 
students do not change appreciably during 
the 4 years of medical school, and suggested 
that this pattern of scores reflects “an intel- 
lectual-verbal versus practical-mechanical di- 
mension.” As Carson (1967) points out, the 
content of the scale defies conceptualization 
in terms seemingly relevant to psychotherapy 
(he reports a factor analysis of the items of 
the scale which showed only that As are un- 
interested in manual and mechanical activity 
while Bs show engineering interests). Thus, 
in summary, the speculations of the above 
investigators have not received direct empirical 
test. 

The absence of a large-scale attempt sim- 
ultaneously to investigate a variety of possible 
personality and aptitudinal correlates of the 
A-B variable gave rise to the present study. 
Since previous research has shown intriguing 
continuities between practicing therapists’ 
performances with patients and untrained Ss’ 
ere with simulated patients, the 
use of a large sample of under; 
seemed justified. i fee 
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Subjects, The Ss were drawn from all incomin, 
freshman students who, as Part of the ongoing 5 
stitutional research program at the University of 
Kentucky, volunteered to take a battery of tests. 
Over males and almost as many females 
completed the measures pertinent to this study. 
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Measures. Designed specifically for research with 
college students (Center for the Study of Higher 
Education, 1962), the Omnibus Personality Inventory 
(OPI) was a logical choice as the basic instrument 
for measuring a number of personality dimensions. 
The OPI has typically yielded five rotated factors 
which account for about 80% of the total variance 
(e.g, Elton & Rose, 1967). However, in the present 
study, because the authors’ interest was unlike that 
of researchers into problems of academic achieve- 
ment, the relationship of Ss’ A-B status to discrete 
OPI scales (rather than OPI factors) was examined, 

The local version of the OPI (Form C), which ex- 
cludes four scales because of item overlap, retains the 
following 12 scales: Thinking Introversion (TI), 
Theoretical Orientation (TO), Estheticism (Es), Cog- 
nitive Complexity (Co), Autonomy (Au), Impulse 
Expression (IE), Schizoid Functioning (SF), Social 
Tntroversion (SI), Religious Liberalism (RL), Mas- 
culinity-Femininity (MF), Lack of Anxiety (LA, 
derived from the Taylor MA scale), and Yea-Nay- 
sayer (Y-N, derived from the Couch-Keniston 
scale) 3 

For measures of scholastic aptitude, the American 
College Test (ACT) Battery was employed. Its four 
component scales are Verbal (V), Numerical (N), 
Social Science (SS), and Natural Science (NS). The 
above 16 scales constituted the total set of variables 
to which the A-B scale scores of Ss were related. 

The version of the A-B scale used here comprised 
19 items (14 from the SVIB and 5 from the MMPI), 
This scale is derived from item-analytic work by 
Schiffman, Carson, and Falkenberg* on the 31-item 
Kemp (1966) modification of the items used in the 
original Whitehorn-Betz study. Thirteen of the items 
used in this study are identical to those published 
by Lorr and McNair (1966) in a 15-item scale 
which showed an internal-consistency coefficient of 
91. The present scale was scored in the “B” direction 
(high scores = B status), 

Selection of groups. Although prior research with 
the A-B variable has been limited to male Ss, to 
learn whether the implications of prior research 
might be extended to females in future studies, ex- 
treme and middle groups were drawn from the dis- 
tribution of A-B scores for both sexes. For males, 
the mean A-B score was 9.13, SD =3.57. Cutting 
scores of 5 and 13, one standard deviation to either 
side of the mean, yielded 98 As and 99 Bs, and 99 
Ss whose scores fell between the cutting scores were 
randomly drawn to comprise the middle (AB) group. 
For females, the mean A-B score was 5.92, SD = 3.39. 
Similarly derived cutting scores (2 and 10) yielded 
98 As and 73 Bs, and 99 ABs were selected as for 
male Ss, 

Analyses, The basic analysis of group differences 
on the 16 experimental scales consisted of a step- 


*For full details of the meaning of each of the 
OPI scales and factors, see the OPI Manual. De- 
scriptions of the scales are given only where pertinent 
to the text. 

*Personal communication from Robert C. Carson 
to the third author, October 1967. 
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TABLE 1 
OPI Raw Scores anp ACT SCORES ror MALE AND FEMALE As, ABs, anD Bs 
Male Female 
Variable A AB B A AB B 

M SD M SD M SD M SD M SD M SD 
TI 30.66 9.66 | 30.20 9.72 | 28.87 8.96 | 27.72 7.86 | 29.08 | 9.56 | 30.85 | 10.29 
TO 15.76 4.22 | 17.26 5.09 | 17.11 4.18 | 12.99 4.26 | 14.65 | 4.38 | 16.63 4.77 
Es 9.63 5.31 | 10.34 5.10 | 10.45 4.41 | 11.48 4.45 | 11.74 | 4.46 | 13.60 4.52 
Co 11.74 4.53 | 12.71 4.96 | 13.05 4.20 | 10.38 3.82 | 11.29 | 4.26 | 12.44 4.62 
Au 21.37 8.74 | 21.27 6.84 | 19.76 6.44 | 18.94 6.20 | 20.17 | 5.81 | 19.01 5.82 
IE 35.79 9.64 | 37.46 | 10.87 | 37.43 | 10.32 | 30.83 9.62 | 31.41 | 10.41 | 32.60 | 10.41 
SF 36.90 | 11.82 | 35.44 | 12.56 | 35.02 | 15.33 | 37.34 | 13.36 | 37.09 | 14.35 | 34,50 | 14.40 
SI 24.53 8.48 | 23.55 7.82 | 24.83 8.21 | 23.12 | 10.48 | 22.93 8.79 | 22.20 8.13 
RL 12.99 5.02 | 13,09 5.43 | 13.10 5.65 | 11.40 4.11 | 12.20 | 4.14 | 11.26 4.65 
MF 52.04 7.93 | 54.98 9.64 | 55.23 8.76 | 44.23 9.85 | 45.45 | 8.41 | 47.31 8.70 
LA 10.12 4.06 | 11.11 4.56 | 11.65 4,38 9.71 4.45 | 10.67 | 4.33 | 10.26 431 
Y-N 9.46 3.20 9.72 3.63 9.76 3.64 | 9.81 3.12 9.53 | 3.33 9.45 3.39 
v 21.50 | 3.41 | 20.93 | 3.68 | 19.98 | 4.38 | 22.09 | 3.62 | 22.39 | 3.47 | 21.38 | 4.10 
N 24.89 5.14 | 24.53 5.70 | 24.57 6.22 | 21.03 5.50 | 22.83 | 5.84 | 22.14 5.47 
SS 24.05 | 5.03 | 23.30 | 4.86 | 23.17 | 5.45 | 21.98 | 5.32 | 23.20 | 4.87 | 23.16 | 5.42 
NS 25.20 4.34 | 25.01 4.69 | 25.69 5.37 1-21.27. 5.11 | 22.32 | 5.48 | 22.38 5.27 

wise discriminant analysis (Computer Program (cf. Ross, 1968), it is apparent that males 


BMDO”M, version of September 1, 1965, developed 
by the Health Sciences Computing Facility, Univer- 
sity of California at Los Angeles) that describes the 
rank order in which and the extent to which a num- 
ber of dependent variables (here, 12 personality and 
4 aptitude scales) contribute, individually and jointly, 
toward a separation of Ss with respect to one or 
more independent variables (here, the three groups 
of scorers on the A-B scale). 

A second analysis of group differences on the 16 
experimental scales (using the same Ss) was a prin- 
cipal axis factor analysis of each criterion group, 
separately for each sex. 

In the factor analysis described above, the number 
of Ss in each criterion group did not meet the usual 
rule-of-thumb for this type of factor analysis (10 
times the number of variables). Moreover, the groups 
used in both of the above-described analyses under- 
represent midrange scorers (ABs). For these reasons, 
smaller random samples of Ss (117 males and 117 
females) were also drawn for correlational purposes. 
Intercorrelations of the measures were then factored 
through a principal axis analysis (with A-B score 
included as the seventeenth variable), rotated to 
the “normal” criterion proposed by Kaiser (1958). 


RESULTS 


Mean scores for male and female Ss on each 
of the 12 OPI and 4 ACT scales are shown 
in Table 1. As might be expected from con- 
sideration of stereotypes in our culture and 
prior OPI studies regarding sex differences 


have consistently higher scores on MF, IE, 
TO, N, and NS, while females score higher on 
Es and V. It appears that for both sexes ABs 
and Bs have progressively higher scores on 
masculinity than As, and that for females the 
same kind of relationship holds for TO. Thus, 
despite the obviously greater variability of the 
middle group (ABs), these mean scores would 
imply a linear relationship between MF and 
A-B status for males and between TO and 
A-B status for females. 

In the stepwise discriminant analysis, the 
first discriminant function, which accounted 
for 83% of the variance, significantly sepa- 
rated the male groups (x? approximation of 
the Wilks-Lambda test = 49.85, df = 32, p 
<.025). Subsequent discriminant functions 
were not significant by this test. Table 2 shows 
the order in which and the extent to 
which each of the 16 variables discrimi- 
nated the three groups of A-B scale scorers 
for both sexes. The individually discriminat- 
ing scales for males (MF, V, NS, and 
Co) and females (TO and Co) suggest that 
differentiations on the A-B variable are ac- 
complished largely by scales which show large 
differences between sexes, and which, as al- 
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TABLE 2 


ORDER IN WHICH AND EXTENT TO WHICH F VALUES 
FOR 16 VARIABLES DISCRIMINATE As, ABs, AND Bs 


Males Females 
Order 
Variable F Variable F 

1 MF 3.99 TO 14.04** 

2 V 3, Au 2.45 

3 NS Sh Es 2.27 

4 Au 1.93 SF 1,84 

5 Co 3.10* Co 2.92* 

6 SF 2.43 V 1.63 

7 IE 1.72 N 2.09 

8 SI 1.47 LA 1.47 

9 TO .93 MF 1.48 
10 TI 1,27 SS 1.07 
11 LA -16 RL .91 
12 Es 52 TE .52 
13 N 10 Y-N 55 
14 Y-N 10 DE .21 
15 RL 10 NS -20 
16 SS .07 SI 12 


Note.—df for males = 2/293; for females = 2/267. 
* p <.05, 
** p < 001, 

ready noted, call to mind sex-role stereotypes 
and sex differences in interests and aptitudes. 
The scale which best discriminates the female 
groups, TO, can in fact be regarded as an MF 
scale; the scale description (“Interest in 
science and scientific activity, and a logical, 
rational, critical approach to problems”) 
clearly suggests masculine pursuits. More- 
over, the mean values for the only other scale 
which individually discriminates the female 
groups (Co, see Table 1), like the means for 
MF, appear to be ordered linearly, with both 
sexes scoring progressively higher on Co as 
one reads across from As to ABs to Bs. The 
description of this scale (in brief, “more lik- 
ing for and tolerance of new, novel, or diverse 
events, uncertainty and ambiguity”) suggests 
a cognitive style. Therefore, it would appear 
that the masculinity-femininity variable, in 
-its several manifestations, forms the dominant 
discriminator of groups drawn from the dis- 
tribution of A-B scale scorers, and it would 
appear secondarily that cognitive style differ- 
ences may be of some importance as they re- 

late to sex-role differences and stereotypes. 
„Since four measures in this battery indi- 
vidually discriminated the male criterion 
groups, a further breakdown of these differ- 
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ences is indicated. Table 3 shows the com- 
parisons of the three male groups on all scales, 
It is evident that with successive steps in the 
discriminant analysis, differences between As 
and Bs carry more weight than A-AB or 
AB-B differences. The significance of dif- 
ferences between As and Bs reaches its maxi- 
mum on the third step, that is, after MF, V, 
and NS have been jointly entered into the 
discriminant function; subsequent steps show 
progressively lower F values (see Note, Table 
3). Recalling that on an absolute basis, male 
As and Bs differ only slightly with respect to 
V and NS scores (Table 1), it now appears 
that in conjunction with MF differences, these 
differences are quite important in differentiat- 
ing extreme As from extreme Bs. It is con- 
sistent with Bs scoring higher on measures of 
masculinity that they also score higher on 
natural science abilities while As outscore 
them on verbal abilities, and that only slight 
differences here discriminate the two extreme 
groups. While Campbell et al. (1968) have 
speculated about the importance of an intel- 


TABLE 3 


MULTIVARIATE F MATRIX SHOWING COMPARISONS 
or A-AB, A-B, AnD B-AB DIFFERENCES FOR 
MALES ON THE 16 VARIABLES 


F value 
Variable df 

A-AB A-B B-AB 
MF 1/293 5.48 6.47 04 
Vv 2/292 3.24 6.92 1.49 
NS 3/291 3.25 7.58 2.21 
Au 4/290 3.17 6.46 1.65 
Co 5/289 2.66 6.44 1.90 
SF 6/288 2.24 6.14 2.09 
IE 7/287 1.93 5.73 2.11 
SI 8/286 2.02 5.26 1.84 
TO 9/285 1.85 4.71 1.85 
TI 10/284 1.67 4.49 1.82 
LA 11/283 1.64 4.17 1.65 
Es 12/282 1.52 3.91 1.54 
N 13/281 1.41 3.59 1.42 
Y-N 14/280 1.30 3.34 1.33 
RL 15/279 1.22 3.11 1.24 
SS 16/278 1.14 2.90 1.17 


Note.—Due to the progressively increasing degrees of free- 
dom in the numerator in a stepwise discriminant analysis, 
several of the A-AB differences are significant at the .05 level, 
and all A-B differences are significant at the .01 level, However 
it is apparent that only the values for the first three scales 
(MF, V, and NS) increase as each step of the discriminant 

lysis is carried out, and that this holds true only for the 
A-B differences. 


CORRELATES OF THE “A-B” THERAPIST SCALE 


lectual-verbal versus practical-mechanical dif- 
ference, here it is demonstrated for the first 
time on conventional measures of aptitude. 

The results of the factor analysis for cri- 
terion groups were generally disappointing, 
but did yield one significant negative finding. 
The first factor, which uniformly accounted 
for about 22% of the variance for both sexes 
and for all criterion groups, was comprised of 
scale loadings in the high-80s and low-90s on 
IE, SF, and Y-N.° The failure of the Couch- 
Keniston measure of acquiescence to dis- 
criminate the A-B scale groups is an impor- 
tant negative result in that the items of the 
A-B scale are keyed largely in the same posi- 
tive direction, that is, to answer in the “B” 
direction almost invariably requires saying 
“yes” or “like.” The absence of systematic 
trends here argues for the relative independ- 
ence of the A-B scores from the acquiescent 
response set. (The third author also has un- 
published data which reveal that A-B scores 
are largely independent of the social desir- 
ability response tendency as measured by the 
Marlowe-Crowne scale.) 

In view of the findings from the discriminant 
analysis already discussed, the results of the 
principal axis factor analysis which included 
A-B scale score as one of 17 variables were 
surprising.® In the case of the male Ss, MF 
loaded (.89) with LA (.43) and on the fourth 
factor, which accounted for very little (about 
9%) of the common variance. A-B did not 
load with any other scale measure at a level 
even approaching significance (highest loading 
was .11, on a factor showing V, NS, and SS 
loadings ranging from .70 to .84). A sixth 
factor, which accounted for about 6% of the 
variance, was comprised of a .90 loading on 
A-B scale score, but no other scale loaded 
significantly on this factor. The simple correla- 
tion between A-B and MF scale scores, for 
males, was —.11 (ms). Similarly for females, 
A-B loaded alone (.91) on sixth factor. The 
highest MF loading (.93) was again on a 
factor with only two other appreciable load- 
ings (LA, .43 and SI, .51), and the TO scale 
loaded (.78) on an essentially uninterpretable 

5 The complete varimax matrices for the criterion 
group factor analyses are available on request from 
the first author. 


®The complete factor analysis is available on re- 
quest from the first author. 
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third factor (12% of the variance) on which 
the A-B loading was only .11. 


Discussion 


Considered in conjunction with the findings 
of the discriminant analysis, the fact that the 
A-B variable is minimally correlated with the 
variables involved in the discrimination of ex- 
treme scorers would seem to argue for a quasi- 
typological interpretation of the A-B variable. 
That is, while in some absolute sense Bs may 
be more masculine, less verbal, etc., than As, 
these differences are not reliably manifested 
throughout the middle range of the distribu- 
tion of A-B scale scorers, but only at the 
extremes (the outer 7% or so). 

The conclusions to be drawn from this re- 
search should be tempered by noting that the 
Ss in the present study, unlike practicing 
therapists and Ss drawn on a voluntary basis 
to participate in psychotherapy analogue 
studies, may be more heterogeneous with re- 
spect to their intended occupations or profes- 
sions, and may represent a wider spread of 
aptitude. For this reason, the results of the 
discriminant analysis should be understood 
conservatively. Even so, the MF-V-NS triad, 
as here measured, can be regarded as im- 
portant discriminators between extreme As 
and extreme Bs, in the only study thus far 
which has attempted both a measure of ap- 
titude and a measure of masculinity-femininity 
(other than the MF scale of the SVIB itself). 
That the discriminatory power of these vari- 
ables applies largely to extreme scorers does 
not seriously detract from the conclusions 
which may be drawn from this study. In this 
connection, it should be noted that in the 
study of Campbell et al. (1968) already dis- 
cussed briefly, the A doctor was the most ex- 
treme scorer in the A direction while the B 
doctor was less extreme only than some oc- 
cupations which are easily conceptualized as 
primarily mechanical-practical (e.g., carpen- 
ter, pilot). There are thus other indications 
that a quasi-typological rather than continu- 
ous relationship of the A-B variable to other 
measures is tenable. 

The direct (experimental) investigation of 
differences between As and Bs has shown repli- 
cable results in only two areas. B-type ther- 
apists (Pollack & Kiev, 1963) and under- 
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graduates (Shows & Carson, 1965) appear to 
exceed As in what Witkin has termed “psy- 
chological differentiation” (Witkin, Dyk, Fat- 
erson, Goodenough, & Karp, 1962). That is, 
Bs were more “field-independent” on the rod- 
and-frame test than As, who in turn were sig- 
nificantly more variable in performance. Other 
recent studies suggest that As and Bs may 
react differentially to stress, with As being 
inclined to inhibit anger or turn it inward, 
while the stress reactions of Bs are less well 
understood (Berzins, Friedman, & Seidman, 
1969; Sandler, 1965; Welch & Berzins, 1968). 

The findings of the present study can be 
related, in terms of both theory and methodo- 
logical implications, to both of these areas of 
experimentation. Based on the above-described 
rod-and-frame studies and other correlates of 
“mode of spatial orientation,” Silverman 
(1967) holds that a “greater communication 
capacity” of As with schizophrenics “may be 
conceptualized in terms of perceptual, cogni- 
tive and personality variables.” He elaborates, 
however, almost exclusively in perceptual 
terms: 


The A-type ... is responsive to more stimulus at- 
tributes of the perceptual field including incidental 
social behavior cues. He is more receptive to the ef- 
fects of seemingly irrelevant stimulation, and to 
changes in the organization of the perceptual field. 
He is more capable of relaxing his orientation to re- 
ality and responding to hunches and intuition. The A 
therapist is, thus, more accepting of the “realness” of 
the schizophrenic patient’s perceived unreality, . . . 
Overall, the perceptual responses of the A-type ther- 
apist are more similar to those of the schizophrenic 
patient than are those of the B-type. The dominant 
perceptual tendency of the B therapist is to counter- 
act stimulus effects which interfere with articulated, 
reality-attuned cognitive activity [p. 12]. 


Tn view of the above formulation, an impor- 
tant finding is that the outer extremes of the 
A-B scale score distribution show differences 
with respect to Co. This is at least consistent 
with direct experimental evidence suggesting 
that Bs have a cognitive style which differs 
from that of As in that it reflects greater 
“psychological differentiation.” To whatever 
extent a rational, critical, logical approach to 
problems (TO for females) and the ability to 
withstand and persevere in the face of new. 
novel, uncertain, or ambiguous events (Co for 
both males and females) is similar to “psycho- 
logical differentiation,” there is in the present 
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study a strong trend for Bs to manifest these 
characteristics to a greater extent than As. 
However, if As overall perceptual and cogni- 
tive modes are more similar to the schizo- 
phrenic patient’s than the perceptual and cog- 
nitive modes of Bs, as Silverman suggests, 
then As should have higher scores than Bs on 
SF, which was not the case. Moreover, if As 
are “more receptive to changes in the or- 
ganization of the perceptual field,” as Silver- 
man holds, they should be more, not less, 
“cognitively complex” on a measure such as 
the Co scale of the OPI. While it is easy to 
speculate about perceptual, cognitive, and per- 
sonality differences which are consistent, re- 
sults such as those of this study discourage 
broad speculation in the absence of solid 
empirical referents. 

The masculinity-femininity findings of this 
experiment are generally consistent with the 
findings of Berzins et al. (1968), Sandler 
(1965), and Welch and Berzins (1968) that 
As are inclined to inhibit anger or turn it in- 
ward. The B, more masculine, should be 
more defensive about admitting to feelings 
of inadequacy or anxiety, and in the factor 
analysis, MF did load with LA, a scale de- 
scribed precisely in those terms. The dearth 
of correlates of the A-B scale with variables 
in the present battery which in principle 
should relate to differential stress reactions of 
college-age males (e.g., TI, SI, and especially 
SF and IE) is consistent with other recent 
findings that personality scales often do not 
consistently relate to operationally defined 
variables except at the extremes of the dis- 
tribution of scale scores. For example, the 
failure of middle groups of scorers on the 
Taylor Manifest Anxiety scale to perform 
consistently under stress or show “appropri- 
ate” physiological reactions is well known, 
and Dublin (1968) and others have recently 
shown that the Repression-Sensitization 
(R-S) scale may not represent measurement 
along a unidimensional continuum from “re- 
pression” to “sensitization.” 

Summarizing, the absence of correlational 
findings which demonstrate linear relation- 
ships between the A-B variable and the other 
measures used here and the absence of per- 
sonality scale findings which would be more 
consistent with clinical speculations based on 
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perceptual and cognitive style research re- 
quire the findings of this study to be under- 
stood conservatively. Nevertheless, masculin- 
ity-femininity, in its various manifestations, 
and cognitive style as implied by at least two 
manifestations (problem solving approach; 
tolerance for change in the perceptual-cogni- 
tive field) do help to discriminate extreme 
scorers on the A-B scale. Important in this 
“cognitive style distinction” are verbal-intel- 
lectual skills versus practical-nonverbal ones, 
for even slight differences here differentiate 
scorers at the extreme of the A-B scale. 

Our findings seem important for two differ- 
ing but related methodological reasons. First, 
they underscore the importance of middle 
groups in personality, cognitive, and percep- 
tual research in general, especially when an 
attempt is made to relate scores on a per- 
sonality scale to other measures of personality 
or cognitive or perceptual style. Second, they 
may show that in personality research, the 
“phenotype” may bear closer scrutiny than 
the “genotype.” That is, the future of the 
A-B line of investigation may lie in a con- 
tinued search for behavioral differences be- 
tween As and Bs in dyadic interactions in- 
volving, or at least modeled after, the kinds 
of interactions involved in actual psycho- 
therapy, and may not lie in further attempts 
to refine or extend the A-B scale or to achieve 
construct validation by building up a nomo- 
logical network through broad inferences from 
other closely related personality research 
areas, Such differences in dyadic behavior 
may then be understood on other, largely 
phenotypic, grounds, such as verbal and non- 
verbal communicative styles. 
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DISTINCTIVE PERSONALITY ATTRIBUTES OF 
CHILD-ABUSING MOTHERS * 


BARRY MELNICK axb JOHN R. HURLEY * 
Michigan State University 


To explore hypotheses derived from contemporary child-abuse writings, 
groups of 10 abusive (A) and 10 control (C) mothers, matched for age, 
social class, and education, were compared on 18 personality variables. As 
differed reliably from Cs by scoring higher on TAT pathogenicity and depend- 
ency frustration, but lower on TAT need to give nurturance, self-esteem 
(California Test of Personality), Manifest Rejection, and family satisfaction 
(Family Concept Inventory). These findings are inconsistent with contemporary 
descriptions of As as being chronically hostile, overwhelmed by maternal 
responsibilities, domineering participants in a power struggle, or as “normal 
personalities.” Appearing more characteristic of As are an inability to empathize 
with their children, severely frustrated dependency needs, and a probable 


history of emotional deprivation, 


The recent child-abuse literature offers many 
hypothetical explanations for this behavior, 
but few statistically defensible findings. Thus, 
the child-abuse victim’s mother has been vari- 
ously described as (a) chronically aggressive 
(Curtis, 1963; Nurse, 1964; Young, 1954), 
(b) highly frustrated by the restrictions of 
the maternal role (Blumberg, 1964; Feinstein, 
1964; Gladston, 1965), (c) an authoritarian 
locked into a power struggle with the child 
(Newston, as quoted in Blumberg, 1964), 
(d) a psychologically normal individual re- 
sponding violently to a host of social stresses 
(Elmer, 1965; Reinhart, 1967), and (e) an 
unstable, emotionally immature person (Na- 
than, 1965), who is extremely dependent and 
narcissistic (Kemp, Silverman, Steele, Droege- 
mueller, & Silver, 1962; Komisaruk, 1966). 
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of how these ascribed attributes apply to the 
abusive (A) mother is offered by this study. 


METHOD 


Ten A’s referred to the Wayne County Clinic for 
Child Study by physicians for suspected child-abuse, 
based on such medical evidence as subdural hema- 
tomas, soft tissue swellings, skin bruisings, and re- 
peated fractures, were compared with 10 control (C) 
mothers. Two criteria further defined this A group: 
(a) the mother, rather than the father, was believed 
to have inflicted the abuse; and (b) the abused child 
was under 3 years of age. Probably because this 
clinic is in a lower-class Negro section of Detroit, 
eight As were low-socioeconomic-class Negroes. Half 
of the As resisted returning to the clinic for study 
and were tested within their homes after being told 
only that additional information was needed from 
them, Potential Cs were recruited from the same 
locale with the aid of the Brewster Neighborhood 
Social Service Organization of Detroit which supplied 
a list of young mothers residing within the Brewster 
Housing Project. The first 10 Cs personally con- 
tacted agreed to participate in “a research project 
concerned with the attitudes and feelings of young 
mothers”; all were Negro. The following data de- 
scribe the degree of match achieved between the A 
and C groups on the selected variables: age in years: 
A= 23.8, C= 25.6; total weekly income: A = $92.8 
C=$93.3; years of education: A=9.9, C= 11.6; 
number of children: A= 2.5, C=2.8; age when first 
married; A= 20.3, C= 20.2; and Hollingshead-Red- 
lich (1958) social class index 8; A=48,C=48. 

Each S was administered four personality assess- 
ment measures selected because of their apparent 
relevance to certain of the child-abuse hypotheses. 


* Because many of the Ss resided in housing proj- 
ects, the “residence” element of the index was Te 
placed by annual income, divided into seven classes: 


Given first were the Self-Reliance, Self-Esteem, Sense 
of Personal Freedom, Feelings of Belonging, and 
Withdrawing Tendencies subscales from the California 
Test of Personality (CTP; Thorpe, Clark, & Tiegs, 
1953). The CTP has been validated with various 
global indexes of personality adjustment (Peak, 1963; 
Tindall, 1955). Next was the Family Concept In- 
ventory (FCI), a measure of general satisfaction with 
one’s family which has differentiated successfully be- 
tween families predetermined to be high and low in 
general adjustment (van der Veen, Huebner, Jorgens, 
& Neja, 1964) and which also correlates substantially 
(van der Veen et al., 1964; Palonen, 1966) with the 
Locke-Wallace Marital Adjustment scale. Then ad- 
ministered was an index of the general harshness of 
parental disciplinary policies, the Manifest Rejection 
(MR) scale (Hurley, 1965), shown to correlate im- 
portantly with parents’ direct, oral acknowledgments 
that they applied overtly punitive acts to their child. 

Last given was a set of 12 TAT cards, the re- 
sulting stories being analyzed by two scoring tech- 
niques (Melnick, 1968). One approach, called the 
Pathogenic Index (PI), is scored for content reflect- 
ing either an inability to empathize with others or 
which shows a lack of “basic trust” in the environ- 
ment, The PI has differentiated as predicted between 
the mothers of schizophrenics and of nonschizo- 
Phrenics in several recent studies (Meyer, 1964; 
| Meyer & Karon, 1967; Mitchell, 1965), In the other 
method TAT stories were scored for needs Affiliation, 
Aggression Dependence, Dominance, Independence, 
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and Nurturance. Need frustration scores were also 
determined for needs Affiliation, Dependence, Domi- 
nance, and Independence; needs Aggression and 
Nurturance had been defined in ways which made 
similar scores inappropriate. 


RESULTS 


All TAT stories were scored independently 
by the senior author and another rater. The 
interscorer reliabilities, using the product- 
moment correlation, exceeded .80 for all need 
and frustration scores; for PI this r = .94. 

The ¢ test for small samples (Hays, 1963) 
was used to assess the significance of all differ- 
ences between the A and C groups. The small 
size of the samples and the crudeness of the 
personality measures were viewed as sufficient 
justification for using the .10 level of sig- 
nificance in interpreting all results. Table 1 
shows that 6 of the 18 measures reliably dif- 
ferentiated between the A and C mothers, 
The As revealed lower CTP Self-Esteem (p 
< 05), less family satisfaction (p < .10), a 
higher PI (p < .002), less need Nurturance 
(p < .05), higher frustration of need De- 


TABLE 1 
PERSONALITY MEASURE COMPARISONS OF THE ABUSIVE AND CONTROL MOTHERS 


Personality measure 


SSN ae 
California Test of Personality 


Self-Reliance 9.2 
Self-Esteem 8.7 
Sense of Personal Freedom 9.5 
Sense of Belonging 10.5 
Withdrawing Tendencies 8.8 


Family Concept Inventory (FCI) 91.4 
Manifest Rejection (MR) 54.0 
TAT Ratings 
Pathogenic Index (PI) 70.7 
need Dependence 27 
need Independence .24 
need Affiliation 
need Aggression 
need Dominance 


“n Dependence frustration 
n Independence frustration 
n Affiliation frustration 
n Dominance frustration 


* p <.10, two-tailed. 
„p< 05, two-tailed 
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TABLE 2 


Propuct-MoMENT CORRELATIONS AMONG PERSON- 
ALITY ÅTTRIBUTES WHICH DIFFERENTIATED 
BETWEEN ABUSIVE AND CONTROL 


MOTHERS 
Depend- 
Personalit Self- | ,7eec_| _ency 
versonality | MR | PI | Esteem | Nutr | frustra- 
tion 
FCI L2 —.58%  .78**# 1.250 —.39. 
MR —.24 109 6 —.40* 
PI —.43* 62 25 
Self-Esteem 15 —.32 


n 
Nurturance 


* p < .10, two-tailed. 
** p < 05, two-tailed. 


pendence (p < .05), and a less openly reject- 
ant (MR) stance toward children (p < .05). 
Table 2 gives the product-moment correla- 
tions among these six measures which differ- 
entiated between the A and C mothers. Family 
satisfaction (FCI) relates importantly to self- 
esteem and inversely to pathogenicity (PI). 
Self-esteem and an inability to empathize with 
and accept others (PI) correlated negatively, 
while this latter variable (PI) also relates in- 
versely with the need to give nurturance. 
The intercorrelations among these six mea- 
sures were subjected to an elementary factor 
analysis (McQuitty, 1961) which disclosed 
that the following variable pairs were more 
highly co-linked than associated with any 
other index: FCI and CTP Self-Esteem, PI 
and need Nurturance (need to give nurtur- 
ance), and MR with need Dependence frustra- 
tion. In terms of their percentage contribu- 
tions to the total covariance of this six-vari- 
able matrix, these measures ranked: FCI 
(26%), PI (22%), CTP Self-Esteem (21%), 
need Nurturance (13%), need Dependence 
frustration (12%), and Manifest Rejection 


(7%). 
Discussion 


Plainly limiting the generalizations which 
can be drawn from the present findings are 
the smallness of the samples and their largely 
lower-class Negro composition. Further, it 
would have been advantageous to have pe 
praised the degree of disturbance of all the 
children, or to have had a C group mothers 
of disturbed children, because the A mothers 
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might have been confronted with dealing with 
a more disturbed child than those faced by the 
C group. : 

Within these restrictions, the As high PI 
and low need (to be) Nurturant suggest that 
these women may have a deficient capacity 
for empathizing with and administering to 
their children’s needs. These scores also imply 
that the As had experienced considerable 
prior frustration of their own emotional needs. 
Fey’s (1955) findings that self-acceptance 
varies with both the feeling of being ac- 
cepted by others and the actual acceptance 
by others seems relevant. The As do not per- 
ceive their families as sufficient in meeting 
their emotional needs. The FCI mean (91.4) 
of the As was sharply below that of the Cs 
(130.5). Yet, the Cs mean falls notably be- 
low the mean (154.9) of two samples 
(Palonen, 1966; Updyke, 1968), totaling 99 
Ss from a university community of young, 
middle-class, white mothers. The relationships 
between the FCI index and the two psy- 
chological adjustment indicators (PI and CTP 
Self-Esteem) suggest that the As psychopath- 
ology is associated with and perhaps con- 
tributes to their very poor family adjustment. } 

The As frustrated need Dependence re- 
flects both marked dependency and feelings 
of being unable to cope with the responsi- 
bilities of life accompanied by a perceived 
lack of environmental emotional supports. 
This view fits Olive’s (1966) data which sug- 
gested that child killings often followed shortly 
after the mother had been abandoned by or 
cut off from an important supportive person, 
usually a husband or lover. 

It can be speculated that the As may experi- 
ence a deep sense of anxiety from their feel- 
ings of being unable to cope with their re- 
sponsibilities in the face of inadequate sup- 
ports which predisposes them to the arousal 
of anger (Sullivan, 1953, p. 212) whenever 
their children are difficult to manage. These 
frustrated dependency needs may lead to 4 
semiobsessive preoccupation with their own 
needs which limits their ability to identify with 
the child. Also, this narcissism may hinder the 
development of a strong attachment to the 
child which might have helped to control 
aggressive impulses. 


ATTRIBUTES OF CHILD-ABUSING MOTHERS 


That the As would score below the Cs on 
Manifest Rejection (A = 54.0, C = 66.0) was 
not anticipated. Defensiveness may be the 
explanation, for the As could well have feared 
that psychological test findings might con- 
tribute to the initiation of criminal charges. 
An alternative interpretation is that the As 
actually had more rigid defenses against the 
expression of hostility than did the Cs. This 
view is consistent with Megargee’s (1967) 
report that homicidal killers include both 
overtly aggressive and overcontrolled types 
of individuals. 
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SPI DISCRIMINATORS AMONG FOUR PSYCHOLOGICAL 
HEALTH-SICKNESS LEVELS * 


DON SPIEGEL 


Brentwood Hospital, Veterans Administration Center, Los Angeles 


Criterion groups representing four psychological health-sickness levels were ' 
formed for each sex from among 87 hospitalized psychiatric patients and 145 
normal adults. Health-sickness levels were established for patients on the 
basis of clinical judgments, and for normals, on the basis of an MMPI severity 
of pathology index. Magnitudes of mean scores for 14 scales of the Spiegel 


Personality Inventory (SPI) were in rank order of pathology level for males 
and females. Both univariate and multivariate methods were used to select 
10 scales which appear to be the most promising for use in estimating health- 
sickness levels in male and female adults. These were deviant response, 
deterioration feeling, self-dissatisfaction, intrusive thought, alienation, dread, 
future planlessness, olfactory minimizing, auditory minimizing, and tactual 


minimizing, 


Although a revised system of psychiatric 
nomenclature has recently been developed in 
an effort to improve international communica- 
tion among mental health professionals and 
in order to meet changing administrative 
needs and therapeutic goals (Spitzer & Wil- 
son, 1968), considerable work has also been 
done to provide a basis for describing in- 
dividual behavior in terms of a single “health- 
sickness” continuum (Luborsky, 1962). Pro- 
ponents of the latter approach, which tends to 
cut across varieties of pathology to focus on 
levels of functioning or dysfunctioning, con- 
tend that individuals may be meaningfully de- 
scribed and ranked in terms of their overall 
ability to cope with life situations under 
ordinary conditions of stress, or by the extent 
to which adequate psychological functioning is 
impaired by disease, organic pathology, and/or 
by maladaptive defenses or behavior patterns. 


1 Contributions of others to this study were as 
follows: W. J. Dixon (statistical consultation) ; 
Arlene Hartry (assistance in project development) ; 
Normal Sadwick, Ursula Henderson, Charles Acker, 
Leonard Rosalsky, Rodolfo Vocal, Carl Hanssen 
(clinical ratings); Howard Kroll and Joel Abrahams 
(test scoring); Marie Brady (manuscript typing) ; 
and Morris Paulson (assistance in various phases) . 
Data was tabulated by the Statistical Unit, Psy- 
chology Service, Brentwood Hospital. Data was 
transferred to punched cards by Veterans Ad- 
ministration Western Research Support Center. 

* Requests for reprints should be sent to Don 
panne pe atweed Hospital, Veterans Adminis- 

ter, Wilshire and Sawtelle Boulevar: 
Angeles, California 90073 . k 
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In recent years, attempts have been made 
to develop reliable test indexes of “severity of 
psychopathology,” in particular with the 
MMPI, for example, Tamkin (1959), Sines 
and Silver (1963), Blumberg (1967), and | 
Dunlop and Waltmann (1966). Also, con- 
siderable success has been reported in relating 
ratings based on Rorschach responses to clin- 
ical judgments of health-sickness level (May- 
man, 1967). | 

The present study combined univariate and 
multivariate approaches to the problem of 
selecting sets of test variables from the Spiegel 
Personality Inventory (SPI; Spiegel, 1965) 
which would maximize the likelihood of ac- 
curately classifying members of criterion 
groups representing four levels of psycholog- 
ical health-sickness and would minimize the 
likelihood of misclassification. 


METHOD 
Subjects 


All patients on a male ward and a female ward of 
a Veterans Administration neuropsychiatric hospital 
were rated for severity of psychopathology on & 
21-point scale on which health-sickness levels wet? 
explicitly defined. Each patient was rated by his | 


2 The rating scale was subdivided as follows: 1-3 
points (ideal normal), 4-9 (usual range for non- 
hospitalized “normals”), 10-18 (usual range for test- 
able psychiatric patients), and 19-21 (usually too 
Severely ill to be tested). Each range of 3 points 
was further defined. Raters were instructed that 
most ratings should fall in the range from 10 to 18 
Points which is a range of moderate to markedly 
Severe psychopathology characterized by increasing 
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ward psychologist and psychiatrist, A battery of 
tests was administered to the patients in small 
supervised sessions. The criteria for including a pa- 
tient in the study were (a) the two pathology 
ratings had to differ by less than six points, (b) 
all tests had to be completed and show evidence 
that S understood and cooperated with instructions, 
and (c) MMPI validity indexes had to show no 
evidence that an attempt was made to present an 
overly-favorable self-image on the test. The ratings 
of pathology made by psychologist-psychiatrist pairs 
were added. For each sex, two pathology levels (i.e. 
a “sicker” level (D) and a “healthier” level (C)) 
were formed on the basis of combined ratings. 
Twenty-one male patients and 23 female patients 
who received combined ratings equal to or less 
than the median combined rating of 26 were at the 
healthier level. Twenty-one male patients and 22 
female patients whose combined ratings were above 
26 were at the sicker level. 

Normal Ss were drawn from several sources in 
an attempt to match the patient groups as closely 
as possible on variables including age, socioeconomic 
level, education, and general occupational level. 
Most Ss were between 30 and 50 years of age, 
Caucasian, and from the lower to lower-middle 
socioeconomic level. Normals were similarly tested 
and subdivided into a level without apparent 
pathology (A) and a level with apparent “mild” 
pathology (B) on the basis of the Multiphasic Index 
(MI scale) of the MMPI. This scale, based on a 
weighted configuration of other scale scores, was 
reported by Dunlop and Waltmann (1966) to be a 
useful index of severity of psychopathology. The 
mean MTI score for the 145 male and female normals 
(A plus B groups) was 87.45. Level A was rep- 
resented by 44 males and 25 females with MJ 
scores of 87 and below, and Level B was rep- 
resented by 29 males and 47 females with scores 
above 87. 

Since the normals and patients were necessarily 
classified on the basis of different criteria of health- 
sickness, a further check was made on the validity 
of the assumption that group order ABCD formed 
a rank order of health-sickness levels. The mean 
scores and standard deviations of 20 MMPI clinical 
and research scales (MJ excluded) for each group 
were given to six clinical psychologists to rank 
independently for severity of psychopathology. 
There was unanimous agreement that ABCD was 
the proper ordering of groups on a health-sickness 
dimension. 


Tests and Scales 


The test battery included the complete MMPI 
and SPI, in addition to nine other tests (not to be 
considered here). Description of special MMPI 
scales may be found in Dahlstrom and Welsh (1960). 
Development of the SPI has been described else- 


difficulty in satisfactorily coping with problems and 
functioning adequately under stress, Raters were 
also asked to indicate their level of confidence in 
rating each patient (high, moderate, or low). 
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where (Elashoff & Spiegel, 1969; Spiegel & Hartry, 
1968; Spiegel & Keith-Spiegel, 1969), and a manual 
is available for research use (Spiegel, 1965). For 
the present study the complete 200-item inventory 
was administered and 41 scales scored and used in 
the analyses. Thirty-five of these scales are either 
15- or 18-point Guttman scales with no item overlap 
(Guttman, 1950). Of particular interest in this 
research was a special deviant response scale com- 
posed of 39 items from these scales which normals 
and psychiatric patients had answered differently on 
a previous study (with chi-square values for each 
item significant at p <.01), a two-item deterioration 
feeling scale (which calls for admission or denial 
of regression in one’s level of functioning), and a 
set of sensory and environmental “maximizer-mini- 
mizer” scales (which measure preference for rela- 
tively high, moderate, or low levels of stimulation), 
Scale names reflect the high end of the scoring 
range which, in most instances, is the pathological 
end of a psychological dimension, 


RESULTS 


Table 1 gives the means and F values 
derived from univariate analyses of variance 
of MMPI scales on 21 variables for each sex, 
As may be seen, the mean MMPI profiles of 
Level A men and women were similar and 
appear quite healthy, Level B men and women 
also had similar profiles, but with an elevation 
above Level A profiles on most scales which 
suggests the presence of mild emotional dis- 
turbance. Males tended to be somewhat 
higher on Hy, Pd, and Ma and females 
slightly higher on Ay and Dy. MMPI mean 
scores showed considerably greater separation 
between B and C levels of health-sickness for 
men than for women, which probably ac- 
counts for the much larger F values for males 
than for females. On most scales both C and 
D level females showed less pathology than 
either C or D level males (e.g., note mean 
scores on Hs, D, Hy, Pd, Pa, Pt, Sc, Ma, 
A, R, Es, Ay, and MI). The D group means 
(both male and female) showed more psy- 
chotic and sociopathic symptomatology than 
the C group means which, in turn, revealed 
higher neurotic pathology than the D group 
means. Thus, although ABCD was apparently 
the appropriate rank order of mean pathology 
for each sex, male patients with a given rating 
of pathology tended to have higher MMPI 
scores than females with the same rating. In 
view of this, it appeared advisable that sepa- 
rate discriminant analyses should be done on 
the data from males and females. 
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TABLE 1 
MMPI Means AND F VALUES vor MALE AND FEMALE ADULTS or Four HEALTH-SICKNESS LEVELS 
Level—Males Level—Females 
MMPI F F 
GA 2 B c D A B Ç D 
7 52.3 69,2 62.9 | 16.36% | 49.6 51.4 57.0 56.2 3.30"* 
A 10 56.0 79.9 78.0 | 38.89%* | 49.8 56.0 70.8 71.8 | 22.43%* 
Hy 55.9 57.0 70.5 62.0 9.55** | 54.4 52.2 59.4 57.9 3.32* 
Pd 54.5 61.8 71.3 TAT 1 LSAS 56.6 54.0 64.0 70.4 | 10.76** 
Mf 56.8 55,2 66.1 63.3 8.02** | 48.4 49.7 47.1 48.6 37 
Pa 51.0 55.4 67.9 78.1 | 30.98** 52.7 56.4 62.0 67.2 8.15% 
Pt 48.8 56.5 76.0 78.1 | 41.68** | 50.2 55.0 65.0 71.0 | 22.71 
Se 48,4 56.5 80.8 92.3 | 48.54** | 51.0 55.4 68.3 78.0 | 23.32** 
Ma 56.2 61.1 61.6 65.8 3.73 50.7 53.7 58.0 58.0 2.40 
Si 45.8 49.7 58.6 64.1 | 25.77** 49.4 59.4 61.9 64.8 | 15.86** 
L 3.6 3.6 4.2 4.5 1.13 4.6 3.4 4.4 4.4 3.73% 
R 2.6 5.8 10.4 17.8 | 30.63** 2.2 5.4 8.8 11.8 | 14.30** 
K 16.4 14.1 14.0 12.3 5,39%* 17.4 12.7 12.3 12.5 8.57" 
Al 47.1 58.2 89.4 98.9 | 37.24** | 47.3 60.6 81.7 90.5 | 28.90** 
A 41.7 49.9 60.2 64.9 | 42.00** | 40.2 50.9 56.8 59.3 | 20,39** 
R 48.5 45.7 54.9 53.4 5.37%* 47.2 48.6 51.6 51.2 82 
Es 62.6 54.4 40.9 34.1 | 66.31** 59.5 52.5 44.6 41.9 13.90** 
F/NF 11 3.3 8.2 13.5 | 39,98** 1.1 3.7 6.3 8.5 | 14.10** 
Ar 9,3 14.7 24.9 24.9 | 40.50** 10.9 18.6 23.5 24.1 17:10 
Dy 12.9 19.0 28.8 33.4 | 44.15** 16.2 27.5 31.3 33.3 | 20.91%% 
MI 80.0 92.8 112.0 115.8 | 57.37** 80.8 94.1 105.9 111.0 | 24.75** 
OF Sn 


Means and standard deviations were com- 
puted for all SPI scales and separate analyses 
of variance were done for each scale to test 
the equality of means for Groups A, B, C, and 
D for each sex. These were examined to find 
scales with mean scores in ABCD order of 
magnitude for both sexes, The 14 such scales 
found are among the 18 scales shown in 
Table 2. Of the variables listed, 15 uni- 
variate F values were significant at p < .01. 

Multiple discriminant analyses were then 
done for sexes separately for the four groups 
(A, B, C, and D) using the 18 variables 
shown in Table 2. The analysis (Dixon, 1967) 
proceeds in a stepwise manner by forming 
linear sums of first one, then two, then three, 
etc., variables. The variable added is always 
the one which gives the greatest improvement 
in the classification of cases and the greatest 
decrease in the ratio of within to total gen- 
eralized variance. 

For the discriminant analyses, any vari- 
able whose F value to enter a set of discrimi- 
nants was below 2.20 was excluded from entry. 
For males, three variables were included in 


the discriminating set before the F values be- 
came insufficient for further computation. The 
single best variable was deviant response 
which, by itself, misclassified only 8% of the 
normal males as patients and less than 12% 
of the patients as normals. The alienation 
scale improved the accuracy of classifying 
members of Subgroups C and D, and in- 
trusive thought increased the accuracy of 
classifying both A and D groups. The F values 
for all between-groups comparisons were sig- 
nificant at p < .01. The percentages of cases 
classified into Groups ABCD by these dis- 
criminants are shown in Tables 3. This set of 
three variables correctly classified 77% of A 
group males, 39% of B group, 48% of C 
group, and 62% of D group. 

For females, five variables were included in 
the discriminating set before the F values 
became insufficient for further computation. 
Again, the best single variable was deviant 
tesponse which did not do as well in classifying 
females as it did males. In fact, it misclassi- 
fied 25% of the normal females as patients 
and 35% of the patients as normals. The 
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TABLE 2 
SPI Means AND F VALUES FOR MALE AND FEMALE ADULTS OF Four HEALTH-SICKNESS LEVELS 
Level—Males Level—Females 
SPI scale F F 
A B c D A B le, D 

Deviant response 66.7 75.3 | 104.7 | 106.1 | 53.80** 76.1 86.2 | 104.0 | 105.4 | 21.91** 
Deterioration feeling 29 3.2 6.0 5.9 | 30.99** 3.4 3.9 5.5 6.3 | 13.80** 
Self-dissatisfaction 9.5 12.0 16.2 15.9 | 23.27** 10.6 12.8 16.4 16.6 | 11.65** 
Social dissatisfaction 10.6 12.1 15.1 16.4 | 16.57** 10.4 13.7 16.7 17.2 | 19,60** 
Social insecurity 8.0 | 10.9 12.6 13.1 | 14.71** 8.4 11.2 13.1 13.5 | 10.34** 
Alienation T. 8.6 10.1 13.4 | 19.63** 7.8 9.8 10.8 11.5 7.09** 
Fearfulness 9.3 9.7 11.6 12.1 5.34** 12.4 13.0 13.5 14,7 1.92 
Dread 8.0 9.5 12.4 13.8 | 18,55** 8.0 9.8 11.0 13.1 8.61** 
Hopelessness 7.4 6.9 11.0 LATA 23.93" 6.1 7A 9.9 11.0 | 16.32** 
Anger-depression 10.3 12,4 15.5 16,0 | 12,03** | 12.8 13.1 15.0 15.5 2,72" 
Sex disturbance 23.4 | 27.0 31.3 | 31.6 | 12.13** | 20.4 | 25.1 27,7 | 28.1 | 9.31** 
Impulsive action 10.1 | 11.9 13.0 | 13.8 | 6.32 9.3 | 11.9 12.7 | 12.5) 4.32" 
Intrusive thought 9.9 | 13.1 16.9 | 17.5 | 30.23** | 10.9 | 13.6 17.2 | 18.4 | 14.95** 
Obsessiveness 10,4 | 117 14.7 | 14.9 | 10.14 | 10.6 | 12.3 14.3 | 15.2] 7.04" 
Future planlessness 10.5 | 10.8 12.4 14.2 | 5.25** 9.7 11.0 11.5 13.6 4,98** 
Olfactory minimizing 8.2 10.1 11:7 12.5 | 9.36** 9.9 10.5 10.7 13.8 5.70** 
Auditory minimizing 8.2 10.1 13.8 14.5 | 26.89** 9.6 11.4 12.1 15.3 | 11,10** 
Tactual minimizing 7.6 | 10.0 11.3 | 13.6 | 18.76** | 10.0 | 10.1 12.3 | 14.2 | 9,94** 


Note.—F values are based on univariate analyses of variance. 
*b <05. 
wD < 01, 


addition of social dissatisfaction improved 
the classification of normals into Subgroups 
A and B, and the addition of auditory 
minimizing improved the classification of 
C and D patients. Tactual minimizing and 
deterioration feeling improved the classi- 
fication of A and B normals and B and C pa- 
tients. The F values for all between-groups 
comparisons for the five-variable set were sig- 
nificant at p < .01. Table 3 shows the per- 
centage of cases classified into Groups ABCD. 
This set correctly classified 64% of Group A 
normals, 45% of B normals, 57% of C pa- 
tients, and 55% of D patients. It misclassified 
about 21% of the normals (A plus B) as 
patients (C plus D), and almost 22% of the 
patients as normals. 

The first set of discriminants was then 
remoyed from the pool of potential entering 
variables, and for each sex, a second dis- 
criminant analysis was done to determine 
whether a second linear combination of vari- 
ables could be found which might do as well 
or better than the first in classifying cases. 
For males, four variables were entered into 
the discriminating set in the order: deteriora- 


tion feeling, tactual minimizing, self-dis- 
satisfaction, and auditory minimizing. This 
set of variables correctly classified cases for 
male Ss as follows: A: 73%; B: 52%; C: 
43%; and D: 53%. Misclassified as patients 
were 2% of A and 20% of B group normal 
males; misclassified as normals were 24% 
of C and 14% of D group male patients. 
For females, five variables were entered in 


TABLE 3 


PERCENTAGE OF Cases CLASSIFIED INTO GROUPS 
BY DISCRIMINATING VARIABLES 


Assigned group 
Actual group 
A B Cc D 
Males 
A 77 21 0 2 
B 38 39 17 6 
Cc 5 14 48 33 
D 0 5 33 62 
Females 
A 64 28 4 4 
B 27 45 19 9 
cC 4 17 57 22 
D 4 18 23 55 
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TABLE 4 
CORRELATIONS BETWEEN SPI AND MMPI Severity or PATHOLOGY SCALES 
MMPI Scale 
SPI scale ; 
F MI Ar Es Sc D-0 
Deviant response 61 75 75 =75 69 80 
Deterioration feeling 42 60 62 —60 55 66 
Alienation 51 54 45 —50 52 56 
Dread 51 60 62 — 63 55 56 
Future planlessness 30 26 25 —40 29 26 
Intrusive thought 48 65 74 —70 57 67 
Tactual minimizing 40 48 44 —53 46 46 
Auditory minimizing 53 59 57 —61 55 58 
Olfactory minimizing 35 31 32 —39 34 31 
Self-cissatisfaction 38 64 65 —49 50 70 


Note.—A correlation of .17 is significant at p <.01. 


the following order: hopelessness, intrusive 
thought, self-dissatisfaction, and sex disturb- 
ance. This set of variables correctly classified 
cases for female Ss as follows: A: 80%; B: 
45%; C: 39%; and D: 50%. Misclassified 
as patients (either C or D) were 4% of A 
and 21% of B group normal females; mis- 
classified as normals were 39% of C and 14% 
of D group female patients. 

Thus, from the two discriminant analyses of 
four groups for each sex, it was found that 
six variables appeared for both sexes in either 
the first or second set of discriminants, namely, 
deviant response, deterioration feeling, self- 
dissatisfaction, intrusive thought, auditory 
minimizing, and tactual minimizing. The 
classification of males into health-sickness 
levels was aided by alienation, and the classifi- 
cation of females by social dissatisfaction, 
hopelessness, and sex disturbance. 

Pearson product-moment coefficients of 
correlation were computed between each of 
the 18 SPI variables shown in Table 1 and 
clinical ratings of health-sickness level for 
the total group of 87 male and female pa- 
tients. The following significant correlations 
were obtained: alienation (.29), dread (.30), 
hopelessness (.27), intrusive thought (.29), 
future planlessness (.28), auditory minimizing 
(.32), olfactory minimizing (.24), and tactual 
minimizing (.25). The first four correlations 
were significant at p < .01; the last two at 
p< 05. 

The interrelationships between some of the 


more promising SPI indexes of health-sickness 
level and some of the more established MMPI 
counterparts may be seen in Table 4. These 
Pearson correlation values are based on the 
entire group of 232 patients and normals of 
both sexes. As may be seen, the deviant re- 
sponse scale has extremely high correlations 
with all the MMPI indexes, with those of 
deterioration feeling, dread, intrusive thought, 
and auditory minimizing being close behind. 
The highest correlation values for deviant 
response, deterioration feeling, and self-dis- 
satisfaction are with D-Obvious. Future plan- 
lessness, tactual minimizing, auditory minimiz- 
ing, and olfactory minimizing have their high- 
est relationships (negative) with Ego-Strength 
(Es). Intrusive thought is most strongly re- 
lated to indexes of anxiety (A, Ar). The 
scales with weakest relationships to MMPI 
scales are future planlessness and olfactory 
minimizing (both related more to Es than to 
the other scales). In a factor analysis of 65 
variables including many MMPI and SPI 
scales, and involving the same 232 cases, the 
variables in Table 4 all had substantial load- 
ings on a severity of psychopathology factor, 
with especially high loadings for A, deviant 
response, MI, Es, intrusive thought, dread, F, 
deterioration feeling, and self-dissatisfaction, 
in descending order of magnitude. 


Discussion 


The finding that mean scores on 14 SPI 
scales were in rank order of pathology level 
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as defined by criterion groups of male and 
female adults is of particular importance for 
further research with the SPI in this area. 
Examination of Table 1 indicates that good 
potential discriminators also existed for the 
MMPI, namely, Pd, Pa, Pt, Sc, Si, F, AI, A, 
Es, F/NF, Dy, and MI. Indeed, a number of 
these have been reported elsewhere to be good 
indexes of severity of pathology. 

The 18 SPI scales in Table 2 show good 
potential for identifying the health-sickness 
level of respondents. Perhaps the most promis- 
ing scales are those which had means in ABCD 
order of magnitude, which had F values sig- 
nificant at p < .01 in the univariate analyses 
of variance, and for which scores for patients 
correlated significantly with criterion ratings 
of pathology. These, for both sexes, were 
alienation, dread, intrusive thought, future 
planlessness, olfactory minimizing, auditory 
minimizing, and tactual minimizing. Although 
deviant response, deterioration feeling, and 
self-dissatisfaction were selected in the dis- 
criminant analyses, primarily on the basis of 
their separation of A from B and B from 
C groups, they did not appear to be par- 
ticiularly useful in separating healthier from 
sicker patients. 

A novel aspect of the results was the finding 
that sensory scales of the SPI appear to be 
among the promising discriminators of health- 
sickness level. These scales all involve either 
a preference for or aversion to relatively in- 
tense, varied, or abrupt stimulation in a par- 
ticular modality. Persons earning high scores 
acknowledge strong aversive reactions to stim- 
ulus inputs which low scorers find pleasant or, 
at least, not annoying. In another study, sig- 
nificant correlations were found between SPI 
sensory minimizing scores and anxiety scores 
on the Taylor scale (Ar) for both patients 
and normals. It was also found in previous 
work, that sensory minimizers selected lower 
levels of stimulus intensity than did sensory 
maximizers in free choice laboratory situa- 
tions, Nevertheless, the significance of sensory 
minimizing tendencies for psychopathology is 
unclear at this time. 

It has been frequently reported that 
“sicker” individuals tend to endorse more 
socially undesirable responses or admit more 
pathology than healthier individuals. Whether, 


in a given instance, this is associated with ; 
lack of effective defenses, a lack of concer 
for what others think about one, a desire ti 
look ill, or the presence of truly greater path 
ology, is not always clear. On the basis o 
the author’s analysis of responses to thi 
MMPI and SPI, it would appear that the 
average person at each successively highe 
pathology level tends to complain more thar 
those at lower levels about something being 
wrong, either with himself, or with thing: 
going on in the world around him. 

Although many SPI items have neither 
socially desirable nor undesirable response pos- 
sibilities, it is not these items which appear 
to be the best discriminators among criterion 
pathology levels. Even the sensory scales, in 
which the items do not provide an opportunity 
to admit or deny pathology per se, do permit 
S to admit annoyance or aversion to certain 
kinds of stimulus inputs. It would seem that 
given the opportunity to admit annoyance, the 
“sicker” individuals will do so. This view was 
supported by other results obtained from Ss 
used in this study who were asked to indicate 
the presence of personal problems from com- 
prehensive checklists. The mean number of 
problems checked were in ABCD order of 
magnitude for both sexes, and the F values 
for the analyses of variance of four groups 
were significant at p < .01. Thus, it would 
appear likely that severity of psychopathology 
or health-sickness level is positively associated 
with general annoyance level which may be 
manifested in any suitable vehicle which offers 
an opportunity to complain. 

In terms of the SPI scales of Table 2, 
which were grouped according to spheres of 
content similarity, a characterization of the 
average patient at Brentwood Hospital, male 
or female, who is regarded by mental health 
professionals to be among the most severely ill 
is provided. He is dissatisfied with himself and 
with social relationships. He feels he is held 
in disrespect and low regard by others, and 
he feels misunderstood and isolated. He has a 
foreboding anticipation of imminent catas- 
trophe, fear that participation in potentially 
dangerous life events will bring him harm, or 
a fear of death or attack. He views life as a 
rather hopeless struggle, with a future which 
has nothing to offer. He sees himself as easily 
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upset and in a bad mood much of the time. 
He feels ill at ease around others and often 
guilty without knowing why. He tends to 
fly off the handle easily and to be unable to 
delay gratification. He has a disturbing in- 
ability to maintain control of thoughts, to 
concentrate and attend. He denies concern 
with the future and seems unable to plan 
ahead. He is annoyed by intense or highly 
varied stimulation in one or more sense 
modalities. 

It is, of course, not to be expected that all 
individuals at a given pathology level will 
have similar elevations on the SPI scales. 
Rather, as with the MMPI, configurations of 
these and other SPI scale scores tend to 
elucidate the predominant characteristics of 
the individual. It is probable that a patient 
with a generally elevated profile or high 
elevations on a number of the 18 scales of 
Table 2, would be regarded as relatively ill, 
maladjusted, disturbed, or sick by the hos- 
pital staff, using criteria other than tests 
on which to base their judgments. 
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PROSPECTIVE SPAN AS A COGNITIVE ABILITY 


LOUIS S. DICKSTEIN + 
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Prospective span (PS) in TAT stories may be regarded as a measure of the 
cognitive capacity of anticipation. As such, it should be related to cognitive 
tasks involving the specific capacity of anticipation but not to more gen- 
eral measures of verbal intelligence. In a sample of 51 male college students, 
PS was found to be related to two cognitive tasks requiring anticipation, the 
Mazes Test and the Seeing Deficiencies Test, but unrelated to the Vocabulary 
and Information subtests of the WAIS. Contrary to previous studies, PS 
was found to be negatively related to academic grades. Large differences were 
obtained between PS scores for the different TAT cards. 


Within the past 15 years, a number of 
studies of future time perspective have been 
conducted which have been variously referred 
to as studies of foresight, anticipation, plan- 
ning ability, or prospective span (PS). One 
of the most promising methods for the mea- 
surement of this variable has involved the 
scoring of TAT stories for PS (Epley & Ricks, 
1963; Teahan, 1958). 

Epley and Ricks (1963) reported a correla- 
tion of only .08 between PS and the Scholastic 
Aptitude Test. Similarly, in a subsequent 
study (Ricks, Umbarger, & Mack, 1964), a 
nonsignificant correlation was obtained be- 
tween PS and intelligence. On the other hand, 
Epley and Ricks did report a significant 
relationship between PS and Cattell’s intel- 
ligence scale measuring “the capacity to 
educe relationships,” while Dickstein and 
Blatt (1967) found that high and low 
scorers on the Picture Arrangement sub- 
test of the Wechsler Adult Intelligence Scale 
(WAIS) differed significantly on PS. A pos- 
sible resolution of these discrepancies may lie 
in viewing PS as a specific cognitive ability 
which is significantly correlated with other 
measures of anticipation, such as the Picture 
Arrangement subtest, but which is not neces- 
sarily related to more general measures of 
verbal intelligence. The present study was 
conducted to determine the relationships 
between PS and measures of general verbal 
intelligence and measures of the more specific 
cognitive ability of anticipation. 

1 Requests for reprints should be sent to Louis S. 
Dickstein, who is now at the Department of Psy- 


chology, Wellesley College, Wellesley, Massachusetts 
02181, 


Prospective span has been found to be 
related to academic achievement. In a study 
by Teahan (1958), high and low achievers 
in school differed significantly in PS, while 
Epley and Ricks reported a correlation of .43 
between PS and 4-year college grade averages. 
A secondary aim of the present study was to 
replicate the reported relationship between 
PS and academic achievement. 


METHOD 


The Ss were 51 male college students at the City 
College of the City University of New York. All 
Ss were volunteers from introductory psychology 
sections. There were 15 freshmen, 35 sophomores, 
and 1 junior. Almost all Ss were 18 or 19 years of 
age. 

All Ss were given six cards of the TAT and were 
required to write stories. The cards used were 1, 2, 
8BM, 6BM, 5, and 12M in that order. The stories 
were scored for PS according to the scoring sys- 
tem developed by Epley and Ricks (1963). In ad- 
dition, Ss were given four tests. These were the 
Vocabulary and Information subtests of the WAIS 
(Wechsler, 1955) and the Maze Tracing Speed Test 
(MTST) and the Seeing Deficiencies Test (SDT) 
from the Kit of Reference Tests for Cognitive Fac- 
tors (French, Ekstrom, & Price, 1963). The Vo- 
cabulary and Information subtests were selected as 
the two verbal subtests with the highest reliability 
coefficients and with the highest correlations with 
total verbal IQ (Wechsler, 1955). The raw scores 
on these subtests were converted to scaled scores. 
The other two tests were selected because they re- 
quire anticipation for successful performance. The 
MTST requires S to find and mark an open path 
through a moderately complex series of paper mazes. 
The SDT requires S to point out the way in which 
a described plan or activity is faulty. 

The grade-point index was available for 45 of 
the 51 Ss and these data were used to check the 
relationship between grades and PS. 
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TABLE 1 


Means, STANDARD DEVIATIONS, AND RELIABILITY 
CoEFFICIENTS FOR PS AND Four 
COGNITIVE MEASURES 


Measure M SD Reliability 
Prospective Span 2.74 1.63 S74 
Maze Tracing 22.78 6.87 921 
Seeing Deficiencies 7.75 2.86 .588 
Vocabulary 12.92 1.39 940% 
Information 14.02 2.38 910" 


*The reliabilities for the Vocabulary and Information 
subtests are the reliabilities reported in the WAIS manual 
(Wechsler, 1955). 


RESULTS 


The means, standard deviations, and reli- 
ability coefficients for the five measures are 
presented in Table 1. The reliability coeffi- 
cients for PS, MTST, and SDT are split-half 
coefficients corrected for full length by the 
Spearman-Brown formula. It should be noted 
that although both the MTST and the SDT 
are speed tests, they are divided into two 
sections which are timed and scored separately 
so that a split-half coefficient is appropriate 
(Helmstadter, 1964). The reliability coeffi- 
cients for the Vocabulary and Information 
subtests are those reported in the WAIS 
manual (Wechsler, 1955). They are also 
split-half coefficients corrected by the Spear- 
man-Brown formula. 

The reliabilities of the Vocabulary, In- 
formation, and MTST tests are all excellent, 
but reliabilities of the PS score and the SDT 
leave much to be desired. The TAT reliability 
of .574 is considerably lower than the .80 
test-retest reliability which has been reported 


TABLE 2 


CORRELATION Matrix or PS, Four COGNITIVE 
MEASURES, AND ACADEMIC INDEX 


Louis S. DICKSTEIN 


(Ricks et al., 1964). Furthermore, the average 
intercorrelation between the six stories in 
the present sample is only .224 which is not 
significantly greater than chance. Nine of the 
15 intercorrelations, however, were significant 
with the highest reaching .372. Forty of the 
Ss’ protocols were also scored by a second 
rater. The product-moment interrater reli- 
ability for PS was .959. 

The correlation matrix for the five mea- 
sures plus academic index is presented in 
Table 2. As expected, the PS score is sig- 
nificantly correlated with both the MTST 
and the SDT, although the magnitude of the 
correlations is modest. The MTST and the 
SDT are also significantly correlated with each 
other. The multiple correlation coefficient be- 
tween PS and the MTST and the SDT is 
335 (p< 01, one-tailed). It is instructive 
to note that while the MTST and SDT are 
both significantly correlated with the In- 
formation subtest, and the MTST is also sig- 
nificantly correlated with the Vocabulary 
subtest, the correlations between PS and the 
WAIS subtests are not significant. 

As previously noted, the reliabilities of 
the PS score and the SDT are rather low. 
Tf the correlations are corrected for attenua- 
tion, the correlation between PS and the 
MTST becomes .403, between PS and the 
SDT, .465, and between the MTST and the 
SDT, .567 (For all of these, p< .01, two- 
tailed). The correlations between PS and the 
WAIS subtests remain nonsignificant. 

The correlations between the grade-point 
index and the various measures are somewhat 
surprising. The index is not significantly re- 
lated to any of the measures including the 
WAIS subtests. The highest correlation is 
with the PS score and that correlation is the 


TABLE 3 
TRA 7 | 3 | x | R | i [e Means, ¢ VALUES, AND PROBABILITY LEVELS FOR 
Comparison or HicH- anp Low-PS SUBJECTS 
1, PS = 
2 Mist 2038 Pre S Measure | High | Low df t $ 
4. Vocabw: 
~ nt, i —.018 .289* 202 — MTST 26.18 | 22.34 | 32 | 1.70 | .05 
S rion a38 a2see aae oas — SDT 8.18 | 6.59 | 32 | 1.87 | .05 
. Index 221 (012.088 090 —.014 — Vocabulary | 13.12 | 13.12 | 32 00 ns 
Information | 14.53 | 14.18 32 i 
Note.—N = 51 for the correlations between the cognitive Index 1.98 2.36 | 28 as 025 


measures and 45 for all correlations with the I 
*p <.05, one-tailed. a Seats 
** > <.01, one-tailed. 


^ One-tailed. 
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reverse of what might be expected from 
previous studies. Indeed, when this correla- 
tion is corrected for the low reliability of the 
PS score, the correlation becomes —.292 (p 
< .10, two-tailed). 

In a further analysis, the top third and 
bottom third of Ss on PS were selected and 
t tests were conducted for the differences 
between these groups on all measures. The 
mean PS score for the 17 high-PS Ss was 
4.58, while the mean for the 17 low-PS Ss 
was 1.00. The means, ¢ values, and probability 
levels for the five comparisons are presented 
in Table 3. The differences between the 
groups are significant for the MTST, the 
SDT, and academic index. The differences 
between the groups on academic index is the 
most significant. The groups are not sig- 
nificantly different on the Vocabulary and 
Information subtests. 

Finally, it should be noted that the different 
TAT cards elicit different prospective spans. 
The mean PS for the six TAT cards in terms 
of their order of presentation were 2.16, 3.49, 
3.61, 3.45, 1.84, and 1.94. This point has been 
noted before by Ricks (1965). 


Discussion 


The results of the study support the first 
hypothesis. The PS score on the TAT is 
significantly related to other measures of 
anticipation but not to general verbal intel- 
ligence. Admittedly, the relationship is a 
moderate one which, considering the multiple 
correlation, accounts for only about 11% of 
the variance of PS. On the positive side, it 
does not seem possible to attribute the ob- 
tained relationships to common method vari- 
arce as is the case for so many personality 
variables (Mischel, 1968), since the three 
measures entail quite divergent operations. 
Rather, the correlations appear to represent 
a meaningful relationship which would prob- 
ably be considerably higher were the methods 
of measurement more reliable. 

The low reliability of the PS score may be 
attributed, in part, to the use of only six 
cards in the present study. A minimum of 
10 cards would appear to be a more ap- 
propriate procedure. As for the SDT, the 
extremely low mean of 7.75 out of a possible 
score of 20 for a college population suggests 
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that the time limit of 10 minutes per section 
is too stringent. A longer time period would 
allow more time for thought and probably 
would also result in higher reliability for that 
measure. In any event, the goal of developing 
an adequate instrument for the measurement 
of such an important capacity as anticipation 
would seem to be highly worthwhile. 

The absence of a significant correlation be- 
tween PS and the Information and Vocabulary 
subtest scaled scores is limited to the range 
of IQ in the sample. This range may be 
estimated from the subtest scaled scores as 
between 97 and 141, although 84% of the 
scores fall between 112 and 138. The mean 
estimated IQ is 123. Although this is a 
restricted range, it is sufficiently broad to 
give meaning to the absence of any relation- 
ship. 

The differences between mean PS for the 
different TAT cards suggests the need for 
more systematic study of the role of stimulus 
variables. It certainly seems inappropriate 
to use any set of TAT cards and to assume 
that the results are comparable to studies 
based on other sets of cards. 

The negative relationship between PS and 
academic index was completely unexpected 
and contradicts two previous studies. No ex- 
planation can be offered for this finding at the 
present time. 

Finally, it should be noted that all studies 
of PS thus far have used male samples. 
Studies of anticipation in females would seem 
to be in order. 


REFERENCES 


Dicxstew, L. S., & Bratt, S. J. The WAIS Picture 
Arrangement subtest as a measure of anticipation. 
Journal of Projective Techniques, 1967, 31, 32-38. 

Eprey, D., & Ricks, D. Foresight and hindsight in 
the TAT. Journal of Projective Techniques, 1963, 
27, 51-59. 

Frencu, J. W, Exstrom, R. B., & Price, L. A. 
Manual for kit of reference tests for cognitive 
factors. Princeton: Educational Testing Service, 
1963. 

Hetmstapter, G. C. Principles of psychological 
measurement. New York: Appleton-Century-Crofts, 
1964. 

Miscuet, W. Personality and assessment. New York: 
Wiley, 1968. 

Ricxs, D. Measurement of temporal orientation 
and span through the use of fantasy materials. 
Paper presented at the meeting of the Eastern 


760 Louis S. DICKSTEIN 


Psychological Association, Atlantic City, April Teanan, J. Future time perspective, optimism, and 
1965. academic achievement. Journal of Abnormal and 
Ricxs, D., Umesarcer, C., & Mack, R. A measure of Social Psychology, 1958, 57, 379-380. 
increased temporal perspective in successfully Wercuster, D. WAIS manual. New York: The Psy- 
treated adolescent delinquent boys. Aiste 2 chological Corporation, 1955. 
i l 85- 
pico ane SS ele A e E (Received October 25, 1968) 


(Continued from page 738) 


A Dimension of Moral Judgment: Robert Hogan*: Department of Psychology, Johns Hopkins University, Balti- 
more, Maryland 21218. 

Treatment of “Adolescent Rebellion” by the Reconstrual of Stimuli: John C. Masters*: Institute of Child De- 
velopment, University of Minnesota, Minneapolis, Minnesota 55455. 

College Students as Companions the Long-Term Mental Hospital Patients: Some Considerations: Edward H. 
Fischer*: Department of Psychology, Connecticut Valley Hospital, Middletown, Connecticut 06457, 

Autonomic Reactivity to Sensory Stimulation in Psychopatic, Neurotic, and Normal Juvenile Delinquents: 
Thomas D. Borkovec*: Psychological Clinic, Childrens Research Center, Champaign, Illinois. 

Anxiety and Speech in the Initial Interview: Benjamin Pope,* Aron W. Siegman, and Thomas Blass: Department 
of Psychology Service, Sheppard and Enoch Pratt Hospital, Baltimore, Maryland 21204. 

Approval Motive and Academic Behaviors: The Self-Reinforcement Hypothesis: Michael S. Matell and Ronald 
E. Smith*: Department of Psychology, University of Washington, Seattle, Washington 98105. 

Anxiety and Depression in Speech: Benjamin Pope,* Thomas Blass, Aron W. Siegman, and Jack Raher: Depart- 
ment of Psychology Service, Sheppard and Enock Pratt Hospital, Baltimore Maryland 21204. 

A Semantic Space for Nonverbal Behavior: Albert Mehrabian*: Department of Psychology, University of Cali- 
fornia, Los Angeles, California 90024. 

Self-Awareness and Its Relation to Other Parameters of the Clinical Inference Process: Michael J. Chandler*: 
Department of Psychology, University of Rochester, Rochester, New York 14627. 

Operant Conditioning Principles in the Treatment of a Selectively Mute Child: J. Dennis Nolan* and Connie 
Pence: Department of Psychology, Ohio State University, 1945 North High Street, Columbus, Ohio 43210. 

Concept Identification and Practice in Brain-Damaged and Process-Reactive Schizophrenic Groups: Oscar A. 
Parsons* and Helen P. Klein: Department of Psychiatry, University of Oklahoma, 800 Northeast Thirteenth 
Street, Oklahoma City, Oklahoma 73104. 

Measurement of Appropriate and Unnecessary Help-Seeking Dependent Behavior: Sheldon Cottler,* Robert 
P alin and Richard J. Palmer: Department of Psychology, Indiana University, Bloomington, Indiana 

Is an Abbreviated Form of the WISC Valid for Afro-American Children?: Robert J. Resnick* and Alan D. Entin: 
Department of Psychiatry, Medical College of Virginia, 1200 East Broad Street, Richmond, Virginia 23219. 


* Asterisk indicates for whom address is supplied. 


Journal of Consulting and Clinical Psychology 
1969, Vol. 33, No. 6, 761-762 


SHOULD WISC SUBJECTS EXPLAIN PICTURE 
ARRANGEMENT STORIES? * 


JAMES M. HERRELL AND JEFFREY H. GOLLAND 2 


Brooke General Hospital, Fort Sam Houston, Texas New York University School of Medicine 


To determine whether requiring WISC Ss to explain their Picture Arrangement 
(PA) sequences affects PA scores, 78 Ss were given the WISC and asked to 
tell PA stories, and 57 Ss were given the WISC under standard instructions. 
Forty-two pairs of Ss were matched for age, Full Scale IQ, and reason for 
referral. The Ss who told PA stories had an average PA score 2.12 scaled score 
points higher than Ss who did not tell stories (p<.01). Similar differences 
significant at or beyond the .02 level were found for Ss referred for behavioral- 
emotional problems, Ss with IQs below 100, and Ss 9 years old or less. Trends 
(.05 < p < .10) were found for Ss referred for academic-intellectual problems 
and Ss with IQs> 100. No significant difference was found for Ss>9 years. 
Clinicians were urged to be aware of the apparently facilitating effects of Ss 
explaining PA stories when interpreting the WISC. 


In clinical practice, many users of Wechsler 
tests routinely ask Ss to describe the stories 
depicted in the Picture Arrangement (PA) 
series after Ss have arranged the cards in each 
series. This practice is a deviation from stan- 
dard procedure, and potentially can alter Ss 
performance on the PA subtest. In their re- 
view of procedural variations in intelligence 
testing, Sattler and Theye (1967) mentioned 
no study dealing with this variation, nor were 
the present authors able to locate such a 
study in the literature. 

The purpose of the present study was to 
determine what effect asking Ss to describe 
WISC PA stories has on Ss’ PA performance. 


METHOD 
Subjects 


Almost all children referred for psychological eval- 
uation at Brooke General Hospital are administered 
the age-appropriate Wechsler test. For the present 
study, the S population consisted of all children 
completing the WISC from January through July, 
1968. There were 135 Ss, 99 males and 36 females, 
ranging in age from 6 through 15 years. 

Referrals tend to be for one of two fairly discrete 


1The authors want to thank the psychology tech- 
nicians at the hospital, Sam Sherrod, John Stahl, 
Dennis Gurrie, Bruce Engel, Douglas Abell, James 
Bowman, and James Wickis, who administered all 
tests involved, and Roger Gardner who not only 
administered tests, but helped compile the data. 

2 At Brooke General Hospital at the time the study 
was conducted. Requests for reprints should be sent 
to Jeffrey H. Golland, 145 Fourth Avenue, New 
York, New York 10003. 


reasons. Children are referred either for academic- 
intellectual problems (A-I) or behavioral-emotional 
problems (B-E). The appropriate determination is 
made directly from the referral. A-I Ss are those 
referred specifically for evaluation of intellectual 
functioning to meet state requirements for special 
education or to confirm a diagnosis of retardation 
or minimal brain dysfunction, In the B-E category 
are Ss referred for assessment of dynamics and 
recommendations for treatment. Such Ss represent a 
variety of psychological disturbances, such as enure- 
sis, uncontrollable acting out, poor peer relations, 
or withdrawn, depressive behavior. The two authors 
were in agreement on the appropriate category for 
all Ss used in the experiment. 

Prior to the initiation of this study in April 1968, 
clinic practice was for A-I referrals to tell PA stories 
for their projective value; B-E Ss completed more 
extensive projective tests and were not asked for 
PA stories. This procedure was altered when the 
study was begun, resulting in 51 A-I Ss and 27 B-E 
Ss who told stories and 31 A-I and 26 B-E Ss who 
did not. 

All testing was performed by psychology tech- 
nicians who were specially trained in the adminis- 
tration and scoring of the tests, and who were 
unaware of the rationale of the experiment. Stories 
were obtained by asking S, after each trial, to 
explain what the PA pictures portrayed. Scoring was 
independent of these explanations. 


RESULTS 


Before any difference in PA scores between 
administrative groups can be attributed to 
procedural variation, it must be assumed that 
the two groups would perform similarly if 
tested under similar conditions, that is, the 
two groups must have similar IQs. To insure 
that such was the case, a matching procedure 
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TABLE 1 
Mean PA SCORE DIFFERENCES BETWEEN 
Matcuep Pairs 
Item N Mois t b 
All Ss 42 2.12 3.13 <.01 
B-E referrals 19 2.00 2.67 | <.02 
A-I referrals 23 2.18 1.93 | <.10 
IQ > 100 it 2.09 2.11 <.10 
IQ < 100 30 2.20 2.44 <.05 
Age <9 20 180 | 413 | <.001 
BAge > 9 21 233 | 158 | >10 


Note.—Positive Mpirt's indicate higher scores for Ss who 
explained PA stories. 


was used by which Ss were matched based on 
Full Scale IQ. 

The use of post hoc matching was particu- 
larly suspect in this case due to the contami- 
nation of the matching variable by the depen- 
dent variable. The result of such contamina- 
tion would be to suppress the appearance of 
a PA difference and bias the experiment in 
favor of the null hypothesis, For lack of a 
better control, the experiment was conducted 
with a conservative bias. 

Criteria for matching were three: (a) Age: 
All Ss were matched with someone within a 
year of the same age. Age at last birthday was 
used. (b) Reason for referral: Each was cate- 
gorized into one of the two types of referrals 
and matched with someone of that type. 
(c) Level of intellectual functioning: Each S 
was matched with someone whose Full Scale 
IQ was within five points of his own. 

Using these criteria, all Ss who could be 
were matched, 62 males and 22 females, for 
a total of 42 pairs, and these pairs were used 
in all comparisons. 

The statistics for the unmatched Ss indi- 
cated that the method of selection of type 
of administration was not random. Unmatched 
Ss who told PA stories had an average IQ 
nearly seven points higher and PA scores of 
three points higher than control Ss, emphasiz- 
ing the necessity for matching. 

The statistical test for all comparisons was 
the ¢ test for correlated samples (McNemar, 
1962). Reported probabilities are all for two- 
tail tests. Table 1 contains mean differences 
between PA scaled scores of Ss who told 
stories and those who did not. 

The first comparison was for the total 
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sample of pairs. Mean difference was 2.12 
scaled score points, a difference significant at 
the .01 level. Explaining PA stories seems to 
facilitate PA performance. 

Mean difference for Ss referred for B-E 
problems was 2.00, significant at the .02 level. 
Mean differences for A-I Ss was 2.18, with 
< .10. Even though the magnitude of dif- 
ference is similar for both groups, the differ- 
ence seems more reliable for the B-E Ss. 

For Ss with Full Scale IQs 100 or greater, 
the mean difference was 2.09, p< .10. Mean 
difference for Ss with Full Scale IQs below 
100 was 2.20, significant at the .05 level, 
suggesting that the effect is more reliable for 
the less intelligent Ss. 

Mean difference for Ss 9 years or less was 
1.80, significant at the .001 level. Mean dif- 
ference for Ss 10 years or older was 2.33, with 
p> .10, Although the difference is larger for 
the older children, the effect is more reliable 
for the younger ones. 


Discussion 


Although results are equivocal with certain 
subsamples, it seems clear that having Ss 
explain PA stories does facilitate performance. 
Perhaps Ss who learn they must justify their 
responses are less careless or attend more 
closely to the task. Whatever the reason, use 
of this procedural variation with clinical cases 
tends to increase S’s scores by as much as 2 
scaled score points, which in turn increases — 
Performance IQ by from 2 to 3 points and — 
Full Scale IQ from 1 to 3 points. Except in 


those rare cases for which an exact numeri- 


cal estimate of intelligence is needed, these 
increments are in themselves insignificant. 
However, when clinicians engage in profile 
analysis or scale comparisons, a differ- 
ence of 2 or 3 points is clinically important. 
Clinicians who do require PA stories on the 
WISC should be aware that they are ob- 
taining higher PA scores than if they followed 
standard procedure, and should interpret find- 
ings accordingly. 
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A recent study of the relationship between 
Extraversion, Manifest Anxiety (MA), 


jorted a correlation of .87 between MA and 
-S (Golin, Herron, Lakota, & Reineck, 


ible effects of item overlap, they concluded 
hat these two scales have practically the 


total of 29 items common to both scales. Such 
extensive overlap might be expected to in- 
~ fluence the magnitude of the relationship to a 
gnificant degree. 
The present study investigated the effects 


i METHOD 


r pe The Ss were 386 males who were evaluated at the 
Des Moines Veterans Administration Center as ap- 
plicants for outpatient psychotherapy during the 
riod from 1955 to 1964 and for whom complete 
MPIs were available. The data for these Ss were 
their scores on the R-S scale, the MA scale, the R-S 
ale omitting the 29 common items (R-S’), and the 
A scale minus the 29 common items (MA’). 


1 Requests for reprints should be sent to Patrick 
Sullivan, Department of Psychology, Albion Col- 
lege, Albion, Michigan 49224. 


REEATIONSHE OF MANIFEST ANXIETY TO REPRESSION- 
SENSITIZATION ON THE MMPI 
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This study investigated the effect of item commonality on the relationship 
between the Manifest Anxiety (MA) and Repression-Sensitization (R-S) 
scales with respect to an earlier conclusion that the scales have similar psy- 
chological meaning. Although it was found that the 29 common items did 
make a significant contribution, a strong relationship was found when these 
common items were omitted. It was concluded that MA and R-S do appear 
to have similar, if not identical, psychological meaning and that research find- 
ings based on MA are probably applicable to R-S, and vice versa. If so, this 
finding would lend support to recent studies which indicate’ that sensitizers 
perform with more facility than do repressors on certain types of tasks, 


The group form numbers of the 29 items omitted 
from R-S’ and MA’ include: 13 (290), 18, 23 (288), 
32, 43, 67, 86, 107, 142, 158, 191%, 217, 238, 241%, 
242, 263*, 301, 317 (362), 321, 322, 337, 340, 352, 
361, 397, 418, 431, 499, 555. (Those items marked 
with an asterisk (*) are scored when answered True 
on MA and when answered False on R-S.) Items 13, 
23, and 317 on MA are identical to items 290, 288, 
and 362 on R-S. 


RESULTS 


The mean score of 78.87 (SD = 23.66) on 
R-S for this sample is somewhat greater than 
that reported by Golin et al. (1967). For MA, 
the mean score was 25.96 (SD = 9.79). The 
omission of the 29 common items resulted in 
a mean decrease of 16.58 (t= 52.45, df= 
385, p < .001) for R-S and 16.03 (t = 47.21, 
df = 385, p < .001) for MA. Although the 
average decreases were not significantly differ- 
ent from each other, these figures represent 
32% of the MA scale as compared to 10. big 
of the R-S scale. 

For this sample, the obtained correlation of 
91 (p< .001) between R-S and MA was 
stronger than that reported by Golin et al. 
When the common items were excluded from 
both scales, the relationship diminished sig- 
nificantly (z = 7.39, p < .001), as expected, 
but continued to be quite high (r= .76, p 
< .001). Similarly, the effect of omitting the 
common items from only one scale resulted in 
a diminished but significant degree of rela- 
tionship between the two scales. The correla- 
tions obtained from this latter comparison 
were r= .86 (p< .001) for MA and R-S' 
and r = .78 (p < .001) for R-S and MA’. 
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DISCUSSION 


It seems that although the 29 common items 
significantly contribute to the relationship be- 
tween MA and R-S, there is a high degree of 
relationship between the scales. These find- 
ings would support the conclusion of Golin et 
al. (1967) that MA and R-S have similar psy- 
chological meaning. Therefore, it seems pos- 
sible that research findings using the MA scale 
may be applicable to the R-S scale and vice 
versa. If so, the finding that high-MA Ss learn 
better than low-MA Ss (Spence, 1960) would 
tend to support the more recent findings that 
sensitizers fun¢tion’ with More facility than 
do repressors in stress and ego-involving situ- 
ations (Hare, 1966; Lazarus & Alfert, 1964; 
Lomont, 1965; Petzel & Gynther, 1968; Tem- 
pone, 1964). Further research may be fruit- 
ful to an understanding of the MA-R-S re- 
lationship. 
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